Privacy protection method for associated classification data sequence based on Puffer fish framework

A technology of privacy protection and association classification, applied in the field of privacy protection for association classification data sequences, it can solve the problems of privacy leakage, inapplicability of associated data, lack of achievable mechanisms, etc., and achieve the effect of protecting privacy.

Inactive Publication Date: 2020-04-07
SUN YAT SEN UNIV
View PDF1 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Although differential privacy is a widely used definition of privacy at present, it cannot be applied to the case of data association, because the model assumes that each individual in the data set is independent of each other, which will lead to inconsistency when applying differential privacy directly to the associated data scenario. Cannot meet the privacy definition originally set, resulting in privacy leakage
[0003] Pufferfish's proposal can solve the correlation between data, because it can use the set D to represent the background knowledge owned by all attackers, that is, it can generate all possible probability distributions of the data set, but its shortcoming is that it lacks specific possibilities. The implementation mechanism, because all possible probability distributions need to be considered, the computational complexity is too high, and it is difficult to express all distributions
At present, there are some practical mechanisms for specific data sets, but they can only protect the correlation between a single sequence attribute, and cannot be applied to the scenario provided by the present invention
[0004] The disadvantage of the current privacy protection methods for linked data is that they only consider the relationship of one dimension, such as the relationship between individuals or between attributes.
However, many real data sets are composed of multiple related sequences, such as the time series data of different people. The sequences themselves are highly correlated, and the sequences between different people are also correlated with each other. Therefore, the existing privacy methods for linked data Can not be applied to the scene provided by the present invention

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Privacy protection method for associated classification data sequence based on Puffer fish framework
  • Privacy protection method for associated classification data sequence based on Puffer fish framework
  • Privacy protection method for associated classification data sequence based on Puffer fish framework

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0031] A privacy protection method for associated classification data sequences based on the Pufferfish framework can be used for privacy protection of multiple associated classification sequences. The technical solution is divided into three parts. The multidimensional Markov chain model accurately describes the two kinds of correlations between data, and finally proposes an achievable privacy protection mechanism to add appropriate noise protection privacy.

[0032] Part 1: Formulate the definition of privacy protection (Pufferfish privacy protection framework), such as figure 1 shown.

[0033] The Pufferfish privacy protection framework is a generalization of differential privacy proposed in 2014, which can adapt to the correlation between data, and can customize the content of privacy protection according to needs. The Pufferfish framework consists of three parts. Secret S: represents a collection of sensitive information that needs to be protected, that is, S is used to ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a privacy protection method for an associated classification data sequence based on a Puffer fish framework. Firstly, a Puffer fish framework is introduced to formulate a strict privacy protection definition, then, a multi-dimensional Markov chain model is utilized to accurately describe two kinds of relevance between data, and finally, an achievable privacy protection mechanism is proposed to add appropriate noise to protect privacy. According to the method, a privacy protection definition is formulated in a two-dimensional associated data scene based on a puffer fishprivacy protection framework; a multi-dimensional Markov chain model is adopted to reasonably represent two-dimensional relevance between data, a mechanism capable of adding noise is provided in combination with privacy protection definition, and it is guaranteed that the state of each individual at each moment is private data and an attacker cannot distinguish the state while the overall trend of aggregation query analysis is achieved. According to the method, privacy protection of two types of correlation between individuals and correlation in each sequence is considered at the same time, so that the privacy data of the individuals can be protected while the associated data set is available.

Description

technical field [0001] The invention belongs to the field of privacy protection and information security, and more specifically relates to a privacy protection method based on a Pufferfish framework for associated classification data sequences. Background technique [0002] Although differential privacy is a widely used definition of privacy at present, it cannot be applied to the case of data association, because the model assumes that each individual in the data set is independent of each other, which will lead to inconsistency when applying differential privacy directly to the associated data scenario. It cannot meet the privacy definition originally set, resulting in privacy leakage. [0003] Pufferfish's proposal can solve the correlation between data, because it can use the set D to represent the background knowledge owned by all attackers, that is, it can generate all possible probability distributions of the data set, but its shortcoming is that it lacks specific pos...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F21/62
CPCG06F21/6227G06F21/6245
Inventor 习芷铖桑应朋
Owner SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products