Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Medical autocorrelation time series data differential privacy release method

A time-series data and differential privacy technology, applied in digital data protection, electrical digital data processing, instruments, etc., can solve the problems of long length of noise-added time-series data, incapable of quantitative analysis of privacy protection level, and no processing method proposed

Pending Publication Date: 2020-10-20
WUHAN UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these models have two shortcomings. First, these models cannot provide sufficient security guarantees, and the models need to be continuously improved for the emergence of new attacks.
Second, these models cannot strictly prove the level of privacy protection, and when the model parameters change, the level of privacy protection cannot be quantitatively analyzed
The length of the noise-added time series data processed in this way is much longer than the original data, and no further processing method is proposed.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Medical autocorrelation time series data differential privacy release method
  • Medical autocorrelation time series data differential privacy release method
  • Medical autocorrelation time series data differential privacy release method

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0087] In the following, the specific implementation steps of the present invention will be described in detail by taking the blood glucose data of the human body in the medical time-series data as an example. A hospital wants to publish the blood sugar content of people in the area for a period of time to check the impact of the current season's diet on people's blood sugar levels, but it cannot release the real value, otherwise it may cause the leakage of personal real information, which will affect personal life . For example, when buying insurance, if the insurance company knows your blood sugar level in the recent period, if it has been high, you may have a higher probability of health risks, and you will need to increase the premium. It is known that the medical blood glucose data to be released by the hospital includes blood glucose data of 2,000 people, 92 days from November to January, and blood glucose is measured 4 times a day for a total of 368 time points. That i...

no. 2 example

[0154] For the time-series data that has just been added to the database recently, for example, the hospital has added the blood glucose data collected by everyone in the first week of February. If you want to release the blood glucose data in the first week of February, you need to perform the following steps .

[0155] Step 1: Construct personal medical time series data;

[0156] The personal medical time series data described in step 1 is:

[0157] x i ={d i,(j-1)*T+k}

[0158] i∈[1,N]

[0159] j∈[1,K]

[0160] k∈[1,T]

[0161] Among them, X i is the medical time series data of the i-th person, d i,(j-1)*T+k is the medical data of the kth point in the jth sampling period of the i-th person, N=2000 is the number of individuals, K=7 is the number of sampling periods, T=4 is the number of sampling points in the sampling period, K, T Based on actual medical data;

[0162] Step 2: Divide the personal medical time-series data into multiple disjoint and equal-length pers...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a medical autocorrelation time series data differential privacy release method. The method comprises the following steps: firstly, constructing personal medical time series data; dividing the personal medical time series data into a plurality of disjoint and equal-length personal medical time series data sub-sequences through a set sliding window; calculating a normalized autocorrelation function of the sub-sequence according to the sub-sequence, and calculating the periodic sensitivity of the sub-sequence according to a query function; and generating Laplace noise sequences corresponding to the sub-sequences, splicing all the noise sequences to obtain Laplace noise sequences corresponding to the sub-sequences, and adding the Laplace noise sequences to the original time series data to obtain a final to-be-released result. According to the method, the privacy of the user is protected from being leaked to a greater extent, and recently-arrived medical time series data can be processed. According to the invention, on the premise of ensuring that the relevancy of the released data is the same as that of the original data, the security of medical time series dataprivacy protection and the time series data processing efficiency can be effectively improved.

Description

technical field [0001] The invention belongs to the field of medical data protection, and in particular relates to a method for publishing differential privacy of medical autocorrelation time series data. [0002] technical background [0003] Due to the purpose of teaching and scientific research, medical institutions often need to collect and release a large amount of various medical sensitive data. Time-series data in medical datasets refers to a series of data values ​​observed at consecutive time stamps. Using the correlation between data values ​​to analyze and mine time series data can bring huge benefits to the government, enterprises and social public services, such as in disease monitoring, monitoring and analyzing the physical condition of patients can prevent the outbreak of certain diseases . [0004] The above example shows that publishing medical time series data is of great significance to knowledge discovery and acquisition. However, if the data holders di...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/62
CPCG06F21/6254G06F21/6245
Inventor 刘树波赵晶蔡朝晖涂国庆
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products