Heterogeneous power utilization data publishing method based on clustering anonymization and differential privacy protection

This technology applies differential privacy to electricity data in the field of information technology security. It addresses problems such as restricted applicability of differential privacy, the inability to obtain accurate analysis results from privacy-protected electricity data, and the loss of availability of published data. It achieves flexible privacy-preserving cluster analysis while ensuring both privacy and availability.

Pending Publication Date: 2022-02-25
CHINA SOUTHERN POWER GRID DIGITAL GRID RES INST CO LTD +2

AI Technical Summary

Problems solved by technology

[0007] In the non-interactive electricity consumption information collection scenario, when the noise mechanism of differential privacy is used to protect the data set, the heterogeneity of the electricity consumption data causes the noise mechanism to introduce large perturbation errors. As a result, the privacy-protected electricity consumption data cannot provide accurate results during cluster analysis, and the published data loses its availability. This directly restricts the application of differential privacy in non-interactive privacy-preserving data publishing.
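
The effect of heterogeneity on noise can be illustrated with a small sketch. It assumes the Laplace mechanism, which is one standard differential privacy noise mechanism; the source does not name the mechanism it uses. The function names, value ranges, and sample readings are hypothetical. The point is only that the noise scale grows with the value range that a single record can span.

```python
import random

def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponential draws."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)

def noisy_sum(values, lower, upper, epsilon, rng):
    """Release a sum under epsilon-DP (assumed Laplace mechanism).

    Values are clipped to [lower, upper], so adding or removing one
    record changes the sum by at most max(|lower|, |upper|).
    """
    clipped = [min(max(v, lower), upper) for v in values]
    sensitivity = max(abs(lower), abs(upper))
    scale = sensitivity / epsilon
    return sum(clipped) + laplace_noise(scale, rng), scale

rng = random.Random(0)
# Hypothetical homogeneous group: daily usage bounded by 20 kWh.
_, scale_homogeneous = noisy_sum([3.0, 5.5, 8.0], 0.0, 20.0, 1.0, rng)
# Hypothetical heterogeneous set: households mixed with industrial users.
_, scale_mixed = noisy_sum([3.0, 5.5, 4200.0], 0.0, 5000.0, 1.0, rng)
print(scale_homogeneous, scale_mixed)
```

Under these assumptions, the mixed data set needs a noise scale 250 times larger than the homogeneous group for the same privacy budget, which is the kind of perturbation error the paragraph describes.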



Examples


Embodiment Construction

[0032] In order to make the above-mentioned features and advantages of the present invention more comprehensible, the following specific embodiments are described in detail in conjunction with the accompanying drawings.

[0033] The embodiment of the present invention discloses a method for publishing heterogeneous electricity consumption data based on clustering anonymization and differential privacy protection. As shown in Figure 1, the method includes the following steps:

[0034] 1. Data preprocessing.

[0035] First, according to the cluster analysis request of the data user, the original data set D is clustered with the k-means algorithm or the DBSCAN algorithm to obtain the labeled data set D*, which includes the cluster structure and class labels. The attributes of the raw data records in D* are denoted r* = {A_1, ..., A_d, Class}, where Class represents the class label of each raw data record r_i in D*. The class labels in the anonymization process can help t...
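
The labeling step can be sketched as follows. This is a minimal pure-Python k-means, assuming numeric attributes and a caller-chosen k; the function name `kmeans_labels`, the sample records, and the seed are hypothetical, and the source does not specify an implementation (DBSCAN is the stated alternative). Each output record takes the form (A_1, ..., A_d, Class).

```python
import random

def kmeans_labels(records, k, iters=20, seed=0):
    """Return one cluster label per record using plain k-means."""
    rng = random.Random(seed)
    centers = rng.sample(records, k)   # initial centers drawn from the data
    labels = [0] * len(records)
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((x - y) ** 2 for x, y in zip(r, centers[c])))
            for r in records
        ]
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [r for r, lab in zip(records, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(col) / len(members)
                                   for col in zip(*members))
    return labels

# Hypothetical records D with two attributes (A_1, A_2).
D = [(1.0, 2.0), (1.2, 1.8), (0.9, 2.1),
     (50.0, 60.0), (52.0, 58.0), (49.0, 61.0)]
labels = kmeans_labels(D, k=2)
# Labeled data set D*: each record gains a trailing Class attribute.
D_star = [r + (lab,) for r, lab in zip(D, labels)]
print(D_star)
```

Appending the label as an ordinary attribute is what lets later steps treat cluster membership like a class column, in line with the r* notation above.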


Abstract

The invention discloses a heterogeneous power utilization data publishing method based on clustering anonymization and differential privacy protection, and relates to the field of information technology security. To protect the privacy of heterogeneous power utilization data, the method converts the cluster analysis problem into a classification problem: by using class labels, the cluster structure of the original data can be subjected to the generalization anonymization mechanism and noise addition at the same time. The method then publishes a power transaction data set that satisfies ε-differential privacy. This realizes flexible privacy-preserving cluster analysis, improves the accuracy of cluster analysis on the published data, ensures the privacy and availability of the various types of data, and provides reliable data for power utilization data analysis.
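
One way to picture generalization and noise addition working together with class labels is a noisy histogram over generalized cells. The sketch below is an illustration under stated assumptions, not the patented procedure, whose details are not given in this text: it assumes a single numeric attribute, fixed-width intervals as the generalization, and Laplace noise on counts. The function name `publish_histogram` and all parameters are hypothetical.

```python
import random
from collections import Counter

def publish_histogram(labeled, classes, upper, width, epsilon, rng):
    """Release noisy counts per (interval, class) cell.

    Each record falls in exactly one cell, so adding or removing a
    record changes one count by 1; Laplace(0, 1/epsilon) noise on
    every cell of the fixed domain then gives epsilon-DP.
    """
    counts = Counter()
    for value, cls in labeled:
        clipped = min(max(value, 0.0), upper - 1e-9)  # keep inside [0, upper)
        lo = int(clipped // width) * width            # generalize to an interval
        counts[(lo, lo + width, cls)] += 1
    released = {}
    lo = 0
    while lo < upper:
        for cls in classes:
            cell = (lo, lo + width, cls)
            # Difference of two exponentials is Laplace(0, 1/epsilon).
            noise = rng.expovariate(epsilon) - rng.expovariate(epsilon)
            released[cell] = counts[cell] + noise
        lo += width
    return released

# Hypothetical labeled records: (usage in kWh, Class).
records = [(3.0, 0), (7.5, 0), (120.0, 1), (480.0, 1)]
out = publish_histogram(records, classes=[0, 1], upper=500, width=100,
                        epsilon=1.0, rng=random.Random(0))
print(len(out))
```

Noise is added to every cell of the fixed domain, including empty ones, because releasing only the non-empty cells would reveal which cells are occupied.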

Description

Technical field

[0001] The invention relates to the field of information technology security, in particular to a method for publishing heterogeneous electricity consumption data based on anonymization and differential privacy.

Background

[0002] With the rapid improvement of the smart grid's collection, processing, and storage capabilities, the amount of collected electricity consumption data has increased tremendously. Applying big data analysis and mining technology to the various types of collected electricity consumption data makes it possible not only to analyze personal electricity consumption accurately but also to provide users with personalized electricity services. However, raw electricity usage data often contains sensitive information about individuals, and publishing it directly leads to personal privacy leakage. Therefore, how to accurately analyze the user's electricity consumption data while protecting the user's privacy f...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06V10/762, G06K9/62
CPC: G06F18/23213
Inventor: 奚建飞, 徐欢, 雷美炼, 张锐, 沈博, 孙一帆
Owner: CHINA SOUTHERN POWER GRID DIGITAL GRID RES INST CO LTD