Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

An Address Desensitization Method Preserving Distribution Characteristics

A technology of distribution characteristics and desensitization, applied in the field of data processing, can solve the problems of low security, limited analysis value, large address distribution granularity, etc., to achieve the effect of strong analyzability, short desensitization time, and high analysis value

Active Publication Date: 2022-04-08
ZHEJIANG UNIV OF TECH
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, while desensitizing the address information, the analysis value of the address information should be preserved as much as possible. Methods a and b do effectively desensitize the address information, but the retained analysis value is also very limited. The granularity of address distribution If it is too large, it is difficult to dig deeper. If you adjust the masking or generalization granularity of desensitization, for example, only masking to the "road" level, that is, "rockery road, Gongshu District, Hangzhou City, Zhejiang Province**", although the analysis value It is well preserved, but it is also easy for people to "guess" the individual in reality by contacting other information, so the degree of desensitization is too low and the security is low
Method c can achieve desensitization effect by randomly shuffling addresses, but because of its randomness, the desensitization results of the same data source are different every time, which does not conform to the consistency of the desensitization process, and random The result of shuffling may not completely retain the distribution characteristics of the original address information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Address Desensitization Method Preserving Distribution Characteristics
  • An Address Desensitization Method Preserving Distribution Characteristics
  • An Address Desensitization Method Preserving Distribution Characteristics

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The invention will be further described below with reference to the accompanying drawings.

[0036] Refer Figure 1 ~ 5 A address desensitization method for retention distribution characteristics, the method comprising the steps of:

[0037] (1) Collect the address information to be desensitized and collected into the desensitization data set M1;

[0038] (2) The address information data set to be desensitized is compliant, and the unqualified address information is placed in an abnormal data set, and the compliance to desessive data set M2 is generated.

[0039] (3) Made the first phase of the first phase of the first phase to be mixed to be mixed, generated to be mixed data set M3:

[0040] Use the regular expression to find the location of the numbers in the data to be deesed, and then replace this number with "*";

[0041] (4) Generate a uniform random sequence according to the linear feedback shift register (LFSR):

[0042] Linear Feedback Shift Register (LFSR) consists...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

An address desensitization method that retains distribution characteristics, comprising the following steps: (1) collecting address information to be desensitized, and grouping it into a data set M1 to be desensitized; (2) performing a compliance check on the address information data set to be desensitized , put the unqualified address information into the abnormal data set, and generate the compliance data set M2 to be desensitized; (3) perform the first stage digital masking process on the compliance data set to be desensitized, and generate the data to be shuffled Set M3; (4) generate a uniform random sequence L1 according to the linear feedback shift register; (5) remove the number greater than the length M3 of the data set to be desensitized in the uniform random sequence generated in (4), to generate a shuffled sequence L2; (6) Shuffle the address data in M3 with the number in sequence L2 as the position index. The desensitization result of the invention well retains the distribution characteristics of the address data and has strong analyzability; the desensitization result can retain good consistency; the desensitization time is short and has high efficiency.

Description

Technical field [0001] The present invention relates to the field of data processing, and more particularly to a process desensitization method for reserving distribution features. Background technique [0002] In the era of big data, information is unquestionable "wealth." Regardless of the government unit or business, or every person, every day is interested in unintentional collection, storage, sharing data, and larger scale. This information is an invisible asset. But it is precisely because we have more and more data, our leaks are naturally getting higher and higher. In recent years, my country has strengthened the management and research of user data information protection. National convened a number of conferences, big data and data security that have been multi-data, data security, have become popular topics for banks and Internet companies. Guo Jiakai's "Data Deferribly: Sensitive Data Security Guard" proposed the important role of data desensitization in information se...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F21/62
CPCG06F21/6245
Inventor 孟利民梁泽楷应颂翔林梦嫚蒋维
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products