A data normalization method, device and medium for identity recognition

A kind of identification and data technology, applied in the direction of electronic digital data processing, structured data retrieval, database design/maintenance, etc., can solve the problems of scattered data, insufficient data coverage, no unified identification features, etc., to improve accuracy, Solve the effect of inaccurate and incomplete identity normalization

Active Publication Date: 2022-05-20
XIAMEN MEIYA PICO INFORMATION +1
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] In the era of mobile Internet, massive amounts of data are generated every day, such as accommodation, driving, travel in real life, instant messaging in the virtual world, third-party payment, etc.; these data are large in volume and have no unified identification characteristics, resulting in various types of data Scattered and irrelevant, how to automatically analyze and normalize the identities of relevant data has become a difficulty in improving the analysis ability and efficiency of massive data
[0003] Since the data is constantly increasing with the increase of various application types, and there is no unified identification feature, the existing identity normalization methods currently on the market mainly use manual configuration to judge the relationship between data sources one by one. These technologies cannot meet the complex analysis needs in reality, and their technical defects are as follows:
[0004] 1) The manual configuration method requires a lot of business research time, and is prone to errors and omissions, which greatly affects the efficiency and quality of data analysis work;
[0005] 2) Single matching rule: In many cases, the data cannot be related only by a single rule, resulting in insufficient coverage of data that can be related in the end, which seriously affects the use effect and user experience of the system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data normalization method, device and medium for identity recognition
  • A data normalization method, device and medium for identity recognition
  • A data normalization method, device and medium for identity recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, rather than to limit the invention. It should also be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

[0044] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0045] figure 1 A data normalization method for identification of the present invention is shown, the method includes:

[0046] Extracting step S101, extracting identity attribute information contained in data records from multiple data sources, and constructing a correspon...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a data normalization method, device and medium for identity recognition. The method first constructs a corresponding identity attribute data set; then judges whether there is a matching rule for identifying the identity attribute data set, and if so, uses the rule matching method to match Identify the identity attribute data set, if not, use the path matching method to identify the identity attribute data set; then calculate the credibility of at least two data records in the obtained identification results, if the credibility reaches a certain threshold, then The at least two data records are stored in the database after being normalized. According to the different characteristics of the data records, the present invention adaptively selects whether to use the rule matching algorithm or the path matching algorithm, can quickly normalize the identities conforming to the rule characteristics, and can more comprehensively normalize the identities without obvious consistent characteristics , this method will greatly improve the accuracy of identity normalization, and a rule matching algorithm and a path matching algorithm are proposed.

Description

technical field [0001] The invention relates to the technical field of computer data processing, in particular to a data normalization method, device and storage medium for identification. Background technique [0002] In the era of mobile Internet, massive amounts of data are generated every day, such as accommodation, driving, travel in real life, instant messaging in the virtual world, third-party payment, etc.; these data are large in volume and have no unified identification characteristics, resulting in various types of data Scattered and irrelevant, how to automatically analyze and normalize the identities of related data has become a difficulty in improving the ability and efficiency of massive data analysis. [0003] Since the data is constantly increasing with the increase of various application types, and there is no unified identification feature, the existing identity normalization methods currently on the market mainly use manual configuration to judge the rela...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/21G06F16/2458
CPCG06F16/21G06F16/2465
Inventor 周成祖叶立震鄢小征林文楷魏超许琨
Owner XIAMEN MEIYA PICO INFORMATION
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products