Approximate data processing method and device, medium and electronic equipment

An approximate data processing technology, applied in the field of data processing, that addresses the instability of LSH-based processing, its high demands on the mapping results, and the unpredictable time it takes.

Pending Publication Date: 2020-06-05
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

The LSH algorithm depends on several hyperparameters, including the random numbers in the hash function, so how data is distributed across buckets depends heavily on the random numbers chosen. When the LSH mapping results are used for subsequent data processing tasks and a large amount of data must be processed, the task places high demands on the mapping results, which makes the processing unstable.
On the one hand, if a bucket holds too much data, the efficiency gain from the LSH algorithm is greatly reduced; on the other hand, for the same set of data, the time taken to execute a data processing task is uncertain, because it depends on the amount of data in each bucket.

Embodiment Construction

[0033] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the present invention. Rather, they are merely examples of apparatuses and methods consistent with aspects of the invention as recited in the appended claims.

[0034] Furthermore, the drawings are merely schematic illustrations of the present disclosure and are not necessarily drawn to scale. The same reference numerals in the drawings denote the same or similar parts, and thus repeated descriptions thereof will be omitted. Some of the block diagrams shown in the drawings are functional entities and do not necessarily correspond to physically or logic...

Abstract

The invention relates to the field of data processing and discloses an approximate data processing method and device, a medium, and electronic equipment. The method comprises: obtaining to-be-processed data; obtaining a vector corresponding to the to-be-processed data; performing a hash operation on the vector using each locality-sensitive hash function in a locality-sensitive hash function family to obtain the mapping values corresponding to the vector; repeating a coverage-group construction step a first predetermined number of times to obtain a plurality of coverage groups, wherein the construction step builds a coverage group based on the mapping values corresponding to the vector and the locality-sensitive hash functions used to hash it; and integrating the plurality of coverage groups to obtain the final coverage to which the to-be-processed data belongs, with to-be-processed data belonging to the same final coverage taken as approximate data. The method avoids the instability in the time consumed when processing a large amount of approximate data, and improves overall data processing efficiency.
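The steps in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented procedure: the abstract does not say which hash family is used, how a coverage group is built from the mapping values, or how the groups are integrated. The sketch assumes a p-stable style hash `floor((a·v + b) / width)`, one hash function per construction round, and an integration rule in which two items share a final coverage only if they share a group in every round. All function names and parameters are hypothetical.

```python
import math
import random
from collections import defaultdict


def make_hash_family(size, dim, width, seed):
    """Hypothetical LSH family: h(v) = floor((a . v + b) / width)."""
    rng = random.Random(seed)
    family = []
    for _ in range(size):
        a = [rng.gauss(0.0, 1.0) for _ in range(dim)]  # random projection
        b = rng.uniform(0.0, width)                    # random offset
        family.append((a, b, width))
    return family


def mapping_values(vector, family):
    """Hash one vector with every function in the family."""
    values = []
    for a, b, width in family:
        dot = sum(ai * xi for ai, xi in zip(a, vector))
        values.append(math.floor((dot + b) / width))
    return values


def build_coverage_group(all_values, function_index):
    """Assumption: one round groups items by a single function's value."""
    groups = defaultdict(list)
    for item, values in enumerate(all_values):
        groups[values[function_index]].append(item)
    return dict(groups)


def integrate(coverage_groups, num_items):
    """Assumption: items share a final coverage only if they share a
    group in every round."""
    keys = [[] for _ in range(num_items)]
    for groups in coverage_groups:
        for value, members in groups.items():
            for item in members:
                keys[item].append(value)
    final = defaultdict(list)
    for item, key in enumerate(keys):
        final[tuple(key)].append(item)
    return list(final.values())


def approximate_groups(vectors, family, rounds):
    """Run the whole sketch: hash, build groups per round, integrate."""
    all_values = [mapping_values(v, family) for v in vectors]
    coverage_groups = [
        build_coverage_group(all_values, r % len(family))
        for r in range(rounds)
    ]
    return integrate(coverage_groups, len(vectors))
```

With this rule, identical vectors always end up in the same final coverage, while increasing `rounds` makes it less likely that distant vectors do. Other integration rules (for example, merging groups that overlap) would behave differently.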

Description

Technical field

[0001] The present disclosure relates to the technical field of data processing, and in particular to an approximate data processing method, device, medium and electronic equipment.

Background

[0002] At present, in data processing, the commonly used scheme for quickly finding data similar to a given piece of data is locality-sensitive hashing (LSH), which maps high-dimensional data to low-dimensional data and maps similar data into the same bucket. After mapping, two data points that are adjacent in the original data space remain adjacent in the new data space with high probability, while two data points that are not adjacent become adjacent in the new data space only with very small probability. However, using the LSH algorithm requires several hyperparameters to be specified. These include the random numbers in the hash function, so the effect of mapping different ...
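As a rough illustration of the bucketing idea in the background, the sketch below uses random-hyperplane (sign) hashing, one common LSH family. The source does not say which family is intended, so this choice, along with the function names and the `seed` parameter, is an assumption made for the example. The `seed` stands in for the random numbers the background mentions: changing it changes which bucket each vector falls into.

```python
import random
from collections import defaultdict


def make_hyperplanes(num_bits, dim, seed):
    """Draw num_bits random hyperplanes; the seed fixes the random numbers."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def lsh_signature(vector, hyperplanes):
    """Map a vector to a short bit tuple: one bit per hyperplane side."""
    return tuple(
        1 if sum(w * x for w, x in zip(plane, vector)) >= 0 else 0
        for plane in hyperplanes
    )


def bucketize(vectors, hyperplanes):
    """Group vector indices by signature; each signature is one bucket."""
    buckets = defaultdict(list)
    for index, vector in enumerate(vectors):
        buckets[lsh_signature(vector, hyperplanes)].append(index)
    return dict(buckets)
```

Vectors pointing in similar directions tend to land on the same side of most hyperplanes and so tend to share a bucket. Bucket sizes depend on both the data and the seed, which is the source of the variable processing time the background describes.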

Application Information

IPC(8): G06F16/22, G06F16/28
CPC: G06F16/2255, G06F16/285
Inventor: 冯晨, 王健宗, 彭俊清
Owner PING AN TECH (SHENZHEN) CO LTD