Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Similar crowd extension algorithm based on locality sensitive Hash algorithm

A locally sensitive hash and extended algorithm technology, applied in the field of information processing, can solve the problems of high cost of maintenance and improvement, large amount of calculation, difficult cold start, etc., so as to improve the speed and accuracy of calculation, reduce the cost of calculation, and reduce the The effect of computation

Pending Publication Date: 2021-08-20
上海垚亨信息科技有限公司
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The idea of ​​the common Look-alike algorithm is as follows: first, based on simple similarity calculations, such as Cosine (continuous value) or Jaccard (01 value), by calculating the similarity between two users, start from the seed user to find similar For users, the logic of this method is simple, but the amount of calculation is large, which makes the calculation cost high and the calculation accuracy is poor
The second method is based on logistic regression for supervised binary classification prediction. This method only requires linear calculations, and the algorithm complexity is low during online prediction. However, this method has difficulties in negative sample sampling and cold start. and other problems, resulting in poor calculation accuracy
The third is a segment-based approximate search system, that is, to tag users, aggregate user groups through tags and give candidate marketing targets. A mature tag system can bring better marketing effects, and it is very simple and easy to apply online. Fast, but this kind of system needs to spend a lot of resources to mine tags in advance. At the same time, the cost of later maintenance and improvement of the system is relatively high, and the calculation accuracy is poor.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Similar crowd extension algorithm based on locality sensitive Hash algorithm
  • Similar crowd extension algorithm based on locality sensitive Hash algorithm
  • Similar crowd extension algorithm based on locality sensitive Hash algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] A similar group extension algorithm based on locality-sensitive hashing algorithm of the present invention will be further described in detail below. The invention will now be described in more detail with reference to the accompanying drawings, in which preferred embodiments of the invention are shown, it being understood that those skilled in the art may modify the invention described herein and still achieve the advantageous effects of the invention. Therefore, the following description should be understood as the broad knowledge of those skilled in the art, but not as a limitation of the present invention.

[0042] In the interest of clarity, not all features of an actual implementation are described. In the following description, well-known functions and constructions are not described in detail since they would obscure the invention with unnecessary detail. It should be appreciated that in the development of any actual embodiment, numerous implementation details ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a similar crowd extension algorithm based on a locality sensitive Hash algorithm. An open source tool datasketch is adopted to calculate original data features so as to obtain weighted minimum Hash of feature vectors of all users, the calculation amount can be greatly reduced, the calculation speed and accuracy are improved, and meanwhile the calculation cost is reduced; and in addition, a local sensitive Hash model constructed by using an open source tool datasketch can be obtained according to the memory size of the memory and the requirement of calculation accuracy, so that the accuracy of the local sensitive Hash model is high.

Description

technical field [0001] The invention relates to the technical field of information processing, in particular to an extended algorithm for similar groups of people based on a local sensitive hash algorithm. Background technique [0002] In digital marketing, how to quickly and accurately find target customer groups is a challenging task in the Internet age. Look-alike is a general term for algorithms to find the most similar groups of people starting from seed users. As an important algorithm in the field of advertising, it can help advertisers efficiently target marketing groups. [0003] The idea of ​​the common Look-alike algorithm is as follows: first, based on simple similarity calculations, such as Cosine (continuous value) or Jaccard (01 value), by calculating the similarity between two users, start from the seed user to find similar For users, this method is logically simple, but the amount of calculation is large, which makes the calculation cost high and the calcul...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/51G06F16/9535G06Q30/02
CPCG06F16/51G06F16/9535G06Q30/0271
Inventor 葛永昌
Owner 上海垚亨信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products