A method and device for obtaining a collection of similar objects

A technique for obtaining sets of similar objects, applied in the computer field, that addresses missed similar objects, heavy consumption of time and computing resources, and difficult iterative updates, and that reduces computational complexity, speeds up computation, and improves accuracy.

Active Publication Date: 2021-11-02
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1

AI Technical Summary

Problems solved by technology

[0004] 1. The Hive SQL distributed method compares only objects that share a given attribute, so most other similar objects are missed.
[0005] 2. The computation consumes huge amounts of time and computing resources, which makes effective iterative updates difficult.
[0006] 3. Data features are unevenly distributed, which reduces the accuracy of the resulting similar object set.



Embodiment Construction

[0027] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0028] Basic introduction to the minimum hash (MinHash) algorithm: consider four objects, S1, S2, S3, and S4, where S1={a,d}, S2={c}, S3={b,d,e}, and S4={a,c,d}, and a, b, c, d, e are features of the objects.

[0029] Then the feature matrix of these four objects is shown in Table 1:

[0030] Table 1 Objects and their...
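The example in [0028] can be sketched in Python as follows. This is a minimal illustration, not the patent's implementation: the function names, the number of permutations, and the random seed are all assumptions introduced here. It builds the feature matrix (rows are features, columns are objects), computes exact Jaccard similarity, and estimates it with MinHash signatures built from random row permutations.

```python
import random

# Objects and features taken from paragraph [0028].
objects = {
    "S1": {"a", "d"},
    "S2": {"c"},
    "S3": {"b", "d", "e"},
    "S4": {"a", "c", "d"},
}
features = ["a", "b", "c", "d", "e"]


def feature_matrix(objects, features):
    """Rows are features, columns are objects (sorted by name); 1 = feature present."""
    names = sorted(objects)
    return [[1 if f in objects[n] else 0 for n in names] for f in features]


def jaccard(x, y):
    """Exact Jaccard similarity of two feature sets."""
    union = x | y
    return len(x & y) / len(union) if union else 0.0


def make_permutations(features, n, seed=0):
    """Hypothetical helper: n random permutations of the feature rows."""
    rng = random.Random(seed)
    perms = []
    for _ in range(n):
        order = list(range(len(features)))
        rng.shuffle(order)
        perms.append(dict(zip(features, order)))
    return perms


def minhash_signature(obj, permutations):
    """For each permutation, keep the smallest permuted row index among the
    object's features. Assumes the object has at least one feature."""
    return [min(perm[f] for f in obj) for perm in permutations]


def estimate_similarity(sig_x, sig_y):
    """Fraction of signature positions on which the two objects agree."""
    matches = sum(1 for a, b in zip(sig_x, sig_y) if a == b)
    return matches / len(sig_x)


perms = make_permutations(features, n=200)
signatures = {name: minhash_signature(obj, perms) for name, obj in objects.items()}
```

Comparing `estimate_similarity(signatures["S1"], signatures["S4"])` with `jaccard(objects["S1"], objects["S4"])` shows how the signature agreement rate approximates the exact similarity; more permutations generally give a tighter estimate.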


Abstract

The invention discloses a method and a device for acquiring similar object sets, and relates to the technical field of computers. A specific implementation of the method includes: obtaining a target object set and a set of candidate objects to be compared for similarity; setting a locality-sensitive comparison step size r; and, based on the feature data and the locality-sensitive comparison step size r, obtaining the similar object set of the target object from the candidate object set. This implementation uses a locality-sensitive minimum hash (MinHash) algorithm to obtain the similar object set of the target object from the candidate object set. It overcomes the limitation of the Hive SQL distributed method, which compares only objects that share a given attribute and therefore misses most other similar objects. It reduces computational complexity, speeds up computation, and improves both the accuracy of the results and the coverage of similar objects.
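One common way to combine locality-sensitive hashing with MinHash is banding, sketched below. This is an illustration under a stated assumption: the abstract does not define r precisely, so the sketch treats r as the number of consecutive signature rows compared together in one band. The function names and the sample signatures are hypothetical.

```python
from collections import defaultdict


def lsh_candidates(signatures, r):
    """Return pairs of objects whose MinHash signatures agree on all r rows
    of at least one band. ASSUMPTION: r is the band width (rows per band);
    trailing rows that do not fill a whole band are ignored."""
    buckets = defaultdict(set)
    for name, sig in signatures.items():
        for start in range(0, len(sig) - r + 1, r):
            band = tuple(sig[start:start + r])
            buckets[(start, band)].add(name)

    pairs = set()
    for members in buckets.values():
        ordered = sorted(members)
        for i in range(len(ordered)):
            for j in range(i + 1, len(ordered)):
                pairs.add((ordered[i], ordered[j]))
    return pairs


def similar_set(target, signatures, r):
    """Objects that share at least one band bucket with the target."""
    return {
        b if a == target else a
        for a, b in lsh_candidates(signatures, r)
        if target in (a, b)
    }


# Hypothetical signatures, for illustration only.
example_signatures = {
    "T": [1, 0, 2, 3],
    "A": [1, 0, 4, 5],
    "B": [9, 9, 9, 9],
    "C": [7, 8, 2, 3],
}
```

Under this reading, a smaller r makes it easier for two objects to land in the same bucket (more candidates, higher coverage), while a larger r demands agreement on more rows (fewer candidates, higher precision). Only objects sharing a bucket need a detailed comparison, which avoids comparing every pair.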

Description

Technical field

[0001] The present invention relates to the field of computer technology, in particular to a method and device for acquiring similar object sets.

Background

[0002] With the development of computer technology, it is often necessary to quickly find, within a large amount of data, the set of objects similar to a given object set. For example, in e-commerce, personalized recommendation requires finding products similar to those in a user's purchase records among a very large number of products. The usual approach is to compute the similarity of every pair of objects, but for object sets containing many objects, pairwise computation consumes huge amounts of time and computing resources and cannot meet practical needs. The distributed computing framework Hive SQL can also be used to calculate the similarity between objects.

[0003] In the course of realizing the present invention, the ...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/27, G06F16/22
CPC: G06F16/2255, G06F16/27
Inventor: 李陈程程苏珺于海殷大伟赵一鸿
Owner: BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD