Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Index file generation method and device

A technology of indexing files and component weights, applied in the computer field, can solve the problems of high complexity, failure to load, large index files, etc., and achieve the effect of low complexity

Inactive Publication Date: 2019-07-16
ALIBABA (CHINA) CO LTD
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The index file of the traditional LSH retrieval algorithm is large, and a single machine may not be able to load it. It needs to read the index file into the memory of multiple machines through the Hadoop cluster, which is more complicated to implement.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Index file generation method and device
  • Index file generation method and device
  • Index file generation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0079] Various exemplary embodiments, features, and aspects of the present disclosure will be described in detail below with reference to the accompanying drawings. The same reference numbers in the figures indicate functionally identical or similar elements. While various aspects of the embodiments are shown in drawings, the drawings are not necessarily drawn to scale unless specifically indicated.

[0080] The word "exemplary" is used exclusively herein to mean "serving as an example, embodiment, or illustration." Any embodiment described herein as "exemplary" is not necessarily to be construed as superior or better than other embodiments.

[0081] In addition, in order to better illustrate the present disclosure, numerous specific details are given in the following specific implementation manners. It will be understood by those skilled in the art that the present disclosure may be practiced without some of the specific details. In some instances, methods, means, componen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an index file generation method and device. The method comprises the following steps of extracting a feature vector of each piece of training data in a training data set; carrying out product quantization processing on the feature vectors of the training data to obtain a class center of the training data set; generating an empty index file according to the class center ofthe training data set; sending the null index file to each task node of a cluster; and obtaining an index file returned by each task node based on the empty index file. According to the method and thedevice, the main node of the cluster generates the empty index file based on product quantification, the empty index file is deployed on each task node of the cluster, each task node processes the data to be processed to obtain the index file, and sends the index file to the main node, so that the method and the device can be suitable for the high-dimensional feature retrieval, and the implementation complexity is relatively lower.

Description

technical field [0001] The present disclosure relates to the field of computer technology, in particular to a method and device for generating an index file. Background technique [0002] In recent years, with the rapid development of multimedia technology and computer network, the number of digital images in the world is increasing at an alarming rate. In order to effectively access and utilize the information contained in these complex images, a technology that can quickly and accurately search and access images is needed, that is, image retrieval technology. With the emergence of large-scale digital image databases, the traditional text-based image retrieval technology relying on manual annotation can no longer meet the growing needs of users, and CBIR (Content Based Image Retrieval, content-based image retrieval) technology came into being pregnancy. The general practice of CBIR is to first extract the features of the image to establish a feature database, so that an i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/182G06F16/13
CPCG06F16/134G06F16/182
Inventor 吉恒杉
Owner ALIBABA (CHINA) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products