Large-scale image library retrieval method based on self-adaptive bit allocation Hash algorithm

A hash algorithm, image library technology, applied in computing, computer parts, character and pattern recognition, etc., can solve the problems of large storage space and slow retrieval speed of image feature library

Active Publication Date: 2014-09-03
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF4 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Therefore, the present invention can solve the problem of large storage space and slow retrieval speed in the image feature library for massive image retrieval, and overcomes the shortcomings of LSH, SH, and PCAH methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Large-scale image library retrieval method based on self-adaptive bit allocation Hash algorithm
  • Large-scale image library retrieval method based on self-adaptive bit allocation Hash algorithm
  • Large-scale image library retrieval method based on self-adaptive bit allocation Hash algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] In order to make the purpose, technical solutions and advantages of the present invention clearer, the specific implementation manners of the present invention will be described in detail below in combination with the technical solutions and accompanying drawings. Take the FLICKR1M [Mark J. Huiskes, Michael S. Lew. The MIR Flickr retrieval evaluation. In Proceedings of ACM International Conference on Multimedia Information Retrieval, 2008] dataset as an example. FLICKR1M contains 1 million images, all downloaded from the Flickr website, of varying content and size.

[0060] figure 1 The left part of represents the main flowchart of the training process of the embodiment of the present invention, as shown in the figure,

[0061] Step S11: Divide the FLICKR1M dataset into two parts: the image database D to be retrieved (995,000 images) and the query image set Q (5,000 images). And randomly select 100,000 pictures from the image library to be retrieved as the training se...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a large-scale image library retrieval method based on a self-adaptive bit allocation Hash algorithm. The method comprises the following steps: selecting a part of images as a training set from an image library to be retrieved, and extracting a GIST characteristic of the training set; projecting the characteristic data of the training set by using principal component analysis (PCA), and calculating the dispersion of each dimension of training data; according to the dispersion of different dimensions, allocating different bits to encode the data in a self-adaptive manner; obtaining a sub-code according to the code length of each dimension and each dimension of a threshold code, and splicing complete codes of the data in pair; corresponding to the processing and training process of a checked image and the characteristic data in the image library to be retrieved, respectively calculating Hash codes of the image to be retrieved and the characteristics of the checked images; calculating the Hamming distance of the Hash codes, thereby rapidly retrieving similar images. The method is high in universality, the neighbor structure of original characteristic data can be well maintained, and as the data are encoded by using a Hash method, the storage space of data is reduced, and the retrieval efficiency in checking is improved.

Description

technical field [0001] The invention belongs to the technical field of image retrieval, and relates to a content-based image retrieval method, in particular to a large-scale image library retrieval method based on an adaptive bit allocation hash algorithm. Background technique [0002] With the increasing amount of image data on the Internet, how to quickly and accurately provide users with required image resources is becoming more and more important. Content-based image retrieval (content-based image retrieval, CBIR) can solve this problem better, so it has attracted the attention of many researchers. The existing retrieval methods describe the content of the image by extracting the underlying features of the image, and then use feature comparison to judge whether it is a similar image. Therefore, CBIR mainly includes two core research contents, one is effective image feature representation, and the other is efficient retrieval algorithm. The invention mainly solves the p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06K9/62
CPCG06F16/5838G06F18/21
Inventor 郭勤振曾智张树武
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products