Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Automatic image annotation method based on deep learning and canonical correlation analysis

A typical correlation analysis and image automatic labeling technology, which is applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problem that the vocabulary model and top-level feature fusion mechanism are not suitable for automatic image labeling tasks, etc.

Active Publication Date: 2015-04-29
NAVAL AVIATION UNIV
View PDF3 Cites 36 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

From the experimental results, compared with other deep learning models, the effect of this model is better, but there is still a gap compared with the classic automatic image labeling algorithm, because the vocabulary model and the top-level feature fusion mechanism are not suitable for automatic image labeling tasks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] The present invention will be further described in detail below in conjunction with specific examples.

[0053] An image automatic labeling method based on deep learning and canonical correlation analysis, including:

[0054] (1) Extract the underlying feature vector of the image to be labeled to construct the visual feature vector of the corresponding image;

[0055] In this implementation, the underlying feature vectors include color layout description vectors, color structure description vectors, scalable color description vectors, edge histogram description vectors, GIST feature vectors, and visual bag-of-words vectors based on SIFT features.

[0056] The visual bag-of-words vector based on SIFT features is extracted through the following steps:

[0057] (a) calculate the SIFT feature vectors of all images in the model training data set;

[0058] (b) Clustering all SIFT feature vectors to obtain 500 cluster centers;

[0059] (c) Use each cluster center as a visua...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an automatic image annotation method based on deep learning and canonical correlation analysis. The method includes: using a depth Boltzmann machine to extract the high-level feature vectors of images and annotation words, selecting multiple Bernoulli distribution to fit annotation word samples, and selecting Gaussian distribution to fit image features; performing canonical correlation analysis on the high-level features of the images and the annotation words; calculating the Mahalanobis distance between to-be-annotated images and training set images in canonical variable space, and performing weighted calculation according to the distance to obtain high-level annotation word features; generating image annotation words through mean field estimation. The depth Boltzmann machine comprises I-DBM and T-DBM which are respectively used for extracting the high-level feature vectors of the images and the annotation words. Each of the I-DBM and the T-DBM sequentially comprises a visible layer, a first hidden unit layer and a second hidden unit layer from bottom to top. By the method, the problem of 'semantic gap' during image semantic annotation can be solved effectively, and annotation accuracy is increased.

Description

technical field [0001] The invention relates to image automatic labeling and retrieval technology, in particular to an image automatic labeling method based on deep learning and canonical correlation analysis. Background technique [0002] With the growth of image data showing a geometric progression, how to effectively manage and retrieve these image data has become a research hotspot in the construction of information technology. Although the current content-based image retrieval technology has made great progress, and there are also a variety of prototypes, technologies and retrieval products for civilian use, but due to the main problem-the "semantic gap" has not made a fundamental breakthrough, resulting in its retrieval effect and method. Not ideal. To overcome these problems, the best solution is to add text semantic information related to the image content to the image, that is, image annotation. In view of the problems of strong subjectivity and low labeling effic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F18/2155G06F18/214
Inventor 张立民刘凯邓向阳孙永威张建廷
Owner NAVAL AVIATION UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products