Multi-modal multi-class Boosting frame construction method and device for cross-modal retrieval

A construction method and cross-modal technology, applied in the field of information retrieval, can solve the problem of low retrieval performance

Inactive Publication Date: 2016-10-12
HENAN NORMAL UNIV
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0012] The present invention provides a method and device for constructing a multi-modal multi-class Boosting framework for cross-modal retrieval, aiming to solve the problem of low retrieval performance when the traditional Boosting method is applied to cross-modal retrieval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-modal multi-class Boosting frame construction method and device for cross-modal retrieval
  • Multi-modal multi-class Boosting frame construction method and device for cross-modal retrieval
  • Multi-modal multi-class Boosting frame construction method and device for cross-modal retrieval

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The technical solutions of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0046] Embodiment of the multi-modal and multi-class Boosting framework construction method for cross-modal retrieval of the present invention

[0047] The method for constructing a multi-modal and multi-class Boosting framework for cross-modal retrieval in this embodiment maps data of different modalities to a common semantic space, and properly preserves semantic information within a modal and semantic correlation between modals specific steps include:

[0048] 1) Construct the target risk function R[f 1 ,...,f M ], the objective risk function includes the intra-modal loss of each mode and the inter-modal loss between each mode, where f 1 is the predictor of the first mode, f M is the predictor of the Mth mode, M≥2;

[0049] 2) According to the gradient descent strategy, update the predictor of each mode in the risk function ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to multi-modal multi-class Boosting frame construction method and device for cross-modal retrieval. The multi-modal multi-class Boosting frame construction method for the cross-modal retrieval comprises the steps: constructing a target risk function including intra-mode loss of each mode and inter-mode loss among all modes; according to gradient descent strategy, sequentially updating a predictor of each mode in the risk function, but fixing predictors of other modes; after the predictors of all modes are updated, defining primary cyclic iteration, performing T times of cyclic iteration, and then learning an optimal predictor of each mode, which enables a target function to be minimum; and transferring a fitted edge generated by the optimal predictor of each mode into a common semantic space by utilizing a Sigmoid function, thereby implementing the cross-modal retrieval. The multi-modal multi-class Boosting frame construction method for the cross-modal retrieval considers intra-mode semantic relevance, can reinforce intra-mode semantic information with poor quality to a certain extent, and has relatively greater performance in a cross-modal retrieval task.

Description

technical field [0001] The invention belongs to the field of information retrieval, and in particular relates to a method and device for constructing a multi-modal and multi-class Boosting framework for cross-modal retrieval. Background technique [0002] The core idea of ​​the Boosting classification method is to combine multiple weak classifiers into a strong classifier. This method has been widely studied in computer vision and pattern recognition and other application fields, and has achieved good results. Nevertheless, traditional boosting methods only learn classification rules from a single modality dataset and cannot directly deal with multimodal datasets. In general, traditional boosting methods can be applied to cross-modal retrieval by individually mapping each modality's dataset to the semantic space. However, this scheme does not consider the crucial inter-modal information, which degrades the retrieval performance to some extent. [0003] At present, how to r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/35
Inventor 王世勋潘鹏孙林张仕光李源
Owner HENAN NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products