Cross-modal retrieval method and device, network training method and device, equipment and medium

A cross-modal retrieval and network training technology, applied in the computer field, that addresses the poor retrieval performance and insufficiently fine-grained features of existing approaches, making the extracted features more identifiable and improving cross-modal retrieval performance.

Pending Publication Date: 2022-07-29
BEIJING DAJIA INTERNET INFORMATION TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] The present disclosure provides a cross-modal retrieval method, a network training method, an apparatus, a device, and a medium to at least solve the following problem in the related art: the features output by the retrieval network are not fine-grained enough and support only coarse-grained judgments about whether features are related, so fine-grained cross-modal retrieval performs poorly.

Embodiment Construction

[0112] In order to make those skilled in the art better understand the technical solutions of the present disclosure, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings.

[0113] It should be noted that the terms "first", "second" and the like in the description and claims of the present disclosure and in the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It is to be understood that data so used may be interchanged under appropriate circumstances, so that the embodiments of the disclosure described herein can be practiced in sequences other than those illustrated or described herein. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with this disclosure. Rather, they are merely examples of apparatus and methods ...

Abstract

The invention relates to a cross-modal retrieval method and device, a network training method and device, equipment and a medium. The cross-modal retrieval method comprises: obtaining to-be-retrieved data and candidate data, where the to-be-retrieved data and the candidate data belong to different modalities; extracting a first feature of the to-be-retrieved data and a second feature of the candidate data based on a cross-modal retrieval network; and retrieving, from the candidate data, data that matches the to-be-retrieved data according to the matching degree between the first feature and the second feature. With the disclosed cross-modal retrieval network, local information in the input to-be-retrieved data and candidate data can be captured accurately and more effective features are output. These features are more distinctive within the same modality and more identifiable across different modalities, so fine-grained cross-modal retrieval performance is improved.
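The retrieval step in the abstract (rank candidates by the matching degree between the first and second features) can be sketched as follows. This is a minimal illustration, not the patented network: the feature vectors are hypothetical precomputed values standing in for the network's output, and cosine similarity is an assumed choice of matching degree that the source does not specify. The names `cosine_similarity` and `retrieve` are introduced here for illustration only.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (assumed matching degree)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def retrieve(query_feature, candidate_features, top_k=1):
    """Return the indices of the top_k candidates, best match first."""
    scores = [cosine_similarity(query_feature, c) for c in candidate_features]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:top_k]


# Hypothetical features: one query (e.g. from an image) and three candidates
# (e.g. from text), already mapped into the same feature space.
query = [1.0, 0.0, 1.0]
candidates = [[0.0, 1.0, 0.0], [0.9, 0.1, 1.1], [1.0, 1.0, 0.0]]
best = retrieve(query, candidates, top_k=2)
```

In a real system the two feature lists would come from the modality-specific branches of the trained retrieval network; only the ranking logic is shown here.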

Description

Technical field

[0001] The present disclosure relates to the field of computers, and in particular to a cross-modal retrieval method, a network training method, an apparatus, a device, and a medium.

Background

[0002] Cross-modal retrieval refers to retrieval in which the modality of the results differs from the modality of the query data, for example, using images to retrieve text, video, or audio.

[0003] In the related art, cross-modal retrieval usually computes the similarity between the features of different modalities output by the retrieval network to obtain a correlation score, and performs retrieval according to that score. When training the retrieval network, the data of the two modalities are usually mapped into a high-dimensional representation space of the same dimension, and the resulting features of the two modalities are then trained directly with a contrastive loss function. The features output by...
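For readers unfamiliar with the related-art training setup described in [0003], the sketch below shows one common form of loss used on paired features from two modalities: a symmetric InfoNCE-style loss. The source does not say which formulation the related art uses, so this particular form, the `temperature` parameter, and the function names are all assumptions made for illustration. Inputs are assumed to be L2-normalized feature vectors in which `features_a[i]` and `features_b[i]` form a matching pair.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def symmetric_info_nce(features_a, features_b, temperature=0.1):
    """Symmetric InfoNCE-style loss over a batch of paired features.

    Assumption: features_a[i] matches features_b[i]; every other pairing in
    the batch is treated as a negative.
    """
    n = len(features_a)
    # Similarity of every a-feature with every b-feature, scaled by temperature.
    logits = [[dot(a, b) / temperature for b in features_b] for a in features_a]

    def cross_entropy_on_diagonal(rows):
        total = 0.0
        for i, row in enumerate(rows):
            m = max(row)  # subtract the max for numerical stability
            log_sum = m + math.log(sum(math.exp(v - m) for v in row))
            total += log_sum - row[i]
        return total / n

    columns = [list(col) for col in zip(*logits)]
    # Average the a-to-b and b-to-a directions.
    return 0.5 * (cross_entropy_on_diagonal(logits) + cross_entropy_on_diagonal(columns))


# Hypothetical two-pair batch: aligned pairs should give a much lower loss
# than the same features with the pairing swapped.
modality_a = [[1.0, 0.0], [0.0, 1.0]]
aligned_b = [[1.0, 0.0], [0.0, 1.0]]
swapped_b = [[0.0, 1.0], [1.0, 0.0]]
loss_aligned = symmetric_info_nce(modality_a, aligned_b)
loss_swapped = symmetric_info_nce(modality_a, swapped_b)
```

A loss of this kind only rewards whole-sample agreement between the two feature vectors, which is consistent with the coarse-grained behavior the disclosure attributes to the related art.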


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/903; G06N3/04; G06N3/08
CPC: G06F16/90335; G06N3/08; G06N3/045
Inventor: 何永明李涛梅丰
Owner: BEIJING DAJIA INTERNET INFORMATION TECH CO LTD