Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Cross-modal Retrieval Method Based on Recurrent Generative Adversarial Networks

A cross-modal, generative technology, applied in biological neural network models, neural learning methods, other database retrieval, etc., can solve problems such as staying, dependence, and inability to achieve cross-modal retrieval of unlabeled data, and improve efficiency. , the effect of improving accuracy and stability

Active Publication Date: 2021-12-17
WUHAN UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of the traditional Internet retrieval services are still at the level of single-modal retrieval, and there are few retrieval applications for cross-modal data. The efficiency, accuracy, and stability of retrieval need to be improved, and most of them rely on existing data labels, it is impossible to achieve cross-modal retrieval of unlabeled data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Cross-modal Retrieval Method Based on Recurrent Generative Adversarial Networks
  • A Cross-modal Retrieval Method Based on Recurrent Generative Adversarial Networks
  • A Cross-modal Retrieval Method Based on Recurrent Generative Adversarial Networks

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] Below in conjunction with accompanying drawing and specific embodiment the present invention will be described in further detail:

[0054]In recent years, with the upsurge of artificial intelligence, deep learning technology has gradually emerged and affected many fields of computer science. In the field of multimedia information retrieval technology, more and more people use deep learning to improve the accuracy and accuracy of existing retrievals. stability. The generative adversarial network (generative adversarial network) used in this method is a new type of neural network that has been widely used in recent years to estimate the generative model through the confrontation process. The generator (generator) and The discriminator used to distinguish the authenticity of the data, the generator and the discriminator compete with each other during the training process, and finally reach a dynamic balance. Generative adversarial networks are widely used in many fields s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a cross-modal retrieval method based on a cyclic generative confrontation network. The method designs a novel dual-channel cyclic generative confrontational neural network, and establishes the semantics of cross-modal data by training the neural network. Correlation. Given that different modal data can flow bidirectionally in the network, each modal data generates another modal data through a set of generative adversarial networks, and the generated data is used as the input of the next set of generative adversarial networks, so as to realize the bidirectional flow of data Cyclic generation, the network continuously learns the semantic relationship between cross-modal data. In order to improve the retrieval efficiency, this method also uses the threshold function and approximation function to approximate the result of the middle layer of the generator to the corresponding binary hash code, and designs a variety of constraints to ensure the similarity and The difference of cross-modal and inter-class data further improves the accuracy and stability of retrieval.

Description

technical field [0001] The invention belongs to the technical field of multimedia information retrieval, and in particular relates to a cross-modal retrieval method based on a cyclic generative confrontation network. [0002] technical background [0003] With the advent of the Internet era, people can access massive amounts of information in various modes including pictures, videos, texts, and audios anytime and anywhere. How to obtain the content they need from these massive amounts of information has become the focus of Internet users. Often rely on the precise search services provided by search engines such as Google, Baidu, and Bing. However, most of the traditional Internet retrieval services are still at the level of single-modal retrieval, and there are few retrieval applications for cross-modal data. The efficiency, accuracy, and stability of retrieval need to be improved, and most of them rely on existing The data label of the data cannot achieve cross-modal retrie...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/903G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/048G06N3/045G06F18/2413
Inventor 倪立昊王骞邹勤李明慧
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products