Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Picture description generation method and system based on Actor-Critic generative adversarial network

A picture description and generation system technology, applied in the direction of biological neural network model, neural learning method, character and pattern recognition, etc., can solve the problems of rough text quality and not meeting expectations, so as to increase diversity and creativity, improve The effect of high performance and training efficiency

Pending Publication Date: 2020-05-15
ZHEJIANG UNIV
View PDF5 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] SeqGAN solves the training problem of discrete data through reinforcement learning, and provides a set of methods for GAN in the field of text generation, but many experimental results show that the quality of the text generated by the SeqGAN model is rough, far below expectations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Picture description generation method and system based on Actor-Critic generative adversarial network
  • Picture description generation method and system based on Actor-Critic generative adversarial network
  • Picture description generation method and system based on Actor-Critic generative adversarial network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be noted that the following embodiments are intended to facilitate the understanding of the present invention, but do not limit it in any way.

[0071] The architecture of the picture text description generation system of the present invention is as follows: figure 1 As shown, it includes feature extraction module, generation module, discrimination module and Actor-Critic module, and its workflow is as follows figure 2 As shown, the details are: after obtaining the data, perform data processing, including operations such as establishing word vectors and data enhancement, to obtain the data required for subsequent model building. An image is extracted through the HCP (Hypotheses-CNN-Pooling) feature extraction module to extract high-level semantic features, which are used to initialize the generation module to obtain text descripti...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a picture description generation method and system based on an Actor-Critic generative adversarial network, and the method comprises the following steps: (1) obtaining a picture described by a known text, carrying out the preprocessing, and constructing a training set, (2) establishing a target network based on a generative adversarial network and an Actor-Critic algorithm,wherein the target network comprises a generator network, a discriminator network and a Critic network, (3) inputting the pictures in the training set and the text description of the pictures into the target network, performing pre-training and adversarial training on the generator and the discriminator, and performing single-step updating on the parameters of the generator by adopting an Actor-Critic algorithm, and (4) inputting the target picture needing to generate the text description into the trained generator to obtain the text description of the target picture. According to the method,based on the Actor-Critic algorithm, an adversarial network technology is adopted, and diversified text description can be generated on a given image.

Description

technical field [0001] The present invention relates to the fields of reinforcement learning, computer vision and natural language processing, in particular to a method and system for generating picture descriptions based on Actor-Critic generative confrontation networks. Background technique [0002] Image summarization (Image Caption) is an important problem in computer vision. [0003] In traditional image summarization tasks, the commonly used model is the Encoder-Decoder (encoding-decoding) model. In general, Encoder uses Convolutional Neural Networks (CNN) to extract image features, while Decoder uses Recursive Neural Network (RNN) to generate summaries from image features. This is because CNN performs well in extracting image features, while RNN has a memory ability for input and is outstanding in text generation. [0004] Proposed in a paper published by Qi Wu (Q.Wu, C.Shen, A.van den Hengel, L.Liu, and A.Dick.What Value DoExplicit High-Level Concepts Have in Visio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/58G06N3/04G06N3/08G06K9/62
CPCG06F16/5866G06N3/08G06N3/044G06N3/045G06F18/214
Inventor 陈建海李天旭季博楼智豪袁嘉琪何钦铭
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products