Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for generating image through text with introduced class information

A technology for generating images and texts, applied in the field of text-generated images that introduce class information, it can solve the problem of incomplete description of a single text, and achieve the effect of enriching image semantics and reducing training difficulty.

Pending Publication Date: 2021-05-07
SOUTHEAST UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The present invention aims at the deficiencies of the above-mentioned existing image generation methods, and provides a text generation image method and device that introduces class information, which can introduce the class information to which the text belongs during the image generation process, and use the class information to constrain the same class of text to generate pictures Relevance, while solving the problem that a single text description may not be comprehensive

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for generating image through text with introduced class information
  • Method and device for generating image through text with introduced class information
  • Method and device for generating image through text with introduced class information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The present invention will be further illustrated below in conjunction with the accompanying drawings and specific embodiments, and it should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. After reading the present invention, modifications to various equivalent forms of the present invention by those skilled in the art fall within the scope defined by the appended claims of the present application. A text-to-image method that introduces class information based on conditional generative adversarial text, such as Figure 1-2 shown, including the following steps:

[0040] Step 1, build a text encoding unit, including a text encoder and a cyclic neural network transcoder, such as figure 1described in S1. The text encoder inputs natural language text and outputs an embedded representation of the text. The natural language text here uses English text. After removing the s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and device for generating an image by a text with introduced class information, the method for generating the image by the text with introduced class information comprises a training stage and a testing stage, the training stage is based on a generative adversarial network, and a natural language text describing an image, the class label of the text, a corresponding real image training generator and a discriminator are utilized; and in the test stage, a corresponding image is generated in a generator by using the text and the class label thereof. The method and device have the advantages that text semantic image features and class information image features are generated through respective transcoding according to text information codes and class information codes, then the two levels of image features are fused to decode and generate the image, corresponding class information is introduced in the image generation process to enhance the correlation between the generated image and the text, and meanwhile, through the multi-stage generation process in the training process, the higher-resolution image is gradually generated, and the training difficulty of directly generating the high-resolution image is reduced.

Description

technical field [0001] The invention relates to the technical field of deep learning generation models, in particular to a method and device for generating images from texts by introducing class information. Background technique [0002] Image generation from text is an important problem and has wide applications, such as computer-aided medicine, news image generation, etc. [0003] The research on image generation methods from text is mainly based on two generative models, Conditional Variational Auto-Encoder (CVAE for short) and Conditional Generative Adversarial Networks (CGAN for short). Among them, the pictures generated by the CVAE method often have the problem of blurred pictures. Now the mainstream methods are all based on the CGAN model. [0004] Due to the instability of GAN's own training, it is very difficult to directly generate high-resolution images from text descriptions. Therefore, the hierarchical generation confrontation network (StackGAN) proposes a meth...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/58G06F40/289G06F40/30G06N3/04G06N3/08G06T3/40
CPCG06F16/3344G06F16/5866G06F40/289G06F40/30G06N3/08G06T3/4053G06N3/045
Inventor 周德宇孙凯胡名起
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products