
Multi-modal machine translation data enhancement method based on image description generation

An image description and machine translation technology, applied in natural language translation, neural learning methods, biological neural network models, etc. It addresses the scarcity of training data and the limited effect of existing data enhancement techniques, produces fluent pseudo data, and improves the robustness and translation performance of the system.

Active Publication Date: 2021-01-22
沈阳雅译网络技术有限公司
Cites: 3 | Cited by: 7

AI Technical Summary

Problems solved by technology

[0006] To address the scarcity of existing multimodal translation training data and the limited effect of traditional data enhancement techniques, the present invention proposes a multimodal machine translation data enhancement method based on image description generation, which uses an image description generation model to construct pseudo data and thereby expand the training data.



Examples


Detailed Description of Embodiments

[0034] The present invention is described in further detail below with reference to the accompanying drawings.

[0035] In view of the scarcity of training data in multimodal translation tasks and the poor effect of traditional data enhancement methods, the present invention proposes a multimodal machine translation data enhancement method based on image description generation. It uses an image description generation model to construct pseudo data, thereby further improving the performance of the translation system.

[0036] As shown in Figure 3, the multimodal machine translation data enhancement method based on image description generation of the present invention includes the following steps:

[0037] 1) In the image description domain, where training data is plentiful, use pre-trained image encoding information and the corresponding image descriptions to train an attention-based image description generation model;

[0038] 2) Use the trained image de...
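Step 1 calls for an attention-based image description model but the visible text does not say which form of attention is used. The following is a minimal sketch of one decoding step, assuming scaled dot-product attention over pre-extracted image region features. The function names `softmax` and `attend`, the toy feature vectors, and the choice of scoring function are all illustrative assumptions, not details from the patent.

```python
import math
from typing import List, Sequence, Tuple


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(
    query: Sequence[float], regions: Sequence[Sequence[float]]
) -> Tuple[List[float], List[float]]:
    """Weight image region features by their similarity to the decoder query.

    Assumption: scaled dot-product scoring; the patent only says "attention".
    Returns (attention weights, context vector).
    """
    scale = math.sqrt(len(query))
    scores = [
        sum(q * r for q, r in zip(query, region)) / scale for region in regions
    ]
    weights = softmax(scores)
    dim = len(regions[0])
    context = [
        sum(w * region[i] for w, region in zip(weights, regions))
        for i in range(dim)
    ]
    return weights, context


# Toy data: three image regions with 2-dimensional features (hypothetical).
regions = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
query = [2.0, 0.0]  # hypothetical decoder state at one time step
weights, context = attend(query, regions)
```

In a real captioning model the context vector would be combined with the decoder state to predict the next word, and the query would change at every step so that different words attend to different regions.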


Abstract

The invention discloses a multi-modal machine translation data enhancement method based on image description generation, which comprises the following steps: training an attention-based image description generation model using pre-trained image encoding information and the corresponding image descriptions; encoding and decoding the pictures in the existing multi-modal training data with the trained image description generation model to generate corresponding source-language image description text; translating the generated source-language image description text into the target language to construct pseudo data; and adding the constructed pseudo data to the multi-modal training data, fusing the picture information in the multi-modal training data with the source-language description information, feeding the fused information into a multi-modal machine translation model, and generating a target-language translation autoregressively with the help of the image context information. The method enriches the diversity of the pseudo data and can improve performance through knowledge refinement; compared with common data enhancement methods such as random replacement, it has considerable advantages.
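The data flow in the abstract (caption the existing pictures, translate the captions, add the results to the training set) can be sketched as follows. This is an illustration under stated assumptions: `Triple`, `build_pseudo_data`, `augment`, and the two toy stand-in functions are hypothetical names, and the captioning and translation models are replaced by placeholders because the patent text shown here does not specify them.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass(frozen=True)
class Triple:
    """One multimodal training example: picture, source text, target text."""
    image_id: str
    source: str
    target: str
    is_pseudo: bool = False


def build_pseudo_data(
    image_ids: Sequence[str],
    caption_fn: Callable[[str], str],
    translate_fn: Callable[[str], str],
) -> List[Triple]:
    """Caption each picture, translate the caption, and mark it as pseudo data."""
    pseudo = []
    for image_id in image_ids:
        caption = caption_fn(image_id)        # source-language description
        translation = translate_fn(caption)   # target-language side
        pseudo.append(Triple(image_id, caption, translation, is_pseudo=True))
    return pseudo


def augment(
    real: List[Triple],
    caption_fn: Callable[[str], str],
    translate_fn: Callable[[str], str],
) -> List[Triple]:
    """Append pseudo examples built from pictures already in the real data."""
    # dict.fromkeys removes duplicate image ids while keeping their order.
    image_ids = list(dict.fromkeys(t.image_id for t in real))
    return real + build_pseudo_data(image_ids, caption_fn, translate_fn)


# Toy stand-ins for the trained models (hypothetical, for illustration only).
def toy_caption(image_id: str) -> str:
    return f"caption for {image_id}"


def toy_translate(text: str) -> str:
    return f"<translated> {text}"


real = [
    Triple("img1", "a dog runs", "ein Hund rennt"),
    Triple("img2", "a cat sleeps", "eine Katze schlaeft"),
]
augmented = augment(real, toy_caption, toy_translate)
```

Keeping an `is_pseudo` flag on each example is a design choice of this sketch; it would let a training loop weight or filter generated pairs separately from human-written ones. The final fusion of picture and text information inside the translation model is not covered here.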

Description

Technical field

[0001] The invention relates to machine translation data enhancement technology, and in particular to a multimodal machine translation data enhancement method based on image description generation.

Background

[0002] Machine translation (MT) is an experimental discipline that uses computers to translate between natural languages. With machine translation technology, text in a source language can be converted automatically into a target language. As a key technology for removing barriers to cross-language communication, machine translation has always been an important part of natural language processing research. Compared with human translation, machine translation is more efficient and less costly, which is of great significance for promoting national unity and cultural exchange. Machine translation methods fall into two groups: rationalist methods and empiricist methods. Since it was proposed in the 1940s, mach...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/58, G06K9/00, G06K9/62, G06N3/04, G06N3/08
CPC: G06F40/58, G06N3/08, G06V30/40, G06N3/045, G06F18/251
Inventor: 杜权
Owner: 沈阳雅译网络技术有限公司