Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Bidirectional coding target encoder construction method and device

A target encoding and encoding technology, applied in the field of pre-training models, can solve problems such as unproposed solutions and reduced translation accuracy

Pending Publication Date: 2021-02-05
SHANGHAI MININGLAMP ARTIFICIAL INTELLIGENCE GRP CO LTD
View PDF5 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although such a transfer learning architecture based on self-encoded feature extraction acquires contextual features at the same time, its essence is one-way text input. Features are extracted from the context at the same time to reflect the "two-way" process. Therefore, when Transformer performs machine translation, along with As the text length increases, when translating from left to right, the translation accuracy at the end of the sentence continues to decrease
[0004] For the above problems, no effective solution has been proposed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Bidirectional coding target encoder construction method and device
  • Bidirectional coding target encoder construction method and device
  • Bidirectional coding target encoder construction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of this application, but not all of them. Based on the embodiments in the present application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present application.

[0025] In the following description, use of suffixes such as 'module', 'part' or 'unit' for denoting elements is only for facilitating the description of the present application and has no specific meaning by itself. Therefore, "module" and "component" may be mixedly used.

[0026]In related technologies, the technical solution of tran...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a bidirectional coding target encoder construction method and device. The method comprises the steps of obtaining a training text; encoding the training text into a first sequence according to the encoding relationship stored in the dictionary, wherein the encoding sequence of the first sequence is consistent with the text sequence of the training text; covering each element in the first sequence in sequence according to a character sequence of the training text to obtain a plurality of second sequences; rearranging the elements in each second sequence according to a sequence opposite to the current arrangement sequence to obtain a plurality of third sequences; and inputting the second sequence and the third sequence into a self-encoding language model, and outputting the model as a target encoder. When the target encoder is constructed, feature extraction training is performed by adopting forward encoding input and reverse encoding input, so that the feature representation capability of the encoder is improved through forward and reverse bidirectional encoding in a real spatial sense, and the technical problem that the translation accuracy at the end of asentence is continuously reduced is solved.

Description

technical field [0001] The present application relates to the technical field of pre-training models, and in particular to a method and device for constructing a bidirectional encoding target encoder. Background technique [0002] The language model is widely used in various natural language processing tasks, and its essence is equivalent to an encoder, which can effectively extract important feature information from the original text. In recent years, the research on language models is based on the connection of the frequently used Word2Vec to the neural network, and then the use of the bidirectional LSTM network and the Text-CNN network. The feature extractor with the best performance in recent years is the Transformer neural network. Therefore, transfer learning based on Transformer's pre-training-fine-tuning architecture has also become a trend, and its core is the pre-trained language model implemented by Transformer. This transfer learning method makes it more convenie...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/126G06K9/62G06N3/04
CPCG06F40/126G06N3/045G06F18/214Y02T10/40
Inventor 徐成国杨康周星杰王硕
Owner SHANGHAI MININGLAMP ARTIFICIAL INTELLIGENCE GRP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products