Text correction model training method and device and text correction method and device

A technology for correcting models and texts, applied in text database queries, biological neural network models, unstructured text data retrieval, etc., can solve the problems of reducing the accuracy of model output, not being able to learn the diversity of training data well, and achieve The effect of improving accuracy

Pending Publication Date: 2020-12-15
网易有道信息技术(北京)有限公司
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the process of model training, the target distribution is often used as 0-1 distribution, that is, only the probability corresponding to the only correct target is used when calculating the loss, which will cause the model to fail to learn the diversity of training data well and reduce the accuracy of the model. output accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text correction model training method and device and text correction method and device
  • Text correction model training method and device and text correction method and device
  • Text correction model training method and device and text correction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The principle and spirit of the present application will be described below with reference to several exemplary embodiments. It should be understood that these embodiments are given only to enable those skilled in the art to better understand and implement the present application, rather than to limit the scope of the present application in any way. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

[0055] Those skilled in the art know that the embodiments of the present application may be realized as a system, device, device, method or computer program product. Therefore, the present disclosure may be embodied in the form of complete hardware, complete software (including firmware, resident software, microcode, etc.), or a combination of hardware and software.

[0056] Herein, it should be understood that any number of elements in the drawings is...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text correction model training method and device and a text correction method and device.The text correction method and device are used for improving text correction accuracy, and the training method comprises the steps of inputting wrong texts into a text correction model to acquire a correction word vector sequence; based on the correction word vector sequence and the target word vector sequence, obtaining context semantic information representing text fluency corresponding to each correction word vector; based on the context semantic information corresponding to each correction word vector, respectively obtaining a generation probability corresponding to each correction word vector, the generation probability being a probability of generating the correction word vector with the context semantic information; obtaining a first loss value based on the generation probability of each correction word vector and the difference degree of each target word vector; obtaining a second loss value based on the difference degree between the target word vector sequence and the correction word vector sequence; and updating parameters of the text correction model based on a weighted summation result of the first loss value and the second loss value.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, in particular to a text correction model training method and device, and a text correction method and device. Background technique [0002] This section is intended to provide a background or context to the implementations of the application that are recited in the claims. The descriptions herein are not admitted to be prior art by inclusion in this section. [0003] At present, common text correction models (such as English composition grammar correction models) all use an end-to-end model framework based on Transformer (machine translation), which includes two parts: Encoder (encoder) and Decoder (decoder). . In the process of model training, the target distribution is often used as 0-1 distribution, that is, only the probability corresponding to the only correct target is used when calculating the loss, which will cause the model to fail to learn the diversity of tr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F40/284G06F40/30G06K9/62G06N3/02
CPCG06F16/3344G06F40/30G06F40/284G06N3/02G06F18/214
Inventor 王吉平付凯方美媛黄瑾段亦涛
Owner 网易有道信息技术(北京)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products