Text grammar error correction method fusing monolingual data
A grammatical error and error correction technology, applied in the field of text error correction, can solve problems such as grammatical errors
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment 1
[0034] The present invention provides a text grammatical error correction method that integrates monolingual data, specifically:
[0035] A text grammatical error correction method fused with monolingual data, comprising the following steps:
[0036] Step 1: build the reverse grammatical error generation model, and train the reverse grammatical error generation model;
[0037] The step 1 is specifically:
[0038] Build a reverse grammatical error generation model. The input of the model is the correct sentence written in the parallel sentence pair, and the output is a sentence containing grammatical errors in the parallel sentence pair. The reverse grammatical error generation model is exactly the same as the forward grammatical error correction model. The network structure, learning criteria and training method of the given error sentence x=(x 1 , x 2 ,...,x m ) and the corresponding correction sentence y=(y 1 ,y 2 ,...,y n ), the noise-adding probability p(x|y) modele...
specific Embodiment 2
[0058] The present invention uses the back-translation method in neural machine translation to synthesize fake data.
[0059] First, a reverse grammatical error generation model is trained using the seed corpus,
[0060] During training, the source input of the model is the correct corrected sentence written in the "error-correction" parallel sentence pair, and the target output is the sentence containing grammatical errors in the parallel sentence pair.
[0061] Once trained, the inverse model can be used to "translate" a large number of correctly written texts into texts with grammatical errors, thereby constructing pseudo "error-correction" parallel sentence pairs.
[0062] It has been shown in past studies that in the decoding stage of the reverse model, if the Beam Search strategy is directly adopted, there will be insufficient grammatical error diversity in the generated pseudo-source error sentences. Different from the noise-added Beam Search decoding strategy used in ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com