Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Chinese text automatic correction method

An automatic correction, text technology, applied in special data processing applications, instruments, electronic digital data processing and other directions, can solve the problem that the cost and efficiency cannot adapt to the rapid increase in the number of electronic texts, and achieve fast error checking and high error correction efficiency. Effect

Inactive Publication Date: 2016-01-27
SHANGHAI INST OF TECH
View PDF2 Cites 63 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

From the perspective of long-term development, informatization is the trend of social development in the future. People are faced with more and more electronic information and manuscripts. Traditional manual proofreading requires proofreaders to read and check the text word by word, which is cost-effective and efficient. Cannot adapt to the trend of rapid growth in the number of electronic texts

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese text automatic correction method
  • Chinese text automatic correction method
  • Chinese text automatic correction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0023] figure 1 It is a schematic diagram of the Chinese text automatic correction process of the present invention.

[0024] See figure 1 , the Chinese text automatic correction method that the present invention provides, comprises the steps:

[0025] a) input the Chinese text to be proofread, and carry out word segmentation preprocessing to the Chinese text by a single sentence; adopt voice or keyboard to input the Chinese text to be proofread, and the preprocessing includes sorting grammatical errors and pattern matching checks to the input Chinese text to be proofread, to be Proofreading Chinese text can be input by voice or keyboard. The keyboard input process is as follows: image 3 Shown: the words are encoded in advance, the keystroke signal is converted into a code sequence accepted by the computer, and the code sequence is associated with t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Chinese text automatic correction method. The method comprises the following steps of: a) inputting a to-be-corrected Chinese text, and performing word segmentation preprocessing on the Chinese text sentence by sentence; b) searching for one-character words, two-character words or disperse strings of three or more than three characters occurring in the text subjected to word segmentation sentence by sentence; c) performing continuous determination on the disperse strings occurring in the text subjected to word segmentation by adopting an N-gram model, and checking text word level errors for each single sentence in combination with a word forming probability of separate characters; and d) constructing an error correction knowledge base to generate an error correction candidate text. According to the Chinese text automatic correction method provided by the invention, the one-character words, two-character words or disperse strings of three or more than three characters occurring in the text subjected to word segmentation are searched for sentence by sentence, the disperse strings occurring in the text subjected to word segmentation are subjected to continuous determination by adopting the N-gram model to determine identification errors, and the error correction knowledge base is constructed to generate the error correction candidate text, so that error checking and correcting processes are combined very well, and the method has the characteristics of high error checking speed and high error correcting efficiency.

Description

technical field [0001] The invention relates to a text correction method, in particular to a Chinese text automatic correction method. Background technique [0002] With the rapid development of modern laser phototypesetting technology and electronic publishing industry, how to ensure the correctness of the information conveyed has become one of the important aspects of research. At present, when people use computers for writing, editing, and typesetting, some text errors will inevitably occur, such as multiple words, missing words, transposition, English word spelling errors, and irregular punctuation. Therefore, a special school team system is required to proofread the manuscript. From the perspective of long-term development, informatization is the trend of social development in the future. People are faced with more and more electronic information and manuscripts. Traditional manual proofreading requires proofreaders to read and check the text word by word, which is cos...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27
Inventor 刘云翔杜杰李晓丹郑力杜俊刘续博
Owner SHANGHAI INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products