Text information processing method and device

A text information processing method and device, applied in the field of text information processing, that address the low accuracy of handwritten text recognition and improve the accuracy of the recognition result.

Active Publication Date: 2022-07-29
BEIJING BAIDU NETCOM SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0003] At present, OCR text recognition technology is mature for printed text, where accuracy can exceed 90%. For handwritten text, however, existing OCR text recognition technology has low accuracy.

Embodiment Construction

[0034] The present application is described in further detail below with reference to the accompanying drawings and embodiments. The specific embodiments described herein serve only to explain the invention, not to limit it. For convenience of description, the drawings show only the parts related to the invention.

[0035] It should be noted that the embodiments in the present application and the features of the embodiments may be combined with each other in the case of no conflict. The present application will be described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.

[0036] Figure 1 shows an exemplary architecture 100 to which the text information processing method and apparatus of the present application may be applied.

[0037] As shown in Figure 1, the system architecture 100 may include ...

Abstract

Embodiments of the present application disclose a text information processing method and device, which relate to the field of cloud computing. A specific implementation of the method includes: recognizing the text to be processed from an image that contains it; inputting the text to be processed into a pre-trained recurrent neural network language model to identify typos in the text; inputting the typos into a pre-trained text error correction model to obtain similar words corresponding to the typos; and using the text error correction model to determine, based on the coherence of the text to be processed, the correct words among the similar words, then replacing the typos with the correct words to obtain the error-corrected text. The present application thus identifies typos with a pre-trained recurrent neural network language model and obtains their corrections from a pre-trained text error correction model, which yields error-corrected text and improves the accuracy of the recognition result.
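The detect, propose, and select flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patent's implementation. A small bigram count model stands in for the pre-trained recurrent neural network language model, a hand-written lookup table stands in for the pre-trained text error correction model, and the input is assumed to be text already produced by an OCR step. The names `BigramModel`, `CONFUSION`, `detect_typos`, and `correct_text`, as well as the English sample data, are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter
from typing import Dict, List

# Toy training sentences (assumption): a stand-in for language model training data.
CORPUS = [
    "please send the report today",
    "send the report to the manager",
    "the manager read the report",
]

# Hypothetical similar-word table: a stand-in for the text error correction model.
CONFUSION: Dict[str, List[str]] = {
    "repart": ["report", "repast"],
    "tbe": ["the", "tie"],
}


class BigramModel:
    """Add-one smoothed bigram model; a simple substitute for an RNN language model."""

    def __init__(self, corpus: List[str]) -> None:
        self.context: Counter = Counter()
        self.bigram: Counter = Counter()
        vocab = set()
        for line in corpus:
            words = ["<s>"] + line.split() + ["</s>"]
            self.context.update(words[:-1])
            self.bigram.update(zip(words, words[1:]))
            vocab.update(words[1:])
        self.vocab_size = len(vocab) + 1  # one extra slot for unseen words

    def seen(self, prev: str, word: str) -> bool:
        return self.bigram[(prev, word)] > 0

    def log_prob(self, tokens: List[str]) -> float:
        """Coherence score: log-probability of the whole sentence."""
        words = ["<s>"] + tokens + ["</s>"]
        total = 0.0
        for prev, word in zip(words, words[1:]):
            p = (self.bigram[(prev, word)] + 1) / (self.context[prev] + self.vocab_size)
            total += math.log(p)
        return total


def detect_typos(tokens: List[str], model: BigramModel) -> List[int]:
    """Flag words that fit neither their left nor their right neighbor."""
    words = ["<s>"] + tokens + ["</s>"]
    return [
        i for i in range(len(tokens))
        if not model.seen(words[i], words[i + 1])
        and not model.seen(words[i + 1], words[i + 2])
    ]


def correct_text(tokens: List[str], model: BigramModel,
                 confusion: Dict[str, List[str]]) -> List[str]:
    """Replace each flagged word with the similar word that scores best in context."""
    corrected = list(tokens)
    for i in detect_typos(tokens, model):
        candidates = confusion.get(tokens[i], [])
        if not candidates:
            continue  # no similar words known; keep the original word
        corrected[i] = max(
            candidates,
            key=lambda c: model.log_prob(corrected[:i] + [c] + corrected[i + 1:]),
        )
    return corrected


if __name__ == "__main__":
    model = BigramModel(CORPUS)
    ocr_output = "please send tbe repart today".split()
    print(" ".join(correct_text(ocr_output, model, CONFUSION)))
```

Scoring the whole sentence for each candidate, rather than the candidate in isolation, is what makes the selection depend on the coherence of the surrounding text. A trained neural language model would take the place of `log_prob` without changing the overall flow.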

Description

Technical Field

[0001] The embodiments of the present application relate to the field of computer technologies, and in particular to a text information processing method and device.

Background

[0002] With the development of computer technology, OCR (Optical Character Recognition) technology is widely used in various fields. OCR text recognition technology converts image information into text information, after which the machine uses natural language processing technology to perform semantic analysis and intent recognition on the text.

[0003] At present, OCR text recognition technology is mature for printed text, where accuracy can exceed 90%. For handwritten text, however, existing OCR text recognition technology has low accuracy.

[0004] In the prior art, the correction of the recognition result obtained by recognizing handwritten text with OCR technology ...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06V10/22, G06F40/232
CPC: G06V10/22
Inventor: 冯博豪, 陈兴波, 张小帅, 杨舰
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD