Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Complex image and text sequence identification method

A text sequence and image technology, applied in the field of complex image text sequence recognition, can solve the problems of character size and gap difference, loss of available information in pictures, difficulty in obtaining recognition results, etc., and achieve the effect of avoiding linear growth

Inactive Publication Date: 2016-06-15
成都数联铭品科技有限公司
View PDF3 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Conventional OCR methods include image segmentation, feature extraction, single character recognition and other processing processes, wherein image segmentation includes a large number of image preprocessing processes, such as tilt correction, background denoising, and single character extraction; these processes The process is not only cumbersome and time-consuming, but also may cause the image to lose a lot of available information; and when the image to be recognized contains a string of multiple characters, the traditional OCR method needs to divide the original string into several small images containing a single character for separation. Recognition, and the most commonly used method for text segmentation is the projection method, that is, after binarizing the image text, find the dividing line between the two texts through vertical projection, and separate the text according to the dividing line. This method The main problem is: when the image text to be recognized contains background noise, character distortion, character bonding, etc., it will cause difficulty in text segmentation
Especially when the image text to be recognized is mixed with Chinese characters, letters, numbers, and symbols with left and right radicals, or the image text to be recognized is mixed with half-width and full-width characters, the size and gap of the characters are different due to the difference in format. The single character in the image text to be recognized cannot be accurately segmented through the simple projection method
Once there is a problem with segmentation, it is difficult to obtain accurate recognition results.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Complex image and text sequence identification method
  • Complex image and text sequence identification method
  • Complex image and text sequence identification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0063] Such as Figure 6 As shown, when the character string wrapped in the image text sequence to be recognized is: "A company's financial situation in 2015: ", the recognition results of the recursive neural network at each moment of the method of the present invention are successively: "A company 1 company 2 companies 1 company 2 2015 1 year 2 degree 1 degree 2 white 3 white 4 bei 3 bei 4 business 1 business 2 忄 1 2 冫 1 2: ", after integrating the above recognition results, the final recognition result is: "Company A 2015 Financial situation for the year:". It can be seen that the method of the present invention realizes rapid recognition of complex image text sequences mixed with left and right structure Chinese characters, numbers, letters or punctuation marks without character segmentation.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the image and text identification field, and specifically relates to a complex image and text sequence identification method. The complex image and text sequence identification method includes the steps: utilizing a sliding sampling box to perform sliding sampling on an image and text sequence to be identified; extracting the characteristics from the sub images obtained through sampling by means of a CNN and outputting the characteristics to an RNN, wherein the RNN successively identifies the front part of each character, the back part of each character, numbers, letters, punctuation, or blank according to the input signal; and successively recording and integrating the identification results for the RNN at each moment and acquiring the complete identification result, wherein the input signal for each moment for the RNN also includes the output signal of a recursion neural network for the last moment. The complex image and text sequence identification method can overcome the cutting problem of a complex image and text sequence, and can significantly improve the identification efficiency and accuracy for images and text.

Description

technical field [0001] The invention relates to the field of image character recognition, in particular to a complex image character sequence recognition method. Background technique [0002] With the development of society, there is a large demand for the digitization of ancient books, documents, bills, business cards and other paper media. The digitization here is not limited to "photographic" using scanners or cameras, but more importantly, the Files are converted into readable and editable documents for storage. This process requires image text recognition on scanned pictures, and traditional image text recognition is optical text recognition (OCR). [0003] Conventional OCR methods include image segmentation, feature extraction, single character recognition and other processing processes, wherein image segmentation includes a large number of image preprocessing processes, such as tilt correction, background denoising, and single character extraction; these processes Th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/34
CPCG06V30/158G06V30/153
Inventor 刘世林何宏靖陈炳章吴雨浓姚佳
Owner 成都数联铭品科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products