Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for detecting and identifying continuous segmented texts in image

A text detection and recognition method technology, applied in character and pattern recognition, instruments, computer parts and other directions, can solve problems such as low recognition accuracy, difficult font segmentation, and difficult positioning.

Inactive Publication Date: 2019-11-01
SHANGHAI MARITIME UNIVERSITY
View PDF10 Cites 34 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The present invention proposes a continuous text detection and recognition method in an image based on fusion processing of SegLink and Attention-based CRNN. The convolutional neural network performs feature extraction, and the obtained feature sequence is input into the bidirectional long-term short-term memory cyclic neural network to complete the mapping from the feature sequence to the character sequence, and then the character sequence is CTC transcribed to obtain the final recognition result; thus Solve the problems of text detection in the digitization of OCR information documents, especially the low accuracy of oblique text detection, difficult positioning, difficult font segmentation, and low recognition accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for detecting and identifying continuous segmented texts in image
  • Method for detecting and identifying continuous segmented texts in image
  • Method for detecting and identifying continuous segmented texts in image

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0082] To make the purpose, technical solutions and advantages of the embodiments of the present invention more clear, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. The present invention includes, but is not limited to, the following examples.

[0083] Such as figure 1 Shown is the overall implementation flow chart of the continuous segmented text detection and recognition method in the image based on SegLink and Attention-based CRNN fusion processing of the present invention, and the specific steps are as follows:

[0084] Step 1. Select and make data sets: Use CycleGAN to make continuous text image data sets of various fonts and divide the data sets into training set, verification set and test set.

[0085] Described step 1 comprises the following process: make and generate the single-word data set pictures of different ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for detecting and identifying continuous segmented texts in an image based on SegLink and Attention-based CRNN fusion processing, and belongs to the technical field ofoptical character recognition, aiming to solve the problems of low text detection accuracy, particularly low inclined text detection accuracy, difficulty in positioning, difficulty in font segmentation and low recognition accuracy in OCR information document digitization. The method includes the steps: establishing a SegLink + CRNN model based on a Tensorflow deep learning framework, detecting text lines in an image through a SegLink network; segmenting the segmented text according to lines; extracting single-line text features through a densely connected convolutional neural network; processing the sequence information of the context in the text by the bidirectional recurrent neural network, and adopting the CTC decoding algorithm to avoid the problem of single word segmentation, and eliminate the influence of the single word segmentation link on the recognition accuracy; and further fusing an Attention mechanism during CTC transcription to improve the recognition accuracy for the text sequence characteristics. The method is applicable to printed form and handwritten form recognition, and can be applied to recognition of multilingual texts such as English and Chinese.

Description

technical field [0001] The invention belongs to the technical field of computer vision, target detection and optical character recognition, relates to text detection and recognition in information documents, and in particular to a continuous text detection and recognition method in images based on fusion processing of SegLink and Attention-based CRNN . Background technique [0002] Text detection and recognition in natural scenes is currently one of the most concerned areas in computer vision, which includes two subtasks: text detection and text recognition. [0003] Most of the current text detection methods are based on a bottom-up process, detecting from low-level features such as simple characters and strokes, and then performing non-text filtering, text line construction, and text line verification. The accuracy of this method largely depends on the results of character detection, and the errors of the detection results will continue to accumulate in the bottom-up proc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06K9/32G06K9/62
CPCG06V30/40G06V30/1478G06V20/62G06F18/24G06F18/214
Inventor 刘晋龚沛朱
Owner SHANGHAI MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products