Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text recognition method and device, electronic equipment and readable storage medium

A text recognition and to-be-recognized technology, applied in the field of computer vision, can solve the problems of underutilized dense block output features, underutilized hierarchical information, and slow calculation speed of convolution operations, and achieves enhanced representation learning. Capability, simple structure, the effect of improving accuracy

Pending Publication Date: 2022-01-07
SUZHOU UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since the pooling operation cannot be learned and may lose important information, the accuracy of the entire text recognition is not high
In addition, the traditional convolution operation is also computationally slow, so it should be replaced by a more efficient convolution operation
In addition, although dense blocks in DenseNet have good mobility and coupling of internal features, dense blocks and transition blocks in DenseNet are simply stacked together
In this way, the output features of each dense block are not well utilized, e.g., the hierarchical information of different layers is not fully utilized

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text recognition method and device, electronic equipment and readable storage medium
  • Text recognition method and device, electronic equipment and readable storage medium
  • Text recognition method and device, electronic equipment and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0077] The above embodiment does not limit the process of how to use the text recognition network model to perform text recognition on the input image data. This embodiment also provides an implementation manner, which may include the following content:

[0078] Determine the target extraction method according to the occupied space capacity of the text data to be recognized, and extract the image features of the text data to be recognized by the target extraction method; input the image features to the dense fusion block to obtain the target image features; input the target image features to the transcription layer, Get the text recognition result.

[0079] Theoretically, this embodiment uses a deeper network and a larger-sized input image containing more feature information to obtain better performance. However, due to the limitation of computing power and resources, it is necessary to use a downsampling strategy for large-scale features. In fact, this embodiment can enlarge...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text recognition method and device, electronic equipment and a readable storage medium. The method comprises the following steps: constructing a text recognition network model based on a convolutional neural network in advance; the text recognition network model comprises a feature extraction layer, a dense fusion block and a transcription layer. The dense fusion block comprises a first dense block, a second dense block and a convolution operation layer, the first dense block is connected with the second dense block, and the dense fusion block is used for performing fusion processing on the image features extracted by the feature extraction layer, the output features of the first dense block and the output features of the second dense block in different layers through connection operation; the transcription layer includes a classifier and a loss function layer. The accuracy and the recognition efficiency of text recognition can be effectively improved. And inputting the to-be-recognized text data into the trained text recognition network model to obtain a text recognition result, thereby effectively improving the accuracy and recognition efficiency of text recognition in the image data.

Description

technical field [0001] The present application relates to the technical field of computer vision, in particular to a text recognition method, device, electronic equipment and readable storage medium. Background technique [0002] Optical character recognition (OCR) is an important topic in the field of pattern recognition, and its purpose is to recognize text in images, which includes characters and numbers. Although OCR has been extensively studied for decades, accurately recognizing text in natural images is still a challenging task due to the complexity of background and content in images. In fact, characters and / or numbers may appear differently in different images due to variations in style, font, resolution or lighting. In recent years, with breakthroughs in the fields of computer vision and deep learning, end-to-end text recognition frameworks have been developed, which include complex two-step pipelines. The first step is to detect regions of text in the image, and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/047G06N3/045G06F18/253
Inventor 张召赵随意张莉王邦军
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products