Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text detection method based on channel grouping attention mechanism

A text detection and attention technology, applied in the field of text detection, can solve the problems that the text detection accuracy needs to be improved and the text aspect ratio changes drastically, so as to improve the text offset prediction and candidate frame regression effect, improve the text detection accuracy, Optimizing the effect of semantic information

Active Publication Date: 2021-03-05
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF20 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This type of method treats the text as an object and has good performance, but the aspect ratio of the text itself changes drastically, and the detection accuracy of the text may have problems such as skew and distortion.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text detection method based on channel grouping attention mechanism
  • Text detection method based on channel grouping attention mechanism
  • Text detection method based on channel grouping attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The method of attention module to generate attention heat map is as follows: figure 1 As shown, first perform 1x1 global pooling on the predicted feature spectrum of a quarter resolution of the image to be detected (select the feature spectrum output from the convolutional layer conv4-3), and use convolution conv to compress the number of channels to the original A quarter of the number of channels is activated using the relu activation function to realize the encoding part of the encoding and decoding model. Then use the group convolution group-conv to restore the number of channels to the original number of channels, use the sigmoid function to activate, and perform upsampling to restore the reshape to the original scale size to obtain an attention activation heat map that is consistent with the original predicted feature spectrum.

[0021] The embodiment is implemented on the TITAN X server, such as figure 2 It mainly includes several steps: the backbone network ex...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a text detection method based on a channel grouping attention mechanism, and aims to solve the problem that targets of different scales adopt preset boxes of different scales ona characteristic spectrum, and a higher-resolution prediction characteristic spectrum with more spatial information before fusion cannot well represent text characteristics and is poorer in effect when being directly input into a detection head. After experiments, the applicant finds that an attention module is introduced into a prediction characteristic spectrum with a quarter resolution, the text offset prediction and candidate box regression effects of the TextBoxes_plusplus algorithm under the quarter scale are improved, and therefore the method adapts to the changeable length-width ratioof the text, missing detection and false detection are reduced, and the characteristic robustness is well enhanced. Spatial information is reserved, and small target text detection precision is improved. In addition, convolution and grouping convolution are used for replacing full connection in encoding and decoding of the attention mechanism, the parameter quantity and the calculation complexityare reduced, and compared with attention of a common channel, the attention mechanism is efficiently achieved under the condition of performance approximation.

Description

technical field [0001] The invention relates to text detection technology, in particular to channel attention in text detection. Background technique [0002] OCR (Optical Character Recignition) optical character recognition, with the development of digital multimedia technology, now generally refers to image text recognition, and text information needs to be extracted in various visual data analysis and applications. Text detection aims to find areas where text is located. The purpose of character recognition is to identify the character category of the region of interest. Reading and recognizing text on computer pictures, inputting data such as periodical collection forms into computers for processing and storage, all support the rapid development of text detection and recognition methods. Further combined with emerging directions such as advertising recommendation, material classification, and video review, OCR is full of enduring vitality. The current mainstream OCR t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/20G06K9/46G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V10/22G06V10/44G06N3/045G06F18/253
Inventor 李宏亮李泊琦戚耀钟子涵
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products