Text string recognition method and device and electronic equipment
A text recognition and text technology, which is applied in the field of recognition processing, can solve the problems of lower recognition accuracy of text strings, achieve the effect of reducing the amount of calculation and realizing simplicity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0067] In Embodiment 1, the trained text recognition model may include a first spatial attention module. Here, the first spatial attention module is just named for convenience of description, and is not used for limitation.
[0068] In Embodiment 1, the first spatial attention module is used to obtain channel feature maps of S*S channels based on the trained text string position supervision information and based on the input image feature maps. Among them, the S*S channels are the channels configured by the first spatial attention module.
[0069] Based on this, determining the channel feature maps of S*S channels according to the image feature maps of the target image in the above step 101 may include: inputting the image feature maps of the target images into the first spatial attention module in the trained text recognition model To get the channel feature map of S*S channels.
[0070] The following describes how the first spatial attention module obtains channel feature ...
Embodiment 2
[0081] The above describes how the first spatial attention module in Embodiment 1 learns the channel feature maps of S*S channels based on the input image feature maps. Embodiment 2 is described below:
[0082] Example 2:
[0083] In Embodiment 2, the trained text recognition model may include a second spatial attention module. Here, the second spatial attention module is just named for convenience of description, and is not used for limitation.
[0084] In Embodiment 2, the second spatial attention module is used to determine channel feature maps of S*S channels based on the trained word position supervision information and based on the input image feature maps. In Embodiment 2, the channel feature map of each channel includes L word position segmentation maps, and each word position segmentation map is used to characterize the word position. In the second embodiment, the word positions corresponding to the L word position segmentation maps included in each channel feature...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com