Text positioning box correction method and system based on convolutional neural network
A convolutional neural network and text positioning technology, which is applied in biological neural network models, neural architectures, instruments, etc., can solve the problems of unframed text, insufficient precision, and text included, so as to reduce uniform size and reduce the amount of calculation. , targeted effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0063] Such as figure 1 As shown, the text positioning frame correction method based on the convolutional neural network includes the following steps:
[0064] S1: Obtain multiple text images to be located.
[0065] S2: Input the acquired multiple text images to be positioned into the text detection model, the text detection model performs rough text positioning on the text images to be positioned, and outputs the positioned text images and the text positioning frame to be corrected The coordinate values of the four upper and lower endpoints at the left and right ends of .
[0066] Such as figure 2 As shown, this picture is a text image, that is, an image with text. The rectangular frame is the text positioning frame obtained by the text detection model through rough positioning. It can be seen that the upper part of the text positioning frame does not include the upper part of all the text, which will lead to subsequent recognition errors. The corner points marked by t...
Embodiment 2
[0115] The text positioning frame correction system based on the convolutional neural network includes a memory and a processor, the memory stores instructions, and the instructions are suitable for being loaded by the processor and performing the following steps:
[0116] S1: Obtain multiple text images to be positioned.
[0117] S2: Input the acquired multiple text images to be positioned into the text detection model, the text detection model performs rough text positioning on the text images to be positioned, and outputs the positioned text images and the text positioning frame to be corrected The coordinate values of the four upper and lower endpoints at the left and right ends of .
[0118] S3: Establish a text positioning frame correction model, and train the text positioning frame correction model.
[0119] S4: After cropping and scaling the text positioning frame to be corrected and its corresponding image content, input the trained text positioning frame correctio...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com