Text paragraph structure restoration method, device, equipment and computer storage medium

A text and paragraph technology, applied in the field of computer-readable storage media and text paragraph structure restoration

Active Publication Date: 2021-04-06
ONE CONNECT SMART TECH CO LTD SHENZHEN
Cites: 7; Cited by: 0

AI Technical Summary

Problems solved by technology

[0003] The main purpose of the present invention is to provide a text paragraph structure restoration method, device, equipment and computer storage medium, with the aim of solving the technical problem of how to improve the accuracy of text paragraph structure restoration.

Examples

Embodiment Construction

[0040] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0041] Figure 1 is a structural schematic diagram of a text paragraph structure restoration device in the hardware operating environment involved in the embodiments of the present invention.

[0042] As shown in Figure 1, the device for restoring text paragraph structure may include a processor 1001 (such as a CPU), a network interface 1004, a user interface 1003, a memory 1005, and a communication bus 1002. The communication bus 1002 provides connection and communication between these components. The user interface 1003 may include a display screen (Display) and an input unit such as a keyboard (Keyboard); optionally, the user interface 1003 may also include a standard wired interface and a wireless interface. Optionally, the network interface 1004...

Abstract

The present invention relates to the technical field of image processing, and discloses a text paragraph structure restoration method, device, equipment and computer storage medium. The method includes: recognizing a target picture, and determining, based on the recognition result, all text boxes in the target picture and the position of each text box; sorting the text boxes according to their positions, and, based on the sorting result, inputting the text features of each text box into a preset deep learning model for training; and, based on the training result, merging the text boxes to obtain all the text paragraphs corresponding to the target picture. The invention improves the accuracy of text paragraph structure restoration.
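The sort-then-merge flow described in the abstract can be illustrated with a minimal Python sketch. Everything named here is a hypothetical stand-in: the `TextBox` type, the `sort_boxes` and `merge_boxes` functions, and in particular `gap_rule`, which is a simple vertical-gap heuristic used in place of the preset deep learning model that the patent relies on. It is meant only to show where a learned merge decision would plug in, not to reproduce the patented method.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class TextBox:
    """Hypothetical text box as produced by an OCR step."""
    text: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height


def sort_boxes(boxes: List[TextBox]) -> List[TextBox]:
    # Assumed reading order: top to bottom, then left to right.
    return sorted(boxes, key=lambda b: (b.y, b.x))


def merge_boxes(
    boxes: List[TextBox],
    same_paragraph: Callable[[TextBox, TextBox], bool],
) -> List[str]:
    """Group sorted boxes into paragraphs using a pairwise decision."""
    paragraphs: List[List[TextBox]] = []
    for box in sort_boxes(boxes):
        if paragraphs and same_paragraph(paragraphs[-1][-1], box):
            paragraphs[-1].append(box)  # continue the current paragraph
        else:
            paragraphs.append([box])    # start a new paragraph
    return [" ".join(b.text for b in p) for p in paragraphs]


def gap_rule(prev: TextBox, cur: TextBox) -> bool:
    # Stand-in for the model's decision (an assumption, not the patent's
    # method): merge when the vertical gap is under half a line height.
    gap = cur.y - (prev.y + prev.h)
    return gap < 0.5 * prev.h


boxes = [
    TextBox("second line of the first paragraph.", 10, 32, 200, 20),
    TextBox("First paragraph starts here,", 10, 10, 200, 20),
    TextBox("Second paragraph.", 10, 80, 120, 20),
]
paragraphs = merge_boxes(boxes, gap_rule)
for p in paragraphs:
    print(p)
```

Passing the merge decision in as a callable keeps the sorting and grouping logic independent of how the decision is made, so a trained classifier over text features could replace `gap_rule` without changing `merge_boxes`.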

Description

Technical field

[0001] The present invention relates to the technical field of image processing, in particular to a text paragraph structure restoration method, device, equipment and computer-readable storage medium.

Background technique

[0002] In the process of digitizing paper documents, the documents must be entered while retaining their original format. Current detection and recognition methods based on text lines cannot directly obtain text paragraph information. There are currently two approaches. The first is top-down: the layout of the entire page is analyzed first, paragraphs are segmented, and then the text lines within each paragraph area are detected and recognized. This type of method cannot capture local text detail features during layout analysis and uses only image information without text content information, so its accuracy is not high. The second is bottom-up: text lines are detected first and then merged to obtain paragraphs....

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06K9/00, G06F40/166
CPC: G06F40/166, G06V30/412, G06V30/10
Inventor: 高超, 徐国强
Owner: ONE CONNECT SMART TECH CO LTD SHENZHEN