Method and system for automatically structuring key information of document image
An automatic key-information structuring technology, applied in the field of character recognition, that addresses the restriction on input file types and the inability to produce fully automatic structured output, and achieves the effects of reducing interference, improving user experience, and simplifying the operation process.
Examples
Embodiment 1
[0190] According to a specific embodiment of the present invention, the method for automatically structuring key information of a document image according to the present invention will be described in detail with reference to the accompanying drawings.
[0191] The invention provides a method for automatically structuring key information of a document image, comprising the following steps:
[0192] S100: Obtain sample image data of the document;
[0193] S300: Perform direction correction and tilt correction preprocessing on the sample image;
[0194] S400: Recognize the text in the sample image using optical character recognition, and organize it into text line by line;
[0195] S500: Preprocess the text to obtain text data in units of text blocks;
[0196] S600: Combine the text data in units of text blocks with the model dictionary of the text segmentation model, convert each text block into a number sequence, and obtain the mask sequence, segment sequence and lab...
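The conversion described in step S600 can be illustrated with a minimal sketch, assuming a BERT-style input encoding. The source does not name the model family, so the dictionary contents, the special tokens (`[PAD]`, `[UNK]`, `[CLS]`, `[SEP]`), the character-level lookup, the `max_len` value, and the function name `encode_block` are all hypothetical. Only the number, mask, and segment sequences are shown; the truncated remainder of the step is not covered.

```python
from typing import Dict, List, Tuple

# Hypothetical model dictionary: token -> integer ID.
# A real dictionary would be loaded from the text segmentation model's files.
VOCAB: Dict[str, int] = {
    "[PAD]": 0, "[UNK]": 1, "[CLS]": 2, "[SEP]": 3,
    "i": 4, "n": 5, "v": 6, "o": 7, "c": 8, "e": 9,
}


def encode_block(
    block: str, vocab: Dict[str, int], max_len: int = 12
) -> Tuple[List[int], List[int], List[int]]:
    """Convert one text block into (number sequence, mask sequence, segment sequence)."""
    # Character-level lookup (an assumption); unknown characters map to [UNK].
    # Truncate so that [CLS] and [SEP] still fit within max_len.
    body = [vocab.get(ch, vocab["[UNK]"]) for ch in block.lower()][: max_len - 2]
    ids = [vocab["[CLS]"]] + body + [vocab["[SEP]"]]
    mask = [1] * len(ids)            # 1 marks real tokens
    pad = max_len - len(ids)
    ids += [vocab["[PAD]"]] * pad    # pad the number sequence to a fixed length
    mask += [0] * pad                # 0 marks padding positions
    segments = [0] * max_len         # single-segment input: all zeros
    return ids, mask, segments


ids, mask, segments = encode_block("Invoice", VOCAB)
print(ids)
print(mask)
print(segments)
```

Padding every block to the same length lets blocks be batched together, and the mask sequence tells the model which positions to ignore.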
Embodiment 2
[0201] According to a specific embodiment of the present invention, the method for automatically structuring key information of a document image according to the present invention will be described in detail with reference to the accompanying drawings.
[0202] The invention provides a method for automatically structuring key information of a document image, comprising the following steps:
[0203] S100: Obtain sample image data of the document; step S100 includes the following steps:
[0204] S101: Read file data of files in multiple file formats;
[0205] S102: By setting the ID of each page of file data in the file, the file is split into single pages, and then each single page is converted into image data.
[0206] S200: Load a general text recognition model, a text segmentation model, a text classification model and a text structured extraction model, together with their configuration files, used respectively for text recognition, text segmentation, text classification and text ...
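Steps S101 and S102 above can be sketched as follows. This is an illustrative outline only: the `PageImage` type, the `"<file>#<n>"` ID scheme, and the `render` callable are hypothetical and not taken from the source, and the demo uses a stand-in renderer in place of a real rasterizer for each supported file format.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class PageImage:
    page_id: str  # ID assigned to the page (hypothetical "<file>#<n>" scheme)
    image: bytes  # image data produced for that single page


def split_into_page_images(
    file_name: str,
    pages: Sequence[bytes],
    render: Callable[[bytes], bytes],
) -> List[PageImage]:
    """Assign an ID to every page, then convert each single page to image data."""
    result: List[PageImage] = []
    for number, page in enumerate(pages, start=1):
        page_id = f"{file_name}#{number}"
        result.append(PageImage(page_id=page_id, image=render(page)))
    return result


def fake_render(page: bytes) -> bytes:
    """Stand-in renderer for the demo; a real one would rasterize the page."""
    return b"IMG:" + page


images = split_into_page_images("contract.pdf", [b"page-a", b"page-b"], fake_render)
for item in images:
    print(item.page_id, item.image)
```

Keeping the page ID next to the image data means results from later steps can be traced back to the page and file they came from.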
Embodiment 3
[0215] According to a specific embodiment of the present invention, the method for automatically structuring key information of a document image according to the present invention will be described in detail with reference to the accompanying drawings.
[0216] The invention provides a method for automatically structuring key information of a document image, comprising the following steps:
[0217] S100: Obtain sample image data of the document; step S100 includes the following steps:
[0218] S101: Read file data of files in multiple file formats;
[0219] S102: By setting the ID of each page of file data in the file, the file is split into single pages, and then each single page is converted into image data.
[0220] S200: Load a general text recognition model, a text segmentation model, a text classification model and a text structured extraction model, together with their configuration files, used respectively for text recognition, text segmentation, text classification and text ...
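One way to picture the loading in step S200 is a small registry that reads one configuration file per model role. This is a sketch under stated assumptions: the role names, the `<role>.json` file naming, the JSON format, and the `LoadedModel` type are hypothetical, and loading of actual model weights is omitted.

```python
import json
import tempfile
from dataclasses import dataclass
from pathlib import Path
from typing import Dict

# Short labels (hypothetical) for the four models named in step S200.
ROLES = ("recognition", "segmentation", "classification", "extraction")


@dataclass
class LoadedModel:
    role: str
    config: dict  # parsed configuration file; real weights would be loaded alongside


def load_models(config_dir: Path) -> Dict[str, LoadedModel]:
    """Load one configuration file per role and fail early if any is missing."""
    models: Dict[str, LoadedModel] = {}
    for role in ROLES:
        path = config_dir / f"{role}.json"  # hypothetical naming scheme
        if not path.is_file():
            raise FileNotFoundError(f"missing configuration for {role}: {path}")
        config = json.loads(path.read_text(encoding="utf-8"))
        models[role] = LoadedModel(role=role, config=config)
    return models


# Demo with throwaway configuration files.
with tempfile.TemporaryDirectory() as tmp:
    for role in ROLES:
        (Path(tmp) / f"{role}.json").write_text(
            json.dumps({"name": role}), encoding="utf-8"
        )
    models = load_models(Path(tmp))

print(sorted(models))
```

Loading all four models once, before any document is processed, avoids reloading them per page and surfaces a missing configuration file before processing starts.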