Method and system for automatically structuring key information of document image

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A key information and automatic structure technology, which is applied in the field of character recognition, can solve the problems of input file type limitation and the inability to realize fully automatic structured output, etc., and achieve the effects of reducing interference, improving user experience, and simplifying the operation process

Active Publication Date: 2022-04-12

北京译图智讯科技有限公司

View PDF4 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] 1. The optical character recognition processing method can only achieve structured output for fixed types of text content, and cannot achieve fully automatic structured output;

[0007] 2. There are restrictions on the input file type, which needs to be a preset file type

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0190] According to a specific embodiment of the present invention, the method for automatically structuring key information of a document image according to the present invention will be described in detail with reference to the accompanying drawings.

[0191] The invention provides a method for automatically structuring key information of a document image, comprising the following steps:

[0192] S100: Obtain sample image data of the document;

[0193] S300: Perform direction correction and tilt correction preprocessing on the sample image;

[0194] S400: Recognize the text in the sample image by using optical character recognition, and organize it into a text form by line;

[0195] S500: Preprocessing the text to obtain text data in units of text blocks;

[0196] S600: combine the file data in units of text blocks with the model dictionary of the text segmentation model, convert each text block into a number sequence, and obtain the mask sequence, segment sequence and lab...

Embodiment 2

[0201] According to a specific embodiment of the present invention, the method for automatically structuring key information of a document image according to the present invention will be described in detail with reference to the accompanying drawings.

[0202] The invention provides a method for automatically structuring key information of a document image, comprising the following steps:

[0203] S100: Obtain sample image data of the document; step S100 includes the following steps:

[0204] S101: Read file data of files in multiple file formats;

[0205] S102: By setting the ID of each page of file data in the file, the file is split into single pages, and then each single page is converted into image data.

[0206] S200, loading a general text recognition model, a text segmentation model, a text classification model and a text structured extraction model and their configuration files, respectively used for text recognition, text segmentation, text classification and text ...

Embodiment 3

[0215] According to a specific embodiment of the present invention, the method for automatically structuring key information of a document image according to the present invention will be described in detail with reference to the accompanying drawings.

[0216] The invention provides a method for automatically structuring key information of a document image, comprising the following steps:

[0217] S100: Obtain sample image data of the document; step S100 includes the following steps:

[0218] S101: Read file data of files in multiple file formats;

[0219] S102: By setting the ID of each page of file data in the file, the file is split into single pages, and then each single page is converted into image data.

[0220] S200, loading a general text recognition model, a text segmentation model, a text classification model and a text structured extraction model and their configuration files, respectively used for text recognition, text segmentation, text classification and text ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a document image key information automatic structuring method and system, and belongs to the technical field of character recognition. The method comprises the following steps of: recognizing characters in a file by adopting optical characters, sorting the characters into text blocks, segmenting the text blocks through a text segmentation model and a text segmentation model dictionary, classifying the text blocks through a text classification model and a text classification model dictionary, and finally predicting the text blocks through a prediction model and a prediction model dictionary. Extracting key value pair data conforming to the rule according to the prediction result; and carrying out preset format processing on the extracted structured data and then displaying the processed structured data. According to the method, any file type can be identified, the structured identification method of the automatic structured output result is achieved, the method is suitable for most common voucher reports in various styles such as list type and table type, the method can adapt to complex scenes of various voucher reports, automatic structured output is completed in a unified mode, and a user does not need to configure and adjust the method.

Description

technical field [0001] The invention relates to the technical field of character recognition, in particular to a method and system for automatically structuring key information of document images. Background technique [0002] Computer text recognition, commonly known as optical character recognition, the English full name is Optical Character Recognition (OCR for short), which is a technology that uses optical technology and computer technology to extract the text on the drawing in text form and convert it into a format that humans can understand. In the era of information society, a large amount of bills, forms, and certificate data are generated every day. These data need to be digitized and need to be extracted and entered using optical character recognition technology. [0003] With the development of the industry and the maturity of technology, optical character recognition has been applied to many industries, such as sorting and express delivery in the logistics field...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06F16/33G06F16/31G06F16/35G06F40/242G06F40/279G06V30/413G06V30/146G06V30/19G06K9/62G06N3/04G06N3/08

Inventor 王燚王伟饶顶锋陶坚坚刘伟

Owner 北京译图智讯科技有限公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Method and system for automatically structuring key information of document image

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology