Table data extraction method, device and equipment and computer storage medium

A table data extraction technology, applied in the fields of computer components, computing, image data processing, etc.

Pending Publication Date: 2021-02-26
WEBANK (CHINA)

AI Technical Summary

Problems solved by technology

[0004] The embodiment of the present application provides a form data extraction method, device, electronic equipment and co


Examples

Embodiment Construction

[0081] In the related art, data can be extracted from a table image by using an image morphology transformation method. Figure 1 is a schematic flow chart of extracting table image data in the related art. Referring to Figure 1, the process can include:

[0082] Step 101: Obtain the table image.

[0083] For example, the table image may be an image of a financial statement, that is, an accounting statement reflecting the capital and profit status of an enterprise or a budgetary unit over a certain period.

[0084] Step 102: Perform table line detection on the table image to obtain table lines of the table image.

[0085] Step 103: Segment cells based on the table lines of the table image.

[0086] Step 104: Use optical character recognition (OCR) to identify the content in each cell.

[0087] Step 105: Output the content of each cell in the form of text.
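
Steps 101 to 105 can be illustrated with a minimal Python sketch. It is not the implementation described in the application: the image is assumed to be an already binarized grid of 0/1 pixels, line detection is approximated by keeping rows and columns that contain a long run of ink pixels (roughly what a morphological opening with a long, thin structuring element retains), and the `recognize` callback is a hypothetical stand-in for a real OCR engine. All function names and the `min_run` parameter are illustrative assumptions.

```python
from typing import Callable, List, Tuple

Image = List[List[int]]          # 1 = ink pixel, 0 = background (assumed binarized)
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), end-exclusive


def long_run_rows(img: Image, min_run: int) -> List[int]:
    """Indices of rows that contain at least min_run consecutive ink pixels."""
    hits = []
    for y, row in enumerate(img):
        run = best = 0
        for px in row:
            run = run + 1 if px else 0
            best = max(best, run)
        if best >= min_run:
            hits.append(y)
    return hits


def detect_lines(img: Image, min_run: int) -> Tuple[List[int], List[int]]:
    """Step 102: y positions of horizontal lines and x positions of vertical lines."""
    transposed = [list(col) for col in zip(*img)]
    return long_run_rows(img, min_run), long_run_rows(transposed, min_run)


def segment_cells(ys: List[int], xs: List[int]) -> List[Box]:
    """Step 103: one box per non-empty gap between neighbouring line positions."""
    return [
        (y0 + 1, x0 + 1, y1, x1)
        for y0, y1 in zip(ys, ys[1:]) if y1 - y0 > 1
        for x0, x1 in zip(xs, xs[1:]) if x1 - x0 > 1
    ]


def extract_text(img: Image, recognize: Callable[[Image], str], min_run: int) -> List[str]:
    """Steps 102 to 105: detect lines, cut out cells, run the recognizer on each cell."""
    ys, xs = detect_lines(img, min_run)
    texts = []
    for top, left, bottom, right in segment_cells(ys, xs):
        crop = [row[left:right] for row in img[top:bottom]]
        texts.append(recognize(crop))  # hypothetical OCR callback (step 104)
    return texts
```

Cells are returned in row-major order, so the list of recognized strings follows the reading order of the table. Note that every cell is passed to the same recognizer, regardless of whether it holds a title or data.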

[0088] In the related art, the scheme of extracting the ...

Abstract

The embodiment of the invention provides a table data extraction method and device, electronic equipment and a computer storage medium. The method comprises the steps of: obtaining a table image; detecting the table lines of the table image to obtain a detection result; dividing the table image into a plurality of cells according to the detection result; dividing the plurality of cells into at least one title cell and at least one data cell according to a pre-acquired title image library; determining a title text in the at least one title cell and a data text in the at least one data cell; and obtaining at least one set of table data according to the title text in the at least one title cell and the data text in the at least one data cell, wherein each set of table data comprises at least one title text and a data text corresponding to the at least one title text.
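
The final step of the abstract, grouping title texts with their corresponding data texts, can be sketched as follows. This is only an illustration under stated assumptions: the excerpt does not say how correspondence between title and data cells is determined, so the sketch assumes a simple layout in which each title cell heads a column and every data cell in that column belongs to it. The `Cell` class, its fields, and `build_table_data` are hypothetical names; the `is_title` flag stands for the result of matching a cell against the title image library.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Cell:
    row: int
    col: int
    text: str       # text already recognized in the cell
    is_title: bool  # assumed outcome of matching against the title image library


def build_table_data(cells: List[Cell]) -> List[Dict[str, str]]:
    """Group data texts by row, keyed by the title text of their column.

    Assumption: each title cell heads one column; data cells in a column
    with no title cell are skipped.
    """
    titles = {c.col: c.text for c in cells if c.is_title}
    rows: Dict[int, Dict[str, str]] = {}
    for c in cells:
        if c.is_title or c.col not in titles:
            continue
        rows.setdefault(c.row, {})[titles[c.col]] = c.text
    return [rows[r] for r in sorted(rows)]
```

Each returned dictionary is one set of table data: it maps at least one title text to the data text found under it in a given row.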

Description

Technical field

[0001] This application relates to data collection technology in financial technology (Fintech), and involves, but is not limited to, a table data extraction method, device, electronic equipment and computer storage medium.

Background technique

[0002] With the development of computer technology, more and more technologies are being applied in the financial field, and the traditional financial industry is gradually transforming into financial technology. However, the security and real-time requirements of the financial industry also place higher demands on technology.

[0003] At present, an image morphological transformation method can be used to identify the table frame of a table image and then segment the cells, thereby extracting the data in the table. However, in the related art, after the cells are obtained, the contents of different types of cells are identified in a uniform way, which results in messy identified content, which...

Application Information

IPC(8): G06K9/00, G06K9/20, G06T5/00, G06T7/13
CPC: G06T5/006, G06T7/13, G06T2207/10004, G06T2207/30176, G06V30/412, G06V30/414, G06V10/22
Inventor: 叶树健, 江旻, 杨杨, 徐为凯
Owner: WEBANK (CHINA)