Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Document table analysis method and device

A form and document technology, which is applied in the field of document form analysis methods and devices, can solve the problems of poor accuracy of form analysis and restoration, and achieve the effect of solving high complexity of restoration

Pending Publication Date: 2019-08-09
上海微投股权投资基金管理有限公司
View PDF2 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a method and device for parsing document tables to solve the problem of poor accuracy of parsing and restoring tables in PDF documents in the prior art.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document table analysis method and device
  • Document table analysis method and device
  • Document table analysis method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. The components of the embodiments of the invention generally described and illustrated in the figures herein may be arranged and designed in a variety of different configurations.

[0052]Accordingly, the following detailed description of the embodiments of the invention provided in the accompanying drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments of the invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art wit...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a document table analysis method and device, and relates to the technical field of file analysis. The method comprises the steps of obtaining line segment feature information related to a table to be identified in a non-edited document, wherein the line segment feature information comprises coordinate information of line segments; obtaining horizontal line segment information and vertical line segment information in the to-be-identified table according to the line segment feature information; obtaining initial form information according to the horizontal line segment information and the vertical line segment information in the form to be identified; and generating an editable table corresponding to the table to be identified according to the initial table informationand the coordinate information of the line segments. By obtaining horizontal line segment information and vertical line segment information in a table to be identified through line segment feature information, obtaining initial table information according to the horizontal line segment information and the vertical line segment information, an editable table corresponding to the table to be identified is generated according to the initial table information and the line segment feature information related to the table to be identified. According to the method, the problems of high restoration complexity and poor restoration effect of the composite table and the missing table are effectively solved.

Description

technical field [0001] The present invention relates to the technical field of document parsing, in particular to a document table parsing method and device. Background technique [0002] Portable document format PDF is an electronic file format, which has nothing to do with the operating system platform, that is, PDF files are universal whether they are in Windows, Unix or Apple's Mac OS operating system. This advantage makes the PDF format It has become an ideal document format for electronic document distribution and digital information dissemination on the Internet. PDF files are based on the programming language PostScript image model, which can guarantee accurate colors and accurate printing effects on any printer, that is, PDF will faithfully reproduce every character, color and image of the original. [0003] Usually, when it is necessary to edit and change the table content in the file, the PDF format file needs to be converted into an editable format file, such as...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/24
CPCG06F40/174G06F40/177
Inventor 纪大胜苌奥林张渝洋谢华
Owner 上海微投股权投资基金管理有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products