Method, device, electronic device and storage medium for merging tables across pages of pdf document
A table and cross-page technology, applied in the field of text processing, to achieve high accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0053] figure 1 It is a flow chart of a method for merging tables across pages in a PDF document in an embodiment of the present invention. According to different requirements, the order of the steps in the flowchart can be changed, and some steps can be omitted.
[0054] see figure 1 As shown, the method for merging tables across pages in a PDF document specifically includes the following steps:
[0055] Step S11: Acquire at least two PDF documents containing tables, collect location information and text information of at least one table in each of the PDF documents, and obtain a table data set according to the location information of the tables.
[0056] Specifically, in at least one embodiment of the present invention, collecting the position information and text information of at least one table in each of the PDF documents, and obtaining the table data set according to the position information of the table includes:
[0057] Use the pdfplumber library to parse each of ...
Embodiment 2
[0118] figure 2 It is a structural diagram of an apparatus 30 for merging tables in a PDF document across pages in an embodiment of the present invention.
[0119] In some embodiments, the PDF document cross-page table merging apparatus 30 runs in an electronic device. The PDF document cross-page table merging apparatus 30 may include a plurality of functional modules composed of program code segments. The program codes of each program segment in the PDF document cross-page table merging apparatus 30 may be stored in the memory and executed by at least one processor to perform the PDF document cross-page table merging function.
[0120] In this embodiment, the PDF document cross-page table merging apparatus 30 may be divided into a plurality of functional modules according to the functions performed by the apparatus 30 . see figure 2 As shown, the PDF document cross-page table combining device 30 may include a table data acquisition module 301 , a training data set constr...
Embodiment 3
[0168] image 3 It is a schematic diagram of the electronic device 6 in an embodiment of the present invention.
[0169] The electronic device 6 includes a memory 61 , a processor 62 and computer readable instructions stored in the memory 61 and executable on the processor 62 . When the processor 62 executes the computer-readable instructions, the steps in the above embodiments of the PDF document cross-page table merging method are implemented, for example, figure 1 Steps S11 to S16 shown. Alternatively, when the processor 62 executes the computer-readable instructions, the functions of each module / unit in the above-mentioned embodiment of the apparatus for merging tables in a PDF document across pages are implemented, for example, figure 2 Modules 301 to 306 in .
[0170] Exemplarily, the computer-readable instructions may be divided into one or more modules / units, and the one or more modules / units are stored in the memory 61 and executed by the processor 62 to The pres...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com