Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Acquisition method and device of PDF (portable document format) document directory

An acquisition method and acquisition device technology, which are applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of directory modification and editing difficulties, and achieve the effect of convenient editing and modification.

Inactive Publication Date: 2016-03-30
PEKING UNIV FOUNDER GRP CO LTD +2
View PDF3 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The purpose of the present invention is to provide a method and device for obtaining the catalog of a PDF document, which can solve the problem of difficulty in modifying and editing the catalog in the PDF document in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Acquisition method and device of PDF (portable document format) document directory
  • Acquisition method and device of PDF (portable document format) document directory
  • Acquisition method and device of PDF (portable document format) document directory

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0069] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

[0070] Such as figure 1 As shown, the method for obtaining the PDF document directory of the present invention includes:

[0071] Step 11, analyzing the architecture of the PDF document to obtain the cross-reference table of the PDF document;

[0072] Step 12, searching the cross-reference table to obtain the TRAILER dictionary at the end of the file;

[0073] Step 13, analyzing the TRAILER dictionary at the end of the file to obtain the CATALOG dictionary corresponding to the key value ROOT;

[0074] Step 14, searching the catalog CATALOG dictionary to obtain the catalog of the PDF document.

[0075] The solution of the invention can conveniently, accurately and efficiently extract the table of contents in the PDF document.

[0076] Wherein, th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an acquisition method and a device of a PDF document directory, wherein the method comprises: analyzing the system structure of a PDF document, acquiring the intersection index table of the PDF document; retrieving the intersection index table, obtaining an end-of-file TRAILER dictionary; analyzing the end-of-file TRAILER dictionary, obtaining a directory book CATALOG dictionary corresponding to a key value ROOT; retrieving the directory book CATALOG dictionary, obtaining the directory of the PDF document. The solution of the invention can conveniently, quickly and accurately extract the directory of the PDF document in high efficiency and is convenient for editing and modifying the extracted directory of the PDF document subsequently.

Description

technical field [0001] The invention relates to the field of information extraction, in particular to a method and device for acquiring a PDF document catalog. Background technique [0002] PDF, the full name of PortableDocumentFormat, is "Portable Document Format", which is an electronic document format. This format has nothing to do with the operating platform. It has outstanding cross-platform features and can be used on almost all platforms. This feature makes it the preferred document format for electronic document distribution and digital information dissemination on the Internet. More and more books and documents prefer PDF as the form of electronic publication, such as electronic books, product descriptions, company announcements, and Internet. data, e-mail, etc. The PDF format has become a de facto industry standard for digitizing information. [0003] The PDF format has its distinctive technical characteristics, such as superior cross-platform; it can integrate m...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06F17/21
Inventor 刘利川
Owner PEKING UNIV FOUNDER GRP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products