Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A device and method for recovering pdf files

A document recovery and file technology, which is applied in the field of PDF file processing, can solve the problems of subtracting 1 from the initial offset value, difficult PDF document recovery, and incorrect recovery, etc., and achieves the effect of easy implementation and modular design

Active Publication Date: 2018-12-21
GUANGZHOU HUIRUI SITONG INFORMATION SCI & TECH CO LTD
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

After decryption, the encrypted dictionary object is removed, which causes the content length of some objects to change. For example, the object number of object A is 100 before decryption, but becomes 99 after decryption. The length of the object number changes from 3 to 2, and the total length of object A The length is also correspondingly reduced by 1, which in turn causes the starting offset value of all objects behind object A to be reduced by 1; in addition, object A, as an indirect reference object, may also be referenced by object B, so the above problems will also occur
Since the positions of objects A and B in the PDF file structure are not known, the starting offset value of each object cannot be pre-calculated, but the starting offset value of each object is calculated in real time, thus bringing PDF document recovery belt Difficult to recover PDF documents quickly
In addition, assuming that the length of each object remains unchanged before and after decryption, although the initial offset value of each object after decryption can be quickly calculated, it is still necessary to process the object whose length has changed. If the processing is not correct, the PDF document cannot be restored correctly.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A device and method for recovering pdf files
  • A device and method for recovering pdf files
  • A device and method for recovering pdf files

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The present invention will be further described below in conjunction with specific examples.

[0037] The PDF document restoring device described in the present invention comprises:

[0038] The global key acquisition unit is composed of an encrypted dictionary object content extraction unit, a data index unit, a rainbow table data storage unit, and a key output unit (such as figure 1shown), the global key acquisition unit utilizes the rainbow table to obtain the global key of the document to be decrypted;

[0039] The PDF title and linearization parameter dictionary object generation unit is composed of a title generation unit, a linearization parameter dictionary object generation unit, and a content output unit (such as figure 2 shown), the PDF title and linearization parameter dictionary object generating unit is used to generate the title and linearization parameter dictionary object content;

[0040] The cross-reference table pre-generation unit is composed of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a PDF document recovery device and method; the device comprises the following units: an overall key obtain unit using a rainbow table to obtain the overall key of a to-be-decrypted document; a PDF title and linearization parameter dictionary object generation unit used for generating a title and linearization parameter dictionary object content; a cross-reference table generation unit used for generating a main cross-reference table and a second cross-reference table; a tail dictionary generation unit used for generating a tail dictionary corresponding to the main cross-reference table and second cross-reference table; an object number processing unit used for processing a decrypted object number; an indirect reference object processing unit used for processing object numbers of all reference objects in the present object; an object content processing unit used for determining the type of the object content, and decrypting a flow object. The novel device and method can neglect the initial deviant calculating difficulty caused by content length changes before and after decryption; modularization design is convenient, thus fast recovering PDF documents.

Description

technical field [0001] The invention relates to the field of PDF (Portable Document Fotmat) file processing, in particular to a device and method for restoring a PDF file. Background technique [0002] As we all know, the full name of PDF is Protable Document Format, which can be translated into Portable Document Format and is an electronic file format. This file format has nothing to do with the operating system, that is to say, PDF files are universal no matter whether they are Windows, Uinx or Apple's Mac OS operating systems. Ideal document format for information dissemination. More and more e-books, product descriptions, e-mails, and network materials start to use PDF format files. [0003] The PDF file structure mainly consists of header, linearization parameter dictionary, second cross-reference table and corresponding tail dictionary, object content, main cross-reference table and corresponding tail dictionary. [0004] From 1993 to now, the PDF specification has ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F21/60
CPCG06F21/602G06F2221/2107
Inventor 黄晓光
Owner GUANGZHOU HUIRUI SITONG INFORMATION SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products