Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and device for structured document organization

A document structure and document technology, applied in unstructured text data retrieval, text database clustering/classification, instruments, etc., can solve problems such as inability to reflect, difficult to understand, etc., to achieve the effect of easy reading

Active Publication Date: 2018-11-09
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

That is to say, there may be a sequential or hierarchical relationship between document contents, but these relationships cannot be reflected only by the existing document classification system
For users, they can only blindly read each document under a certain category, causing difficulty in understanding

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for structured document organization
  • Method and device for structured document organization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] The ideal document organization method should have a relatively clear hierarchical division. Taking the "Guidelines for Patent Examination" as an example, the document organization structure is as follows:

[0065] Part I Preliminary Review

[0066] Chapter 1 Preliminary Examination of Invention Patents

[0067] 1 Introduction

[0068] 2. Review Principles

[0069] 3. Review procedure

[0070] 3.1 Passed the preliminary examination

[0071] 3.2 Supplement and Correction of Application Documents

[0072] 3.3 Handling of obvious substantive defects

[0073] ...

[0074] 4. Formal review of application documents

[0075] ...

[0076] Chapter II Preliminary Examination of Utility Model Patents

[0077] ...

[0078] Part II Substantive Examination

[0079] ...

[0080] Part III Examination of International Applications Entering the National Phase

[0081] ...

[0082] In some UGC platforms, users often upload some of their own documents for all users to share. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a document structuration organizing method and device. The document structuration organizing method includes the steps of obtaining a theme framework of a hierarchical structure, forming a searching condition through a theme text in the theme framework, carrying out searching in a preset document set with the searching condition, and adding a document into a corresponding theme document set in the theme framework according to the matching condition of the searching result and the searching condition. Compared with the prior art, the technical scheme of the document structuration organizing method and device can be used for automatically building proper classification systems according to different knowledge fields; as the theme framework is built with mature expert knowledge, inner links of classifications can be well reflected, and a user can conveniently read a large number of texts in a systematized mode.

Description

technical field [0001] The invention relates to the field of computer application technology, in particular to a method and device for structured document organization. Background technique [0002] With the development of Internet technology, the amount of information on the Internet has exploded. In order to apply these information better, it is necessary to manage these information data effectively. Among them, document classification (document classification) is currently a widely used management technology. Document classification refers to determining a category for each document in the document collection according to the content or certain attributes of the document. In this way, users can not only browse documents in a specific category conveniently, but also make finding documents easier by limiting the search scope. [0003] However, for massive document resources, even after a certain classification process, there will still be a large number of documents unde...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/35
Inventor 徐兴军
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products