Rearrangement method and system based on document similarity
A similarity and file technology, applied in the field of text similarity calculation and detection, can solve problems such as inability to process oriental languages, unusable, single scope of application, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0046] The present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments.
[0047]The weight ranking method based on file similarity in the embodiment of the present invention is based on the following three basic assumptions:
[0048] (1) Judgment of document similarity by text content: When analyzing and determining document similarity, only the text content in the document is considered and non-text content is ignored.
[0049] (2) Judging the similarity of documents through basic units: In the text content of documents, sentences are used as the basic units for calculating the similarity of documents, that is, the more basic units that are "similar" in two documents, the higher their relative similarity is. high. Further, if multiple basic units in one document are similar to those in other document collections, the higher the similarity between the current document and the current document collection is.
[005...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com