External clustering method and system for large-scale text data
A text data clustering technology, applied in the information field, that addresses problems such as space complexity too large to compute with, and achieves a small space footprint, large capacity, and a novel approach.
Embodiment Construction
[0015] Referring to the accompanying drawings, this embodiment describes an external clustering method and system for large-scale text data. The main steps of the method are: preprocess the input text set to generate an inverted index and feature vectors for the text set; use retrieval technology to retrieve a candidate relationship set for each document; apply the relationship calculation method to the documents that have candidate relationships; sort and output the calculation results that exceed a certain threshold; then, following the sorted results, have the clustering algorithm iteratively merge the text pairs that have a first direct relationship, finally producing the clustering output of the text set. The clustering system designed around the external clustering method for large-scale text data includes a candidate analyzer, a relationship generator, and relationship selection and clustering components, the basic...
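The steps in [0015] can be sketched in a few lines of Python. This is a minimal in-memory illustration, not the patented implementation: whitespace tokenization, term-frequency vectors, cosine similarity as the relationship calculation, a union-find structure for the iterative merge, and the names `preprocess`, `candidate_pairs`, `cosine`, and `cluster` are all assumptions made for the example. It also does not model the external (out-of-memory) storage the method is designed for.

```python
import math
from collections import Counter, defaultdict


def preprocess(docs):
    """Build term-frequency feature vectors and an inverted index (term -> doc ids)."""
    vectors = [Counter(text.lower().split()) for text in docs]
    index = defaultdict(set)
    for doc_id, vec in enumerate(vectors):
        for term in vec:
            index[term].add(doc_id)
    return vectors, index


def candidate_pairs(vectors, index):
    """Retrieve candidate relationships: document pairs sharing at least one term."""
    pairs = set()
    for doc_id, vec in enumerate(vectors):
        for term in vec:
            for other in index[term]:
                if other > doc_id:
                    pairs.add((doc_id, other))
    return pairs


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors (assumed metric)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster(docs, threshold=0.5):
    """Score candidates, keep those above the threshold, sort, then merge iteratively."""
    vectors, index = preprocess(docs)
    scored = [(cosine(vectors[i], vectors[j]), i, j)
              for i, j in candidate_pairs(vectors, index)]
    ranked = sorted((s for s in scored if s[0] > threshold), reverse=True)

    parent = list(range(len(docs)))  # union-find forest, one node per document

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # Merge pairs in descending order of relationship strength.
    for _, i, j in ranked:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            parent[max(root_i, root_j)] = min(root_i, root_j)

    groups = defaultdict(list)
    for doc_id in range(len(docs)):
        groups[find(doc_id)].append(doc_id)
    return sorted(groups.values())
```

For example, `cluster(["apple banana cherry", "apple banana date", "dog cat bird", "dog cat fish", "zebra"])` groups the documents by index as `[[0, 1], [2, 3], [4]]`. Restricting the similarity computation to pairs found through the inverted index avoids scoring every document against every other, which is the part of the pipeline that matters most as the text set grows.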