Hot topic detection method based on weighted LDA and improved Single-Pass clustering algorithm
A technology of hot topics and detection methods, applied in text database clustering/classification, computing, digital data information retrieval, etc., can solve problems such as difficult to deal with massive information processing, redundant and complicated messages, etc., to improve clustering effect, enhance Differentiability, the effect of algorithmic complexity reduction
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0051] The present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments.
[0052] like figure 1 As shown, the input of the method of the present invention is Chinese text, and the output is hot topics (including ranked topic words and topic cluster representative documents). First, preprocess the text data, including word segmentation, stop word filtering, feature word weighting, etc., and then use the LDA topic model to model it and filter and denoise the vectorized text; then based on the improved Single-Pass algorithm. The text after dimensionality reduction is clustered; finally, the hot topic in the topic cluster is identified by the hot topic detection method, and the hot topic is displayed by using the topic word ranking algorithm and the document distance calculation formula. The details are as follows:
[0053] Step 1: Text preprocessing; the text preprocessing of the present invention includes se...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com