Microblog-oriented dynamic topic detection and evolution tracking method
A topic and microblog technology, applied in the field of microblog-oriented dynamic topic detection and evolution tracking, can solve the problem that the topic detection system cannot analyze and calculate the topic evolution trend.
Inactive Publication Date: 2014-12-10
中科明远(北京)并行软件有限公司
View PDF4 Cites 44 Cited by
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
[0008] Aiming at the defect that the existing theme detection system cannot analyze and calculate the evolution trend of the theme, the present invention calculates the similarity relationship between themes in different time periods in real time through hierarchical clustering, thereby analyzing the evolution trend of the theme over time, and drawing the theme Evolution trend graph, specifically proposes a dynamic topic detection and evolution tracking method for Weibo
Method used
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View moreImage
Smart Image Click on the blue labels to locate them in the text.
Smart ImageViewing Examples
Examples
Experimental program
Comparison scheme
Effect test
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More PUM
Login to View More
Abstract
The invention provides a microblog-oriented dynamic topic detecting and evolution tracking method and belongs to the technical field of intelligent information processing. The method includes the steps of 1, establishing a distributed crawler to acquire microblog data; 2, pre-processing the microblog data; 3, performing Chinese word segmentation to remove stop words, and acquiring a word set VOC; 4, subjecting the microblog data to LDA (latent Dirichlet allocation) clustering in each time interval so as to extract latent topics; 5, screening out microblog hot topics in each time interval; 6, subjecting the hot topics of a global time to hierarchical clustering to acquire inter-topic aggregation and differentiation relations; 7, visualizing a topic evolution process according to the inter-topic aggregation and differentiation relations. The method has the advantages such that topic word distribution of an event in different times and a fine-grained topic of a same topic in different times are mined under low time complexity, efficiency is high, and robustness is high; the method has greater practical value.
Description
technical field [0001] The invention belongs to the technical field of intelligent information processing, and in particular relates to a microblog-oriented dynamic topic detection and evolution tracking method. Background technique [0002] With the explosive growth of text information on the Internet, it is becoming more and more difficult for people to obtain interesting topics or event information in a timely manner from massive text information. Topic detecting and tracking (TDT) technology aims to organize language and text information flow according to events, and develop a series of core technologies that can meet the needs of the above users. Topic tracking is one of the subtasks of TDT. Topic tracking can help people effectively gather and organize scattered information, and understand all the information about a topic as a whole. As a new research direction of TDT, the topic evolution analysis aims to discover the relationship between various events in the topic ...
Claims
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More Application Information
Patent Timeline
Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/9535G06F16/285G06F16/951
Inventor 闫碧莹邓攀余雷赵鑫袁伟万安格
Owner 中科明远(北京)并行软件有限公司
Who we serve
- R&D Engineer
- R&D Manager
- IP Professional
Why Patsnap Eureka
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com