Microblog data analysis based hot news prediction method and system
A technology of data analysis and prediction method, which is applied in the direction of network data retrieval, network data indexing, electronic digital data processing, etc. It can solve problems such as the inability to find hot topics and the inability to comprehensively analyze the characteristics of hot topics, so as to solve the problem of early prediction Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0041] like figure 1 and figure 2 As shown, the hot news prediction method based on microblog data analysis of the present embodiment includes the following steps:
[0042] S1. Collect news reports from mainstream news websites and the microblog user reaction information caused by them on microblogs. The news reports include titles and texts, and the microblog user reaction information is searched on microblogs using news titles as keywords. The microblog result set includes microblog user information, microblog text, posting time, but does not include news reports in microblog by news media;
[0043] S2. Perform word segmentation and word frequency statistics on the microblog text, calculate the TF-IDF (termfrequency-inversedocumentfrequency) value of the word, and convert it into a microblog topic using a vector space description;
[0044] S3. Classify the microblog topics, describe the three quantitative indicators of the microblog topics, and calculate the three popular...
Embodiment 2
[0078] like image 3 As shown, the hot news prediction system based on microblog data analysis of the present embodiment, the system includes:
[0079] The data collection module is used to collect news reports from mainstream websites and the reaction information of Weibo users on Weibo;
[0080] The text analysis processing module is used to perform word segmentation and word frequency statistics on the microblog text, calculate the TF-IDF value of the word, and convert it into a vector space to describe a microblog topic;
[0081] The data statistical analysis module is used to classify Weibo topics, count and describe various quantitative indicators of Weibo topics, and calculate various popularity indicators of news;
[0082] The hot news prediction module is used to use the multiple linear regression algorithm to learn the sample data, establish a hot news prediction model, and judge whether the following news will become a hot news according to the hot news prediction ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com