Unsupervised multi-document abstract generation method for public opinion analysis
A public opinion analysis, multi-document technology, applied in the field of unsupervised generation of document summaries, can solve the problems of poor practicability of generative summaries, lack of Chinese public opinion summaries training corpus, low effect, etc. The effect of the search space
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0059] A method for generating unsupervised multi-document summarization oriented to public opinion analysis, the generating method comprising the following steps:
[0060] Step 1: Collect network public opinion news in real time, and automatically divide news collections according to network hotspots;
[0061] Step 2: Unsupervised extraction of single-document summarization for each public opinion news in the collection;
[0062] Step 3: Analyze all extracted single-document abstracts in the collection to obtain unsupervised multi-document abstracts.
[0063] Further, in the step 1, the automatic division of news collections according to network hotspots is specifically, obtaining hotspots from the Internet, such as Weibo hotspots, Baidu hotspots, WeChat hotspots, etc., using the hotspots as query sentences, and using search engines to collect For the news related to the hot spot, establish hot spot-news, a relationship between one hot spot and multiple news, so as to divide...
Embodiment 2
[0092] A method for generating unsupervised multi-document summarization oriented to public opinion analysis, the generating method comprising the following steps:
[0093] Step 1: Collect network public opinion news in real time, and automatically divide news collections according to network hotspots;
[0094] Step 2: Unsupervised extraction of single-document summarization for each public opinion news in the collection;
[0095] Step 3: Analyze all extracted single-document abstracts in the collection to obtain unsupervised multi-document abstracts.
[0096] The purpose of this step is to generate a text summary with fluent sentences, low redundancy, and the core content of the document collection based on the multiple single-document summaries output in step 2. Unsupervised, generative, and multi-document, these three characteristics meet the needs of public opinion analysis, so the supervised generative multi-document summarization method is used to analyze the public opi...
Embodiment 3
[0131] The difference between this embodiment and embodiment 2 is that in the step 2, an unsupervised algorithm model is adopted, no manual data labeling is required, and the consumption of manpower and time costs for labeling data is avoided, and the data obtained in step 1 is directly used as a training corpus, which can Fully tap the data potential of the large-scale corpus crawled from the Internet;
[0132] This step adopts the extractive summarization method to identify a series of sentences that are strongly related to the core theme of the article from the original news text. Afterwards, it needs to be sent to step 3. If the generative abstract method is adopted, it is easy to get the output of unsound sentences, which will cause error propagation and affect the overall performance of the method;
[0133] This step adopts the single-document summarization method, which is considered for the subsequent multi-document summarization task. Due to the long text length of pu...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com