Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method for generating tweet summaries based on topic relevance

A correlation and topic technology, applied in the field of text summarization in natural language processing, can solve the problem of not introducing specific topics and social network data, and achieve the effect of rich information, good novelty, and reduced redundant information

Active Publication Date: 2022-05-03
CHONGQING UNIV OF POSTS & TELECOMM
View PDF14 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the problem of not introducing specific topics and social network data in the existing abstract generation methods, the present invention establishes large-scale master social network data of different topics based on statistics, and then designs a method for abstract generation based on thesaurus

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for generating tweet summaries based on topic relevance
  • A method for generating tweet summaries based on topic relevance
  • A method for generating tweet summaries based on topic relevance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] Considering the thematic nature of social network data and the sparseness of data, most studies despersionized tweets based on the topic, and then summarized the tweets after screening. While abstracting tweets for a given topic, the resulting abstract should have better topic relevance, and previous studies have often studied the summary of abstracts and the coverage of the source text, and few people have taken into account the topic relevance of abstracts. In social network data, people make a certain statement, usually related to a topic, different users and different time periods of social network data, the topic of discussion is also different. For a summary of a piece of speech, if the subject of the abstract is specified, we must want to get a summary that is more relevant to the topic. Thus, the present invention has been designed with a summary method of considering the subject matter. This method predefines several transcendental topics and thesaurus on the basis...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for generating tweet summaries based on topic correlation, which includes establishing a thesaurus for each topic through the distribution of nouns in each topic; and calculating The correlation between a tweet and a topic; calculate the public recognition degree according to the network interaction information; combine the public recognition degree and the topic correlation to get the final tweet significance; use the maximum marginal correlation algorithm for deduplication processing, Output summary. This method selects tweets as summaries from topic relevance and tweet salience, and controls the redundancy of the final summaries, so that the generated tweet summaries comprehensively consider the summarization topics, diversity, and social identity. As a result, abstracts with higher topic relevance, novelty and summarization are obtained.

Description

Technical field [0001] The technical field involves text summary technology in natural language processing, which is used to automatically generate a summary of the topic of Twitter speech. Specifically, given a particular topic and several tweet texts, a summary related to that topic is obtained. Background [0002] With the rapid development of social network media and self-media, abstract research on summarizing massive data has been spawned. Since there is no large-scale public dataset of social network data, most of the current summary studies of social network data are traditional unsupervised methods. Methods based on statistical features, mainly based on the relative position of sentences, word frequency characteristics, etc. to study, such methods are easy to implement, but the obtained features are often relatively simple; based on the method of graph model, such methods regard the sentences in the text as nodes, the similarity score between the texts as the edges betw...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/216G06F40/284G06Q50/00
CPCG06F40/216G06F40/284G06Q50/01
Inventor 陈子忠曹洋洋夏书银
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products