A Time Sensitive and Adaptive Subtopic Online Detection Method and System

A time-sensitive, adaptive online subtopic detection technology, applied in the information field. It addresses three problems of earlier systems: they do not distinguish the weight of historical documents from the weight of the latest documents, they cannot adaptively adjust the content and number of subtopics, and they cannot detect popular events. The result is improved operating efficiency and less useless information in the output.

Active Publication Date: 2018-05-22
INST OF INFORMATION ENG CHINESE ACAD OF SCI +1

AI Technical Summary

Problems solved by technology

[0009] The existing systems have the following problems. First, they do not distinguish between the weight of historical documents and the weight of the latest documents.
Second, they cannot adaptively adjust the content and number of subtopics.
Third, a system based on burst detection can only capture sudden events; it cannot detect popular events (hot subtopics), that is, events that the public cares about over a long period.

Examples


Embodiment Construction

[0036] To set out the purpose, technical solutions, and advantages of the present invention, the implementation of the proposed method is described in further detail below with reference to the accompanying drawings and examples. The specific implementations described here only explain the present invention and do not limit it. Modifications and changes that do not depart from the scope of the present invention are also included. Note also that the specific method shown in the flow chart is only one implementation of the present invention; each function in the modules can be achieved in other ways.

[0037] As shown in Figure 1, the present invention includes four modules: a document representation module, an incremental clustering module, a new subtopic discovery module, and a summary generation module. The function ...
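As a rough illustration of the first and last of these modules, the sketch below shows one possible way to represent documents as vectors and to pick a summary for a subtopic. The source text does not specify the vectorization scheme or the summarization rule, so the term-frequency vectors, the cosine similarity, and the "document closest to the centroid" rule are all assumptions, and the function names `vectorize`, `cosine`, and `summarize` are hypothetical.

```python
import math
from collections import Counter


def vectorize(text):
    """Document representation (assumed scheme): sparse term-frequency vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dict-like objects."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def summarize(docs):
    """Summary generation (assumed rule): the document closest to the centroid."""
    centroid = Counter()
    for doc in docs:
        centroid.update(vectorize(doc))  # sum of all member vectors
    return max(docs, key=lambda doc: cosine(vectorize(doc), centroid))


docs = ["phone battery explodes", "battery explodes again", "new phone color"]
print(summarize(docs))
```

Choosing an existing document as the summary keeps the sketch extractive and simple; a real system could weight documents by recency before computing the centroid.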


Abstract

The invention relates to a time-sensitive and self-adaptive subtopic online detection method and system. The method includes: 1) producing a vectorized representation of each document in the document stream; 2) incrementally clustering the documents, and adjusting the center weight of each subtopic according to document weights that decay with time; 3) when the number of subtopics generated by clustering or the weight ratio of a subtopic meets a threshold condition, or when a subtopic meets the long-tail detection condition, merging subtopics or deleting meaningless subtopics; 4) according to the weight of each new subtopic and its internal document distribution, generating summaries for the new subtopics and outputting them. The system includes a document representation module, an incremental clustering module, a new subtopic discovery module, and a summary generation module. In the present invention, the weight of historical documents decays with time, and the number and content of subtopics are dynamically updated based on threshold judgment and long-tail detection, which can effectively improve the efficiency of subtopic detection.
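Steps 2) and 3) can be sketched roughly as follows. This is a minimal illustration, not the patented procedure: the abstract does not give the decay function, the similarity measure, or the exact threshold and long-tail conditions, so the exponential half-life decay, the cosine similarity, and the parameters `sim_threshold`, `half_life`, `max_topics`, and `min_share` are all assumptions, as are the names `Subtopic` and `process`. Merging of subtopics is omitted; only deletion of low-weight subtopics is shown.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class Subtopic:
    def __init__(self, vec, t):
        self.centroid = dict(vec)  # decayed sum of member vectors
        self.weight = 1.0          # decayed count of member documents
        self.last_update = t

    def decay(self, t, half_life):
        """Assumed decay: weight halves every `half_life` time units."""
        factor = 0.5 ** ((t - self.last_update) / half_life)
        self.weight *= factor
        for term in self.centroid:
            self.centroid[term] *= factor
        self.last_update = t

    def add(self, vec):
        self.weight += 1.0
        for term, value in vec.items():
            self.centroid[term] = self.centroid.get(term, 0.0) + value


def process(stream, sim_threshold=0.3, half_life=3600.0,
            max_topics=50, min_share=0.05):
    """Consume (timestamp, vector) pairs in time order; return live subtopics."""
    topics = []
    for t, vec in stream:
        best, best_sim = None, 0.0
        for topic in topics:
            topic.decay(t, half_life)  # older documents count for less
            sim = cosine(vec, topic.centroid)
            if sim > best_sim:
                best, best_sim = topic, sim
        if best is not None and best_sim >= sim_threshold:
            best.add(vec)
        else:
            topics.append(Subtopic(vec, t))
        # Assumed threshold rule: once there are too many subtopics,
        # delete those holding only a small share of the total weight.
        if len(topics) > max_topics:
            total = sum(topic.weight for topic in topics)
            topics = [s for s in topics if s.weight / total >= min_share]
    return topics
```

Because every subtopic is decayed before each comparison, a subtopic that stops receiving documents gradually loses weight and is eventually removed, while one that keeps receiving documents retains its share.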

Description

Technical field

[0001] The invention belongs to the field of information technology, and specifically relates to a time-sensitive and self-adaptive subtopic online detection method and system, which can be applied to emergency event detection, subtopic analysis, public opinion analysis, social media data mining, and other fields.

Background

[0002] Weibo is short for microblog. By registering a Weibo account, users can follow friends, celebrities, institutions, and so on, so that different users establish network relationships. The Weibo news stream covers all kinds of content, but different social entities pay attention to completely different parts of it. For example, product companies follow the real-time word-of-mouth of their products on the Internet, and public figures follow their public image and influence among netizens. Therefore, online subtopic detection for specific target entities based on social networks has attracted great at...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30, G06F17/27, G06K9/62, G06Q50/00
CPC: G06F16/951, G06Q50/01, G06F40/284, G06F18/2321
Inventor: 李思旭, 李锐, 包秀国, 马宏远, 杨文静, 邱泳钦, 程工, 刘春阳, 庞琳, 王斌
Owner: INST OF INFORMATION ENG CHINESE ACAD OF SCI