Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Emergent event abstract extracting method based on sparse learning

A technology for emergencies and events, applied in the field of abstract extraction of emergencies based on sparse learning, it can solve the problems of too much redundancy and insufficient description of emergencies, and achieve the effect of improving efficiency.

Active Publication Date: 2017-03-08
BEIJING UNIV OF TECH
View PDF2 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this method of summarization does not solve the problem from the essence of the data set - in the face of massive redundant news data streams, finding related events on a specific topic is like finding a needle in a haystack, and the number of topics in the same period is staggering, so it must be taken Effective feature extraction method, select the topic set that can reflect the minimum redundancy that can represent events in this period
In addition, traditional abstract extraction techniques often ignore the semantic relationship between news text data samples, and only focus on the score of a single sentence. In this way, only the sentence with the highest score is extracted as the final abstract sentence. Although the accuracy of a single sentence is high, it is often In general, descriptions of emergencies are often insufficiently comprehensive or redundant

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Emergent event abstract extracting method based on sparse learning
  • Emergent event abstract extracting method based on sparse learning
  • Emergent event abstract extracting method based on sparse learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The present invention will be described in detail below in conjunction with specific embodiments shown in the accompanying drawings.

[0023] Such as figure 1 As shown, the embodiment of the present invention provides a method for extracting emergency event summaries based on sparse learning theory, including:

[0024] Step S1, obtain 21 emergency topics provided by TREC 2015Temporal Summarization track, perform query expansion on each emergency topic, and obtain the extended topic term set of the event topic;

[0025] Step S2, first decrypt, decompress, parse, and convert the TREC-TS-2015F-RelOnly data set into TREC format data, then use the language model tuned in Lemur as the retrieval model, and query the extended extension according to each event The topic term retrieves each event and obtains a collection of documents related to each event topic;

[0026] Step S3, using the non-negative matrix factorization method to sequentially perform feature selection and se...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an emergent event abstract extracting method based on a sparse learning theory. The method comprises the following steps: acquiring emergent event topics, and performing query extension on the emergent event topics to obtain an extension topic lexical item set of the event topics; retrieving events according to extension topic lexical items after the query extension of the events to obtain document sets relevant to the event topics; performing feature selection and semantic clustering on the document sets of the topics in sequence by a non-negative matrix decomposing method to obtain topic clustering results of the events; and extracting a representative sentence from the clustering results to serve as a finial abstract result according to a maximal marginal relevance (MMR) method. Through the technical scheme provided by the invention, when the emergent events burst, latest condition information of event development is provided for a user in the presence of massive redundant news report streams.

Description

technical field [0001] The invention belongs to the field of text information processing, and relates to a method for extracting emergency abstracts based on sparse learning. Background technique [0002] Living in the information age, it is no longer a problem for users to obtain massive reports on events and topics they are interested in. However, big data does not mean big knowledge, especially when emergencies occur, and the number of related event reports explodes. In this way, how to efficiently, timely and conveniently track the development status of specific topic emergencies from the massive news data stream with geometric growth, and finally form a summary of event development that is easy for readers to read, so as to help people learn from numerous It has become a very urgent task to quickly obtain the latest development status of emergencies that one is interested in in news reports. [0003] Most of the traditional multi-document summarization methods filter ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06F17/27G06N5/02
CPCG06F16/35G06F16/9535G06F40/30G06N5/022
Inventor 杨震姚应哲
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products