Hotspot information finding method and system

A hotspot information and discovery method technology, applied in the field of hotspot information discovery method and system, can solve the problems of single feature, undiscoverable, low efficiency, etc., and achieve the effect of saving reading time and improving speed

Active Publication Date: 2016-11-23
IFLYTEK CO LTD +1
View PDF7 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, if two words are far apart in the text but semantically closely related, existing methods cannot discover this connection
In addition, the existing methods only use the shortest path to measure the importance of each node when calculating the importance of each node, and the features are relatively single
The words with high importance obtained by using existing methods may not necessarily represent the semantic information of the original text
When calculating the importance of each node at the same time, it is necessary to calculate all the shortest paths in the network every time, which is inefficient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hotspot information finding method and system
  • Hotspot information finding method and system
  • Hotspot information finding method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.

[0055] Such as figure 1 Shown is a flowchart of a method for discovering hotspot information in an embodiment of the present invention, including the following steps:

[0056] Step 101, acquire text to be processed.

[0057] Step 102, perform word segmentation and part-of-speech tagging on the text to be processed.

[0058] For example, a conditional random field-based method may be used to perform word segmentation and part-of-speech tagging on the text to be processed. Of course, other methods can also be used for word segmentation and part-of-speech tagging. For example, the longest word matching can be used for word segmentation, and methods based on HMM (Hidden Markov Model, Hidden Markov Model) can be us...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a hotspot information finding method and system. The method includes the steps of obtaining a to-be-processed text, conducting word dividing and part-of-speech tagging on the to-be-processed text, conducting parsing on the text subjected to word dividing to obtain a dependence syntactic tree of each sentence in the to-be-processed text, removing stop words in the dependence syntactic tree of each sentence in the to-be-processed text to obtain to-be-analyzed dependence syntactic tree, establishing a small world network through the to-be-analyzed dependence syntactic tree, conducting hotspot analysis according to the to-be-analyzed dependence syntactic tree and the small world network, and obtaining hotspot information in the to-be-processed text according to the hotspot analysis result. By means of the method and system, the hotspot information in the to-be-processed text can be efficiently and accurately found.

Description

technical field [0001] The invention relates to the technical field of data mining, in particular to a method and system for discovering hotspot information. Background technique [0002] With the rapid development of the Internet and the continuous improvement of storage technology, more and more text information is flooding around us. However, there is a lot of redundancy in this information, and reading step by step will obviously waste a lot of time and energy for users. The hotspot analysis method can quickly extract key vocabulary or sentence information from a large amount of text information, that is, hotspot information, so that users can quickly and easily understand the important information contained in the text, which has become a research hotspot for researchers. Therefore, how to efficiently and accurately conduct hotspot analysis on the text and find the corresponding hotspot information in the text to be processed has become the primary task of hotspot anal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F17/27
Inventor 吴及侯晋峰胡国平吕萍王影胡郁刘庆峰
Owner IFLYTEK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products