Network news hotspot mining method and device

A news and network technology, applied in the field of information processing, can solve the problems of poor computing performance and low accuracy, and achieve the effect of speeding up clustering, effectively and accurately mining network news hotspots, and realizing network news hotspot mining.

Pending Publication Date: 2020-05-26
BEIJING UNIV OF POSTS & TELECOMM
View PDF5 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in the existing technology, there are often problems of poor computing performance and low accuracy for mining network news hotspots. With the rapid increase of network information, the public's demand for effective access to news information is becoming stronger and stronger.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Network news hotspot mining method and device
  • Network news hotspot mining method and device
  • Network news hotspot mining method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0042] figure 1 It is a schematic flow chart of the network news hotspot mining method described in an embodiment of the present invention, as figure 1 shown, including:

[0043] Step S1, preprocessing the original network news data to obtain network news information;

[0044] Step S2, extracting text feature vectors in the network news in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a network news hotspot mining method and device, and the method comprises the steps: carrying out preprocessing of original network news data to acquire network news information; extracting text feature vectors in the network news information through a bilingual LDA topic model and a bilingual LSA model; and according to the text feature vector in the network news information, performing parallel operation on a Spark platform by utilizing a Single-Pass clustering algorithm to obtain news hot topic information. According to a text feature extraction method based on combination of the bilingual LDA model and the bilingual LSA model, entity information with high distinction degree for each topic is contained in the topic model; according to the networknews hot spot mining method, the semantic relation between text contexts is also considered, and the Spark-based parallelization Single-Pass clustering algorithm is utilized, so that the clustering speed is increased, and the network news hot spot mining is more effectively and accurately realized.

Description

technical field [0001] The invention relates to the technical field of information processing, in particular to a method and device for mining network news hotspots. Background technique [0002] News topic detection and tracking is an important research branch of TDT (Topic Detection and Tracking) technology. TDT technology is based on the latest research results of natural language processing, aiming at the news data flow in the network, according to the theme and semantic characteristics of the news, they are automatically Divide it into different topics and display it to users in a clear and clear visual form. At the same time, track the dynamic development trend of the topic according to the change of time. [0003] However, in the prior art, the mining of network news hotspots often has the problems of poor computing performance and low accuracy. With the rapid increase of network information, the public's demand for effective access to news information is becoming mor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F16/34G06F16/9536G06F40/289G06F40/216
CPCG06F16/355G06F16/34G06F16/9536
Inventor 关建峰刘杨许长桥石钰瑗李心舒张婉澂
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products