Financial news stream emergency detection method based on hierarchical clustering

A detection method and news flow technology, applied in finance, unstructured text data retrieval, text database clustering/classification, etc. Issues such as the source of financial news and public opinion dissemination, to achieve the effect of strengthening reputation risk management, good control and introduction of user needs, and preventing public opinion from getting out of control

Active Publication Date: 2021-09-28
NANJING UNIV OF SCI & TECH
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1. Weak awareness of crisis and insufficient monitoring of financial emergencies;
[0005] 2. The financial emergency response system is not perfect;
[0006] 3. The guidance and handling of online public opinion when a financial emergency occurs is not professional enough
[0021] 4. Major risk events of the company itself;
[0024] From the above content, it can be seen that there are many news data and factors that need to be considered in the monitoring of financial emergencies. Only relying on manpower to analyze and judge cannot satisfy the multi-level, all-round, full-screen, full-network, full-time, all-weather monitoring of financial events; it is impossible to establish a timely response System to investigate the source, path, and scope of financial news and public opinion dissemination; it is impossible to train a large number of relevant personnel at low cost to quickly monitor and handle financial events

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Financial news stream emergency detection method based on hierarchical clustering
  • Financial news stream emergency detection method based on hierarchical clustering
  • Financial news stream emergency detection method based on hierarchical clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0101] In this embodiment, under the experimental environment of Ubuntu18.04 operating system, Python3 programming environment, Intel Core i7-9700CPU, 32G memory, RTX2070GPU, a large-scale financial news stream data set has been fully tested and verified.

[0102] Such as figure 1 As shown, a hierarchical clustering-based financial news flow burst detection method includes the following steps:

[0103] Step S1: text preprocessing; including:

[0104] Step S11: During the period from December 2019 to August 2020, a total of 129,779 pieces of data involving 2,138 major listed company entities and more than 50 reliable sources of financial news streams were captured through web crawlers; the data content includes time stamps , news title, news content, number of releases, URL address and other information;

[0105] Step S12: remove duplicate news by calculating the title edit distance; remove noise data according to timestamp integrity and whether the URL is accessible;

[010...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a financial news stream emergency detection method based on hierarchical clustering. The method comprises the following steps: preprocessing a text; extracting keywords and constructing a keyword co-occurrence graph; clustering the keywords by adopting a bisection K-Means algorithm, dividing the keyword co-occurrence graph into a plurality of sub-graphs, with the keyword in each sub-graph being a financial theme; identifying the financial theme to which each piece of financial news belongs through similarity calculation; constructing an undirected graph with each piece of financial news as a node, clustering the financial news by adopting the bisection K-Means algorithm, dividing the financial news node undirected graph into a plurality of sub-graphs, with the financial news in each sub-graph being a financial event; generating a story chain through similarity calculation; performing emergency detection. According to the method, event clustering is carried out on financial news through natural language processing and graph theory related technologies, the problem that related news of the same event cannot be comprehensively considered in traditional financial emergencies is solved, the financial emergencies are efficiently and accurately detected, and certain industrial value is achieved.

Description

technical field [0001] The invention relates to the field of financial news data mining, in particular to a method for burst detection of financial news streams based on hierarchical clustering. Background technique [0002] Investors are important participants in the financial market. Once a financial emergency breaks out, it will bring disaster to the majority of investors. The detection of financial emergencies helps investors avoid risks. [0003] In recent years, public opinion related to the financial industry has shown a "surge" trend, with a relatively concentrated time of occurrence, a large amount of information interaction, and frequent interactions. The generation, expansion and dissemination of financial public opinion will have an important impact on investors, financial institutions, the financial industry and even macroeconomic operations. Often some small credit crises may lead to financial crisis events. Therefore, monitoring financial public opinion and ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/35G06F16/33G06F40/194G06F40/242G06F40/289G06K9/62G06Q40/00
CPCG06F16/35G06F16/3346G06F16/3344G06F40/289G06F40/194G06F40/242G06Q40/00G06F18/23213
Inventor 周沧琦陈辉王慧慧杨帆王毓祥
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products