Parallel association rule based topic relation finding method and finding device

A discovery method and topic technology, applied in the field of topic relationship discovery method and discovery device based on parallel association rules, which can solve the problems of lack of in-depth or root cause analysis, staying at the level of relevant data statistics, etc.

Inactive Publication Date: 2017-10-17
CHINA UNIV OF PETROLEUM (EAST CHINA)
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] All in all, most of the current research work is focused on the improvement of the text representation model or the improvement of the clustering algorithm. Although these algorithms have achieved certain results in the relationship discovery of multiple topics, most of the existing methods still use the similarity Calculate the relationship between topics, and most of the work is still at the statistical level of relevant data, lacking in-depth or root cause analysis of specific events

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel association rule based topic relation finding method and finding device
  • Parallel association rule based topic relation finding method and finding device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0035]The basis of this embodiment is that, due to the variety and complexity of topics generated within a period of time, there may be some potential connections among multiple topics. Therefore, the relevant data information of the corresponding topic screened out according to the relevant keyword information of multiple topics can satisfy the acquisition of all detailed information of the current topic, and the initial keywords of each subtask are obtained through matching and screening, and 1_frequent keywords A set refers to a keyword set composed of all keywords reaching the support threshold in data related to a topic. On this basis, a 1_item frequent keyword set is formed.

[0036] On the basis of the obtained 1_item frequent keyword set, set the support threshold and confidence threshold, and then obtain 2_item candidate keyword set, 2_item frequent keyword set, and 2_item associated keyword set ,..., n_item candidate keyword sets, n_item frequent keyword sets, n_ite...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a parallel association rule based topic relation finding method. The parallel association rule based topic relation finding method acquires an association keyword set by adoption of a parallel association rule algorithm on the basis of acquiring a large-scale frequent keyword set, performs screening and combination to form topic relevant information, and finds an association relation among a plurality of topics. The embodiment of the invention also provides a parallel association rule based topic relation finding device. The technical scheme can accurately and effectively find a potential association relation among the topics, and search underlying or root causes of specific event occurrence.

Description

technical field [0001] The invention relates to a topic relationship discovery method, in particular to a topic relationship discovery method and discovery device based on parallel association rules. Background technique [0002] Aiming at the huge and disorderly data information, various information analysis techniques are used to quickly mine the "unknown" data information, and discover the correlation between multiple topics, which can provide support for topic analysis and relationship mining. Therefore, how to more accurately and quickly discover potential correlations between related topics from a large amount of low-value density network data information is a hot issue worthy of research. [0003] Traditional research mainly expresses reports with Vector Space Model (VSM), and then uses cosine similarity to calculate the similarity between topics and reports and combines clustering algorithms to summarize related topic information and feed back to users. However, the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/3331G06F40/30
Inventor 刘昕王奕文李忠伟王丰曹帅邹苹钧
Owner CHINA UNIV OF PETROLEUM (EAST CHINA)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products