
Domain knowledge-based multilayer association rules mining method and system

This technology applies domain knowledge to multi-layer association rule mining methods and systems. It addresses problems such as the large number of candidate itemsets and the large space required for mining and recursive operations. It achieves good execution efficiency and scalability while ensuring the correctness and completeness of the results.

Inactive Publication Date: 2015-01-14
GUANGZHOU INST OF ADVANCED TECH CHINESE ACAD OF SCI
Cites: 4 | Cited by: 26

AI Technical Summary

Problems solved by technology

In general, breadth-first algorithms have the disadvantage of generating a large number of candidate itemsets and scanning the database multiple times.
Other approaches have application difficulties as well: when dealing with large, sparse databases, mining and recursive operations still require a lot of space.
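
To make the first problem concrete, here is a minimal sketch of level-wise (breadth-first) mining in the Apriori style. It is not taken from the patent; the function name `apriori`, the `stats` counters, and the grocery data in the usage note are assumptions chosen for illustration. The counters show how candidates and database passes accumulate level by level.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Level-wise frequent itemset mining (illustrative sketch).

    Returns (frequent, stats): `frequent` maps each frequent itemset to its
    support count; `stats` records how many candidates were generated and
    how many full passes over the transactions were made.
    """
    transactions = [frozenset(t) for t in transactions]
    stats = {"candidates": 0, "scans": 0}
    frequent = {}

    # Level 1 candidates: every distinct item.
    candidates = {frozenset([i]) for t in transactions for i in t}
    k = 1
    while candidates:
        stats["candidates"] += len(candidates)
        stats["scans"] += 1  # one full database pass per level
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        level = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(level)

        # Join frequent k-itemsets into (k+1)-candidates, then prune any
        # candidate that has an infrequent k-subset.
        k += 1
        joined = {a | b for a in level for b in level if len(a | b) == k}
        candidates = {
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent, stats
```

On a hypothetical four-transaction database over `milk`, `bread`, and `butter`, with each pair bought together twice and `min_support=2`, this needs three passes and seven candidates to find six frequent itemsets. Both counters grow quickly as the number of distinct items increases.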



Examples


Embodiment 1

[0046] As shown in Figure 1, this embodiment discloses a method for mining multi-layer association rules based on domain knowledge, including the following steps:

[0047] S1. Taking domain knowledge as basic data, constructing a domain correlation model according to the correlation of the basic data;

[0048] S2. Taking domain knowledge as basic data, constructing a structural classification layer based on the basic data;

[0049] S3. Clustering and storing items on the basis of the structural classification layer, thereby generating a clustering layer of items and constructing an original transaction database, wherein the data stored in the original transaction database is in one-to-one correspondence with the domain knowledge;

[0050] S4. Perform hierarchical classification on the original transaction database, and map this hierarchical classification to a frequent pattern tree to construct a frequent pattern tree structure, which may specifically be:

[0051] S41. Coding and descri...
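
Steps S3 and S4 can be illustrated with a minimal sketch that generalizes items through a classification hierarchy and then inserts the resulting transactions into a frequent pattern tree. This is a textbook two-pass FP-tree construction under stated assumptions, not the patent's coding scheme, which is truncated above. The `taxonomy` mapping, the `generalize` helper, and the `FPNode` class are hypothetical names introduced here.

```python
from collections import Counter


class FPNode:
    """One node of a frequent pattern tree (illustrative)."""

    def __init__(self, item, parent=None):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}


def generalize(transaction, taxonomy, level):
    """Replace each item by its ancestor at `level` of a hypothetical taxonomy.

    `taxonomy` maps an item to a tuple of categories ordered from the most
    general category down to the item itself.
    """
    return {
        taxonomy[item][min(level, len(taxonomy[item]) - 1)]
        for item in transaction
    }


def build_fp_tree(transactions, min_support):
    """Standard two-pass FP-tree construction."""
    # Pass 1: count item supports and keep only the frequent items.
    counts = Counter(i for t in transactions for i in set(t))
    keep = {i: c for i, c in counts.items() if c >= min_support}

    root = FPNode(None)
    header = {i: [] for i in keep}  # item -> list of tree nodes for that item

    # Pass 2: insert each transaction, most frequent items first
    # (ties broken alphabetically so the tree is deterministic).
    for t in transactions:
        ordered = sorted((i for i in set(t) if i in keep),
                         key=lambda i: (-keep[i], i))
        node = root
        for item in ordered:
            child = node.children.get(item)
            if child is None:
                child = FPNode(item, node)
                node.children[item] = child
                header[item].append(child)
            child.count += 1
            node = child
    return root, header
```

Calling `generalize` with a different `level` before `build_fp_tree` yields one tree per layer of the hierarchy, which is one simple way to relate a classification layer to a tree structure. Whether the patent builds one tree or several is not stated in the visible text.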



Abstract

The invention belongs to the technical field of data mining and specifically discloses a domain knowledge-based multilayer association rule mining method and system. The method comprises the following steps: taking domain knowledge as basic data and constructing a domain correlation model according to the correlation of the basic data; taking domain knowledge as basic data and constructing a structure classification layer according to the basic data; clustering and storing items on the basis of the structure classification layer, thereby generating an item clustering layer and constructing an original transaction database; performing layer classification on the original transaction database and mapping the layer classification into a frequent pattern tree to construct a frequent pattern tree structure; and searching the frequent pattern tree to obtain the frequent itemsets. The invention ensures the correctness and completeness of the frequent itemset mining results and, compared with the latest similar mining algorithms, offers better execution efficiency and scalability.
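
The final step, searching for frequent itemsets, can be sketched as a recursive pattern-growth search. The code below is a simplified, list-based analogue of FP-growth that projects conditional pattern bases directly from ordered transactions instead of walking tree links. It is an assumption-laden illustration, not the patent's search procedure, and the name `pattern_growth` is hypothetical.

```python
from collections import Counter


def pattern_growth(weighted_txns, min_support, suffix=frozenset()):
    """Yield (itemset, support) pairs by recursive projection.

    `weighted_txns` is a list of (items, count) pairs, where `items` is any
    iterable of hashable, sortable items and `count` is its multiplicity.
    """
    # Count item supports within this (conditional) database.
    counts = Counter()
    for items, n in weighted_txns:
        for i in set(items):
            counts[i] += n

    # Frequent items, most frequent first (ties broken by item order).
    frequent = sorted((i for i, c in counts.items() if c >= min_support),
                      key=lambda i: (-counts[i], i))
    rank = {i: r for r, i in enumerate(frequent)}

    # Drop infrequent items and order each transaction by that ranking.
    ordered = [
        (sorted((i for i in set(items) if i in rank), key=rank.get), n)
        for items, n in weighted_txns
    ]

    # Grow patterns from the least frequent item upward.
    for item in reversed(frequent):
        found = suffix | {item}
        yield found, counts[item]
        # Conditional pattern base: the prefix that precedes `item`.
        base = [(t[:t.index(item)], n) for t, n in ordered if item in t]
        base = [(p, n) for p, n in base if p]
        yield from pattern_growth(base, min_support, found)
```

A typical call wraps each transaction with a count of one, for example `dict(pattern_growth([(t, 1) for t in transactions], 2))`. Because each itemset is generated only from the conditional base of its lowest-ranked item, no candidate set is materialized and no itemset is reported twice.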

Description

Technical field

[0001] The invention belongs to the technical field of data mining, and specifically relates to a method and system for mining multi-layer association rules based on domain knowledge.

Background

[0002] In recent years, with the rapid growth in the amount of data, data mining technology, which automatically searches for special correlations hidden in large amounts of data, has emerged to meet the needs of the times. Data mining technology is the result of long-term research and development in database technology. At first, it consisted only of access to and querying of databases stored on a computer. With the arrival of the era of massive data, data mining technologies have been extended to querying and traversing data, finding potential connections between data, and promoting the transmission of information.

[0003] Different from the algorithm based on the Apriori idea, since the multi-layer association rule mining problem was proposed, an algorithm based on the FP-Growt...


Application Information

IPC(8): G06F17/30
CPC: G06F16/212
Inventors: 孟振宇, 吴晓鸰, 王慰, 李建军
Owner: GUANGZHOU INST OF ADVANCED TECH CHINESE ACAD OF SCI