Method for mining frequent weighting item set based on effective weighting tree

A technology of frequent itemsets and itemsets, which is applied in the field of data processing, can solve problems such as running failures, high memory usage of algorithms, and failure to obtain frequent weighted itemsets, etc., to achieve the effect of reducing running memory and quickly and effectively mining

Active Publication Date: 2019-08-06
CHONGQING UNIV
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In 2009, B le, H Nguyen and B Vo proposed the WIT-tree method to mine frequent weighted itemsets, but its transactions per frequent itemsets, so the memory usage of the algorithm is high
In 2013, B Vo, F Coenen and B Le proposed the WIT-diff algorithm, using a difference set strategy to reduce memory usage and speed up the process, but in databases with a large number of items, it will run due to memory overflow Fails to get frequent weighted itemsets

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for mining frequent weighting item set based on effective weighting tree
  • Method for mining frequent weighting item set based on effective weighting tree
  • Method for mining frequent weighting item set based on effective weighting tree

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The present invention will be further described in detail below in conjunction with examples and specific implementation methods. However, it should not be understood that the scope of the above subject matter of the present invention is limited to the following embodiments, and all technologies realized based on the content of the present invention belong to the scope of the present invention.

[0031] figure 1 A method for mining frequent weighted itemsets based on an effective weighted tree according to an exemplary embodiment of the present invention, specifically includes the following steps:

[0032] S1: Read the database D from the network, the database D contains N transactions, each transaction contains different items, the number of items and the weight w of each item.

[0033] In this embodiment, the database D is read from the network, and the database D includes N transactions, where N≥1 and is a positive integer; each transaction includes different items ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for mining a frequent item set based on an effective weighted tree, comprising the following steps: S1, reading a database D from a network, the database D comprisingN transactions, each transaction comprising different items, the number of items and the weight w occupied by each item; S2, calculating a transaction weight tw of each transaction in the database D,and generating a transaction weight table; S3, calculating the weight W of the item set S, and presetting a threshold value minws, wherein if W is greater than or equal to minws, sthe item set S is afrequent item set; if W < minws, the item set S is a non-frequent item set; and S4, constructing an effective weighted tree model for mining the frequent item set. According to the method, weights aregiven to the item sets, weight calculation is carried out, so that an effective weighted tree model is constructed, the mining efficiency of the frequent item sets is improved, memory application isreduced, and the method is suitable for a database of big data.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method for mining frequent weighted itemsets based on an effective weighted tree. Background technique [0002] With the rapid development of computer technology, human beings have entered the era of big data. The processing, analysis and use of data has become increasingly important, and data analysis and data mining have gradually become the leading research fields of the times. Data mining is a key component of the field of knowledge discovery. As the name suggests, data mining is to obtain knowledge from big data. [0003] Association rule mining (ARM, Association rule mining) plays an important role in the field of data mining. ARM is used to identify the relationship between items in transactional databases, and frequent itemset mining plays an important role in it. Mining of frequent itemsets is one of the key technologies of data mining. [0004] In classic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/903G06F16/906
CPCG06F16/906G06F16/903Y02D10/00
Inventor 朱征宇赵福强赵亮杜小东赵芳舟
Owner CHONGQING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products