Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Interval-valued fuzzy rough set attribute selection method based on Gini indexes

A technology of fuzzy rough set and Gini index, applied in the direction of complex mathematical operations, etc., to achieve the effect of removing redundant attributes, simple calculation, and reducing noise interference

Inactive Publication Date: 2017-12-05
TIANJIN UNIV
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are many interval-valued data in real life, and there are few existing methods for interval-valued data.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Interval-valued fuzzy rough set attribute selection method based on Gini indexes
  • Interval-valued fuzzy rough set attribute selection method based on Gini indexes
  • Interval-valued fuzzy rough set attribute selection method based on Gini indexes

Examples

Experimental program
Comparison scheme
Effect test

experiment example 1

[0086] By running the method of the present invention on the actual data set fish, its effectiveness is shown. The results of the operation are shown in Table 1 and Table 2: irrelevant and redundant attributes are eliminated, thereby improving data quality and improving the generalization ability of the classifier. Among them, the data set comes from the public UCI data warehouse (http: / / archive.ics.uci.edu / ml); the data set after attribute selection is the original data set to remove the attributes not in the attribute selection; the classification accuracy is The average value of ten cross-validation, the classifier used is KNN (k=5), J48, Random Forest.

[0087] Table 1 The number of attributes after attribute selection and the number of original attributes

[0088]

[0089] Table 2 The correct rate of attribute classification

[0090] threshold

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides an interval value fuzzy rough set attribute selection method based on Gini index, comprising the following steps: Step 1, selecting an interval value decision system IVDS=(U, C∪D), where U is the universe of discourse and C is a conditional attribute Set, D is the decision attribute set, giving the similarity rate α and the stopping condition ε; step 2, use the RBD similarity to construct the similarity matrix of each object in the universe U in step 1; use the similarity rate α to obtain the universe U The α-similar class of each object ui relative to other objects uj in and other steps; this method introduces the Gini index into the rough set, defines the attribute importance formula, and proposes an attribute selection algorithm for interval values.

Description

technical field [0001] The invention relates to an attribute selection method, in particular to an interval value fuzzy rough set attribute selection method based on the Gini index. Background technique [0002] In reality, the results of data collection are often accompanied by noise data, which makes uncertain mathematical tools particularly important. Compared with other theories dealing with uncertain and imprecise problems, rough set theory does not need to provide any prior knowledge other than the data set that the problem needs to deal with. Due to the superiority of rough set in dealing with uncertain data, it has been widely used in many fields such as classification and clustering, among which attribute selection is one of the most important applications. Attribute selection can eliminate redundant, irrelevant attributes from a large number of attributes, thereby improving data quality, speeding up data processing, and improving the generalization ability of clas...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/18
CPCG06F17/18
Inventor 代建华郑国杰
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products