Feature selection and classification method based on maximum information coefficient and feature selection and classification device based on maximum information coefficient

A technology of maximum information coefficient and feature selection method, applied in the field of wireless communication, can solve problems such as limiting performance and difficult to eliminate nonlinear redundancy, and achieve the effect of reducing data dimension, complexity, processing time and space

Active Publication Date: 2014-09-17
HARBIN UNIV OF SCI & TECH
View PDF3 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of the traditional feature subset selection methods select features based on the linear correlation between features, and it is difficult to eliminate the ubiquitous nonlinear redundancy, which limits the performance improvement of this type of method.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Feature selection and classification method based on maximum information coefficient and feature selection and classification device based on maximum information coefficient
  • Feature selection and classification method based on maximum information coefficient and feature selection and classification device based on maximum information coefficient
  • Feature selection and classification method based on maximum information coefficient and feature selection and classification device based on maximum information coefficient

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] Embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0031] figure 1 A flow chart of a feature selection method based on the maximum information coefficient of the present invention is shown.

[0032] refer to figure 1 , the feature selection method based on the maximum information coefficient of the embodiment of the present invention includes steps:

[0033] S1. Sorting the original features based on the mutual information criterion, deleting the features with a correlation lower than the threshold, and forming the initial feature subset with the features with a correlation higher than the threshold.

[0034] Since there are a large number of redundant and irrelevant features in the network traffic data, the features are first sorted by the degree of correlation between the features and the categories, and the features with strong correlation with the category are retained, and the features with weak c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a feature selection and classification method based on a maximum information coefficient and a feature selection and classification device based on the maximum information coefficient. The feature selection method comprises the following steps: S1, sequencing primitive features based on mutual information criterions, deleting features with correlation degree lower than a threshold value, and forming a primary feature subset by use of features with correlation degree higher than the threshold value; S2, calculating the maximum information coefficient among the features in the primary feature subset; S3, deleting redundancy features in the primitive feature subset according to the maximum information coefficient, thereby obtaining a low-dimension feature subset. According to the feature selection method, feature selection is performed by virtue of a mode of using mutual information and the maximum information coefficient, so that the redundancy features are removed, and the data dimension is reduced.

Description

technical field [0001] The present invention relates to the field of wireless communication, in particular to a feature selection and classification method and device based on maximum information coefficient. Background technique [0002] With the development of science and technology, the scale of data has also increased rapidly, and intelligent analysis and processing of these data has become more and more important. But the ensuing problem is that there is a lot of redundant information in the massive raw data, which poses challenges to the algorithms in the fields of pattern recognition and machine learning. On the one hand, redundant data greatly increases the time complexity and space complexity of the algorithm, wastes storage resources, and increases processing time; on the other hand, due to the large correlation of redundant data, the classification results are biased. Reduced performance of intelligent information processing software. The purpose of feature sele...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06K9/62
CPCG06F16/285G06F16/35
Inventor 孙广路何勇军刘广明
Owner HARBIN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products