Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for applying multi-criterion fusion to feature selection of high-dimensional small sample data

A technology that integrates application and feature selection. It is used in instruments, character and pattern recognition, computer components, etc., and can solve problems such as poor algorithm stability.

Inactive Publication Date: 2016-12-07
XIHUA UNIV
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the selection results of most current feature selection algorithms are very sensitive to changes in the training set, that is, the algorithm is less stable.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for applying multi-criterion fusion to feature selection of high-dimensional small sample data
  • Method for applying multi-criterion fusion to feature selection of high-dimensional small sample data
  • Method for applying multi-criterion fusion to feature selection of high-dimensional small sample data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0022] Embodiment: the method for applying multi-criteria fusion to high-dimensional small sample data feature selection includes the following steps:

[0023] Step 1): The specific method of clustering in step 1) is to cluster the training samples by using the k-means clustering method, wherein the kth clustering uses the set G k to represent, assuming G k Contains n data{x 1 ,x 2 ,...x n}, the task of k-means clustering is to find a set of m representative points Y={y 1 ,y 2 ,...y m} so that the objective function The smaller the better, where y k yes k The cluster center and the number of clusters are determined by experiments. The value of k described in this embodiment is 8.

[0024] Step 2): Use the Fisher Ratio method and the ReliefF method to perform feature selection on the clustered samples respectively;

[0025] Step 3): Fusion feature selection results and different penalty factors are added to each class, and then the fusion result is used to train the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method for applying multi-criterion fusion to feature selection of high-dimensional small sample data. The method comprises the following steps of 1) initializing a sample data set and performing clustering on the sample data set; 2) performing feature selection on clustered samples by using a Fisher Ratio method and a Relief F method; 3) fusing feature selection results, applying different penalty factors to classes, and training a PSVM classifier by adopting a fused result; 4) performing regression on the sample data set by adopting the trained classifier, removing features with minimum correlativity, and updating the sample data set; and 5) judging whether coding is ended or not, and if yes, ending the iteration process, or otherwise, repeating the steps 2-4 until the feature selection is realized. According to the method for applying the multi-criterion fusion to the feature selection of the high-dimensional small sample data, in the field of the feature selection of the high-dimensional small sample data, the feature selection speed is remarkably increased, the feature selection efficiency is remarkably improved, and the stability of a feature selection result is greatly enhanced.

Description

technical field [0001] The invention relates to a special selection method, in particular to a method for applying multi-criteria fusion to feature selection of high-dimensional small-sample data. Background technique [0002] Feature selection is one of the core issues in the field of pattern recognition, and its research has attracted the attention of scholars all over the world. The feature selection algorithm achieves the purpose of reducing the feature dimension by selecting effective features reasonably, which can not only eliminate information redundancy, improve classification efficiency, and speed up calculation, but also reduce the complexity and classification error rate of classifiers. At present, feature selection methods have been widely used in image retrieval, text classification and gene analysis. However, the selection results of most current feature selection algorithms are very sensitive to changes in the training set, that is, the algorithm stability is...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/23213G06F18/2411G06F18/253
Inventor 江竹雷震宇
Owner XIHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products