Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data feature selection method based on artificial bee colony algorithm

An artificial bee colony algorithm and data feature technology, applied in the field of data processing, can solve the problems of not considering the minimum number of optimal features, rarely used, etc., to achieve the effect of improving interpretability, reducing difficulty, and improving search performance

Inactive Publication Date: 2017-05-10
DONGHUA UNIV
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this is impossible for large datasets with many features, so this method is rarely used in practical applications
In addition, when using the encapsulation formula to select the optimal feature subset, the minimization of the optimal feature number is not considered

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data feature selection method based on artificial bee colony algorithm
  • Data feature selection method based on artificial bee colony algorithm
  • Data feature selection method based on artificial bee colony algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Below in conjunction with specific embodiment, further illustrate the present invention. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. In addition, it should be understood that after reading the teachings of the present invention, those skilled in the art can make various changes or modifications to the present invention, and these equivalent forms also fall within the scope defined by the appended claims of the present application.

[0025] The embodiment of the present invention relates to a data feature selection method based on the artificial bee colony algorithm. The present invention uses the artificial bee colony algorithm (Artificial Bee Colony Algorithm) to solve the typical combinatorial optimization problem of data feature selection. The fitness function affected by the number of optimal features can not only improve the accuracy of the learning a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a data feature selection method based on an artificial bee colony algorithm. The method comprises the steps that the control parameter of the artificial bee colony algorithm is determined, and appointment normalization processing is carried out on an acquired data set; a set of bee position is initialized; a fitness function is selected according to selected features to calculate the fitness value of each bee, and the corresponding mining number of times is set to zero; based on the update method of the artificial bee colony algorithm, the position of the bees is updated, and the fitness value of a new individual is calculated and the number of times of mining is updated; a probability model function is calculated and selected; a bee is selected as an observation bee; the position of the observation bee is updated; the fitness value of a new individual is calculated and the number of times of mining is updated; the number of times of mining is observed, and a bee position update mechanism is carried out; the optimal solution position so far is kept, and represents the optimal feature subset; and if the maximum number of times of iteration is up, the optimal feature subset is output, otherwise the steps are repeated. According to the invention, the complexity of a feature selection method can be reduced.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a data feature selection method based on an artificial bee colony algorithm. Background technique [0002] In recent years, data mining technology has been widely used in the fields of business intelligence, biomedicine and genetic testing, and how to reduce the dimensionality from large-scale data to obtain effective simplified data is becoming more and more important. In many practical applications, data sets stored in databases often have thousands or even tens of thousands of features, but not all features are helpful for discovering important information hidden behind the data. Since only a small part of the features represent the distribution characteristics of the entire high-dimensional feature space, these features that need to be deleted not only increase the interference in the process of knowledge discovery by the learning algorithm, but also increase the comp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/00
CPCG06N3/006
Inventor 陈杰周武能陆康迪
Owner DONGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products