Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Feature selection method based on fuzzy set feature entropy calculation

A feature selection method and feature entropy technology, applied in the field of security data analysis, can solve problems such as a large number of features, high computational complexity, and the number of feature values, and achieve the effect of reducing computational complexity

Inactive Publication Date: 2020-01-14
HARBIN ENG UNIV
View PDF2 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to solve the problem of large number of features and high calculation complexity in the field of network security analysis. In order to solve the problem of evaluating the importance of features in network security analysis data, although the classical entropy value theory can directly evaluate the effect of individual features on classification results Contribution, but the traditional entropy calculation method is related to the number of values ​​of the feature, and can only deal with discrete features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Feature selection method based on fuzzy set feature entropy calculation
  • Feature selection method based on fuzzy set feature entropy calculation
  • Feature selection method based on fuzzy set feature entropy calculation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The invention proposes a feature subset selection method based on fuzzy set feature entropy calculation. The technical solution of the present invention will be further described in detail through the following specific embodiments.

[0034] In order to verify the effectiveness of the present invention, experiments are conducted on the security analysis data UNSW-NB15 to observe the feature importance evaluation process, and compare the performance of the proposed method with the classic feature importance evaluation methods such as ReliefF, Laplacian and luukka. In order to verify the performance of the present invention on data sets other than the safety analysis data set, this paper also tests on the chronic kidney disease data set.

[0035] (1) Introduction to UNSW-NB15 dataset

[0036] This article uses the UNSW-NB15 dataset, which is a real mixed dataset of normal user behavior and attack behavior in contemporary networks generated by the CyberRange Lab (CRL) of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the field of security data analysis, and particularly relates to a feature selection method based on fuzzy set feature entropy calculation. The method mainly comprises the steps of calculating an ideal vector matrix, calculating a similarity matrix, calculating entropy of features, calculating a scaling factor SFi, and calculating a scaling factor SFi. According to the method, the distance between the ideal vectors in each category is calculated by using the scaling factors of the features and the entropy between the specific categories, feature selection can be optimized, and the calculation complexity is reduced. According to the method, a fuzzy set information entropy calculation method FIEE is adopted to solve the problem that in a traditional information gainand information gain ratio calculation method, calculation cannot be conducted due to the fact that the feature value space is huge. According to the method, the calculation complexity can be greatlyreduced.

Description

technical field [0001] The invention belongs to the field of security data analysis, and in particular relates to a feature selection method based on fuzzy set feature entropy value calculation. Background technique [0002] In recent years, with the continuous increase in the number of Internet users, network security has also become a topic of great concern. Intrusion detection system (IDS) is an effective method for fighting malicious behaviors of network attackers, and the decisive factor for determining the performance of IDS is whether there is a corresponding efficient and accurate classification model, and how to target security Analyzing the characteristics of the data set and building a suitable classification model has also become a research hotspot in this field. A large number of redundant features is the biggest feature of security analysis data. In the process of data analysis, irrelevant features will confuse the classifier, which not only increases the comp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/2113G06F18/214G06F18/2411
Inventor 郭方方孙思佳赵天宇吕宏武冯光升王瑞妮王欣悦何迪
Owner HARBIN ENG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products