Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Concept drift detection method based on classification error rate and consistency prediction

A technology for classification errors and concept drift, which is applied in the direction of platform integrity maintenance, character and pattern recognition, machine learning, etc., to achieve the effect of reducing the amount of calculation

Active Publication Date: 2020-12-25
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF3 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to solve the technical problems of concept drift detection of machine learning classification models in the field of software analysis in information security, and creatively propose a concept drift detection method combining classification error rate and consistency prediction, which is useful for evaluating current information security The performance and sustainability of the rapidly increasing security detection or evaluation systems based on machine learning classification algorithms in the field in the face of ever-evolving malware have important practical significance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Concept drift detection method based on classification error rate and consistency prediction
  • Concept drift detection method based on classification error rate and consistency prediction
  • Concept drift detection method based on classification error rate and consistency prediction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The method of the present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments.

[0019] A concept drift detection method combining classification error rate and consistency prediction, combining classification error rate change detection and consistency predictor, capable of detecting arbitrary machine learning classification models caused by sudden concept drift and gradual concept drift degradation problem.

[0020] The method of the present invention includes two parts: sudden concept drift detection based on classification error rate and progressive concept drift detection based on consistency predictor, the process of which is as follows figure 1 shown.

[0021] Step 1: Use the classification model to classify the samples.

[0022] The data of the test data set is input into the classification model in turn for classification, and the input sample sequence is recorded as (x i ,y i ), x i is the mult...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a concept drift detection method based on a classification error rate and consistency prediction, and belongs to the technical field of computer machine learning and informationsecurity. According to the method, mutation type concept drift is detected by calculating a change of the classification error rate of a model, and then the progressive concept drift is detected by calculating a consistency degree of the samples with wrong classification and the samples with correct classification so that the mutation type concept drift and the progressive concept drift can be detected in time, and relatively low calculation overhead is kept. According to the method, detection of mutation type concept drift and progressive concept drift is achieved at low calculation cost, and a model degradation phenomenon is recognized in time. The method is mainly used for concept drift detection, can effectively act on early judgment of a degradation phenomenon of a machine learning classification model, and can be used as a performance monitoring method in various application fields such as automatic analysis and decision in a big data environment.

Description

technical field [0001] The invention proposes a concept drift detection method based on classification error rate and consistency prediction, which belongs to the technical field of computer machine learning and information security. Background technique [0002] With the advent of the big data era, the application of machine learning algorithms in many fields has developed rapidly. Among them, information security based on big data technology is one of the most active research hotspots in recent years. A large number of machine learning classification models such as code clone detection models, malicious code classification models, vulnerability prediction models, and defect prediction models have emerged. These models are in High accuracy was achieved during construction. [0003] In the field of information security, software has evolved over time and different categories have emerged. Taking malicious software as an example, it can be divided into major categories such...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/57G06F21/55G06K9/62G06N20/00
CPCG06F21/577G06F21/554G06N20/00G06F18/2415
Inventor 王勇彭金雪张继刘振岩薛静锋林珂卉
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products