Machine learning model robustness evaluation method based on noise data

A machine learning and robust technology, applied in the field of model robustness evaluation, can solve problems such as high cost and impact evaluation algorithm, and achieve the effect of comprehensive and perfect cognition

Active Publication Date: 2020-01-21
NANJING UNIV
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

If these labels are wrong, it seriously affects the ability to evaluate the algorithm on the development set, and correcting wrong labels will cost a lot

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Machine learning model robustness evaluation method based on noise data
  • Machine learning model robustness evaluation method based on noise data
  • Machine learning model robustness evaluation method based on noise data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Embodiments of the present invention are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification.

[0036] Such as figure 1 As shown, the method for evaluating model robustness based on noise data in this embodiment mainly includes:

[0037] 1. Model the original data set to get the accuracy of the model. Among them, the original data is a data set with 100% correct labels, and the method of dividing the training set and the test set is 10 times of 10-fold cross-validation. Substituting the original training set into N different supervised learning algorithms, we train N different predictive models. Based on the original test set, we evaluate the accuracy of these N models and obtain the corresponding N accuracy rates.

[0038] 2. Model the noisy data to get the accuracy of the new model. Among them, on the basis of the origina...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a machine learning model robustness evaluation method based on noise data. The method comprises the steps of original data set processing, noise data acquisition, model training, model prediction, accuracy reduction ratio calculation and model robustness evaluation. The original data set processing comprises the steps of collecting an original data set with a correct percentage label, and dividing an original training set and an original test set by adopting 10 times of 10-fold cross validation. The noise data acquisition comprises the following steps: on the basis of anoriginal training set, extracting t'= | D | * alpha data by adopting a stratified sampling method, and replacing a label of the data with an error label, and alpha is a noise data rate. Model training comprises the step of respectively inputting an original training set and a training set mixed with noise data to respectively construct an original model and a new model based on a common classification algorithm. Model prediction includes performing accuracy evaluation on an original model and a new model based on an original test set. Accuracy decline ratio calculation includes calculating arate of decline in accuracy of the new model relative to the original model. Model robustness evaluation comprises the steps of comparing the size of the rate of accuracy reduction in the transverse direction and the longitudinal direction, measuring the robustness of the model, and achieving the standard of judging the robustness of the model.

Description

technical field [0001] The invention belongs to the field of machine learning applications, and in particular relates to model robustness evaluation. We evaluated the robustness of the model by measuring the influence of noisy data on the model. Background technique [0002] Machine learning is an important branch of artificial intelligence research, which learns the representation of features by organizing and fitting parameters. Because of its high generalization ability and efficiency, it is more and more widely used in academia and industry. According to its learning form, it can be divided into: supervised learning, unsupervised learning and semi-supervised learning. Among them, supervised learning is an important aspect of machine learning. It is the process of learning data and its corresponding labels, training an intelligent algorithm, and mapping input data to labels. The data for a supervised learning problem consists of an input X and an output label Y, howeve...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/10
CPCG06N20/10Y02T90/00
Inventor 房春荣龚爱王栋陈振宇李玉莹
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products