Data enhancement-based out-of-distribution abnormal sample detection method

A technology of abnormal samples and detection methods, applied in the direction of instruments, character and pattern recognition, computing models, etc., can solve problems such as model prediction uncertainty, model security, etc.

Pending Publication Date: 2021-09-14
NANJING UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a method for detecting out-of-distribution abnormal samples based on data enhancement to solve the technical problems of uncertainty in model prediction and low model security

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data enhancement-based out-of-distribution abnormal sample detection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to better understand the purpose, structure and function of the present invention, a method for detecting out-of-distribution abnormal samples based on data enhancement of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0024] Such as figure 1 Shown, the present invention comprises the following steps:

[0025] Step 1, feature extraction, use the automatic encoder algorithm to extract the feature vector of the sample picture in the input distribution.

[0026] Step 1.1. Map the input sample picture to the feature space Z through the encoder to obtain the abstract feature z;

[0027] Step 1.2, map the abstract feature z back to the input space X to obtain reconstructed samples;

[0028] Step 1.3. Simultaneously optimize the encoder and decoder by minimizing the reconstruction error to obtain the optimal model parameters;

[0029] Step 1.4, output the abstract feature representation z* for the sa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an out-of-distribution abnormal sample detection method based on data enhancement. A feature extraction stage: extracting feature vectors of samples in input distribution by using an encoder part of an automatic encoder algorithm; a data enhancement and reconstruction stage: a feature-based data enhancement method is used for transforming the features extracted in the feature extraction stage, and a decoder part of an automatic encoder is used for generating a sufficient number of auxiliary distribution external abnormal sample data sets containing semantic information from enhanced feature vectors; a sample marking stage: the prediction accuracy of an original classifier on a reconstructed sample is used as a soft label value of an abnormal sample outside distribution; and a classifier retraining stage: the classifier is retrained in combination with the in-distribution training data set containing the hard label supervision signals and the out-of-distribution abnormal sample data set containing the soft label supervision signals. According to the method, the uncertainty of the deep neural network model during prediction of the out-of-distribution abnormal samples is improved, and the safety of the model is improved.

Description

technical field [0001] The invention belongs to the field of credible machine learning, and in particular relates to a method for detecting out-of-distribution abnormal samples based on data enhancement. Background technique [0002] When machine learning classifiers are applied to real-world tasks, they tend to fail when the training and test dataset distributions are different. Worse, these classifiers often fail quietly while delivering highly confident predictions that are, unfortunately, incorrect. If classifiers can't point out when they might be wrong, it can limit their use and even lead to serious accidents. For example, a medical disease diagnosis model might consistently classify with high confidence, even though it should flag difficult examples that require human intervention. The resulting unlabeled misdiagnoses could hinder the future development of machine learning techniques in medicine. More generally and importantly, estimating when a model is wrong is ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62G06N20/00
CPCG06N20/00G06F18/211G06F18/241
Inventor 王崇骏姜文玉杜云涛朱志威李宁
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products