Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Multi-classifier training method and classifying method based on non-deterministic active learning

A multi-classifier and active learning technology, applied in the direction of instruments, character and pattern recognition, computer components, etc., can solve the problems of inability to achieve classification performance, inability to accurately describe the true distribution of sample data, etc., and achieve comprehensive and effective measurement and classification Effect optimization, avoiding the effect of information redundancy

Active Publication Date: 2015-05-27
INST OF INFORMATION ENG CAS
View PDF11 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The traditional multi-classification method based on deterministic active learning only focuses on model tuning and ignores model changes, so it is only suitable for application scenarios where the number of categories is known; while the number of categories is uncertain, the multi-classification method based on deterministic active learning The method is limited to the evaluation of the amount of sample information under the existing N classification model, but it cannot accurately describe and fit the real distribution of the sample data, so that it cannot effectively improve the classification performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-classifier training method and classifying method based on non-deterministic active learning
  • Multi-classifier training method and classifying method based on non-deterministic active learning
  • Multi-classifier training method and classifying method based on non-deterministic active learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0079] The principles and features of the present invention are described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0080] Example based non-deterministic active learning method for multi-classification

[0081] The multi-classification method based on non-deterministic active learning provided by the invention realizes the gradual optimization of the classification model through a cyclic iterative process.

[0082] Assuming that each round of loop iteration needs to label K samples, the following process is executed inside each round of loop iteration:

[0083]

[0084]

[0085] After the method is executed, if the number of loop iterations is M, the total number of samples marked by experts through human-computer interaction is K×M.

[0086] Taking image classification as an example, the image sample is represented by a ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-classifier training method and a classifying method based on non-deterministic active learning. The method comprises the following steps: 1) selecting or initializing a multi-classifier and calculating the overall information quantity info of each sample in an unlabeled sample set by the utilization of the multi-classifier, wherein the overall information quantity is the sum of model change information quantity and model tuning information quantity; 2) clustering the unlabeled sample set to obtain J subclasses; 3) selecting a plurality of unlabelled samples with minimum overall information quantity Info from each subclass; selecting, labeling and adding K samples from the selected sample into a labeled sample set L; 4) training the multi-classifier again through taking the updated labeled set L as training data; 5) iteratively executing the steps 1)-4) to set the number of times and then classifying the unlabelled set by the utilization of the finally-obtained multi-classifier. According to the multi-classifier training method and the classifying method, the comprehensive evaluation on the sample information quantity is realized, so that the efficient and intelligent classifier is obtained.

Description

technical field [0001] The invention relates to a multi-classifier training method and classification method based on non-deterministic active learning, belonging to the technical field of software engineering. Background technique [0002] Data classification has always been a research hotspot, such as patent ZL 201010166225.6 "an adaptive cascade classifier training method based on online learning", patent ZL 200910076428.3 "a cross-domain text sentiment classifier training method and classification method" , Patent ZL 200810094208.9 "Document Classifier Generation Method and System". [0003] In the classification problem of massive data, "active learning" (reference: McCallum and K.Nigam, "Employing EM in pool-based active learning for text classification," in Proc. of the 15th International Conference on Machine Learning, 1998, pp.350–358.) is a machine learning method that efficiently utilizes expert labeling. Its main idea is: the machine actively and targetedly sele...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/2413G06F18/214
Inventor 张晓宇王树鹏吴广君
Owner INST OF INFORMATION ENG CAS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products