Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Semi-supervised learning pseudo label assignment method based on clustering fusion

A semi-supervised learning and labeling technology, applied in the field of pseudo-label assignment of semi-supervised learning, can solve the problems of large manpower and financial costs, and achieve the effect of clear principle, easy code, and easy understanding

Pending Publication Date: 2021-02-26
STATE GRID GASU ELECTRIC POWER RES INST
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in daily life, relying on manual labeling data often consumes a lot of human and financial costs, and unlabeled data is often easily obtained in large quantities. Therefore, in recent years, semi-supervised and unsupervised learning has attracted the attention of researchers. focus on

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semi-supervised learning pseudo label assignment method based on clustering fusion
  • Semi-supervised learning pseudo label assignment method based on clustering fusion
  • Semi-supervised learning pseudo label assignment method based on clustering fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0037] refer to Figure 1~2 , which is the first embodiment of the present invention, this embodiment provides a method for assigning pseudo-labels of semi-supervised learning based on cluster fusion, including:

[0038] S1: Construct a feature extraction convolutional neural network, use labeled data and unlabeled data for neural network pre-training, and use the trained network to extract data features. It should be noted that,

[0039] Pre-training the feature extraction convolutional network includes training the resnet101 network with the imagenet database, and then using the pre-trained network to extract the features of all samples in the training data set, and setting the label data feature to f l (x j μ j ), x j is the jth label data, μ j For its corresponding label, the unlabeled data feature is f u (x i ), x i is the i-th unlabeled data.

[0040] S2: Use the nearest neighbor method to assign pseudo-labels to a batch of unlabeled data closest to the labeled ...

Embodiment 2

[0057] In the second embodiment of the present invention, in order to better verify and illustrate the technical effect adopted in the method of the present invention, in this embodiment, the application of pedestrian re-identification is selected for testing, and the test results are compared by means of scientific demonstration. To verify the real effect of the method;

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a semi-supervised learning pseudo label assignment method based on clustering fusion, and the method comprises the steps: carrying out the semi-supervised learning of a convolutional neural network with a label-free data set, carrying out the pre-training of the neural network through employing labeled data and label-free data, and extracting data features through employinga trained network; assigning pseudo tags to N pieces of untagged data closest to the tagged data by using a nearest neighbor method; analyzing all the data information by using k-means clustering, and endowing clustered pseudo tags to the data which is not tagged; and continuously training the convolutional neural network by using the obtained label data and pseudo label data to obtain an optimalnetwork for label assignment. The method can be suitable for semi-supervised learning under deep learning in various fields; information of label-free data can be fully mined, and training data withricher content are provided for a network; the principle is clear and easy to understand, and codes are easy to implement.

Description

technical field [0001] The invention relates to the technical field of pseudo-label assignment of semi-supervised learning, in particular to a method for assigning pseudo-labels of semi-supervised learning based on cluster fusion. Background technique [0002] With the increasing development of deep learning, fully supervised learning using labeled data to train neural networks has achieved good results. However, in daily life, relying on manual labeling data often consumes a lot of human and financial costs, and unlabeled data is often easily obtained in large quantities. Therefore, in recent years, semi-supervised and unsupervised learning has attracted the attention of researchers. focus on. Semi-supervised learning is between supervised learning and unsupervised learning. It not only takes into account the accuracy of supervised learning, but also takes into account the practicability of unsupervised learning. It is a key issue in the field of pattern recognition and ma...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/088G06N3/045G06F18/2155G06F18/23213G06F18/24147
Inventor 白万荣张玉刚魏峰朱小琴刘吉祥王蓉张蕾
Owner STATE GRID GASU ELECTRIC POWER RES INST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products