Image classification method based on hyper-parameter optimization CNN

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A classification method and hyperparameter technology, applied in the field of image recognition, can solve the problems of optimal CNN image classification, low optimization efficiency, and hyperparameter optimization staying in local optimality, so as to avoid unsatisfactory CNN classification accuracy and improve convergence speed. and convergence accuracy, improving the effect of easily falling into local optimum

Active Publication Date: 2019-11-01

NORTHEASTERN UNIV

View PDF4 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Hyperparameter optimization based on particle swarm optimization, its search efficiency far exceeds other hyperparameter optimization algorithms such as grid search and random search, speeds up the search time of hyperparameter optimization, and solves the problem of low optimization efficiency and time-consuming traditional hyperparameter optimization And other issues

However, the particle swarm algorithm has the problem that it is easy to fall into the local optimum, which will cause the hyperparameter optimization to stay at the local optimum instead of the global optimal set of hyperparameters, which to a certain extent makes it impossible to search for the optimal CNN performance. A set of hyperparameters, so that CNN image classification cannot achieve the most ideal results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0046] Such as figure 1 As shown, the method of this embodiment is as follows.

[0047] Step 1: Select the benchmark data set handwritten digit recognition MNIST data set as the data set to be classified, the fragments of the data set are as follows figure 2 As shown, the dataset has 70,000 grayscale images, each image is 28×28 pixels, which contains 10 categories, and each category has 7,000 images. Randomly select 60,000 images in the data set as the training set, and the remaining 10,000 images as the test set, and randomly select 6,000 images from the training set as the verification set.

[0048] Step 2: Build a CNN architecture, including convolutional layer C1, pooling layer P1, convolutional layer C2, and pooling layer P2, which is activated and terminated by Softmax. According to the structural characteristics of the CNN architecture, the structural parameters of the selected convolutional layer C1 and the pooling layer P1 are hyperparameters of the present inventi...

Embodiment 2

[0081] Such as figure 1 As shown, the method of this embodiment is as follows.

[0082] Step 1: Select the benchmark data set object recognition cifar-10 data set as the data set to be classified. The fragments of the data set are as follows Figure 4 As shown, the dataset has 60,000 32×32 pixel color images, which contain 10 categories, each category has 6,000 images. Randomly select 50,000 images in the data set as the training set, the remaining 10,000 images are used as the test set, and 5,000 images are randomly selected from the training set as the verification set.

[0083] Steps 2 to 9 are the same as in Example 1, and the classification performance of the cifar-10 data set before and after the improvement of the hyperparameter optimization method changes with the number of iterations as follows: Figure 5 shown.

[0084] The comparison of the image classification accuracy of the handwritten digit recognition MNIST dataset and the object recognition cifar-10 dataset...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides an image classification method based on hyper-parameter optimization CNN, and belongs to the technical field of image recognition. According to the method, structural parametersof a convolution layer C1 and a pooling layer P1 are selected as hyper-parameters of the method according to structural characteristics of a CNN architecture. The value range of the hyper-parametersis limited to be (Xl, Xu). After that,, the hyper-parameter of the CNN is optimized by adopting two periodic variation PSO algorithms of global variation and local variation. Therefore, the optimization of the hyper-parameter by the traditional PSO algorithm is prevented from staying at local optimum. The image classification performance which is more competitive than the traditional PSO algorithmis obtained. Compared with the prior art, the efficiency and the cost of deep learning CNN hyper-parameter optimization are obviously improved. The image classification potential of the CNN architecture is brought into play to the maximum extent. Hardware resources and calculation cost during image classification of the CNN are saved. The method has certain application value in engineering practice.

Description

technical field [0001] The invention relates to the technical field of image recognition, in particular to an image classification method based on hyperparameter optimization CNN. Background technique [0002] Image classification technology has been relatively mature, and CNN architectures suitable for different scene classifications emerge in an endless stream, but complex CNN structures often consume hardware resources and computing costs. Before CNN is used for image classification training, some internal parameters of CNN need to be set in advance. These parameters are called hyperparameters. Selecting a set of optimal hyperparameters can maximize the performance of CNN image classification without changing the structure of CNN. Therefore, it is particularly important in engineering practice to select suitable hyperparameters to fully release the image classification performance of the CNN architecture. [0003] Research on hyperparameter optimization methods for image...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/62G06N20/20

CPCG06N20/20G06F18/24147G06F18/241Y02T10/40

Inventor 付俊王思淼

Owner NORTHEASTERN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Image classification method based on hyper-parameter optimization CNN

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology