Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Image classification method based on self-attention mechanism

A classification method and attention technology, applied in neural learning methods, computer components, instruments, etc., can solve the problems of convolutional network substitution is not obvious, trying imperfections, etc.

Active Publication Date: 2021-09-10
沈阳雅译网络技术有限公司
View PDF20 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the inadequacy of the attention mechanism in the prior art to replace the convolutional network, and the incomplete attempt to introduce the advantages of the Transformer model into the image field, the technical problem to be solved by the present invention is to provide a self-attention mechanism based Image classification method, exploring the application of Transformer structure in image classification tasks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image classification method based on self-attention mechanism
  • Image classification method based on self-attention mechanism
  • Image classification method based on self-attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The present invention will be further elaborated below in conjunction with the accompanying drawings of the description.

[0043] The present invention aims at the current situation that the traditional image classification method is mainly based on the convolutional neural network to extract features, and proposes an image classification method with a self-attention mechanism, which successfully introduces the Transformer model in the natural language processing task into the image task Among them, the global information of the image is extracted through the self-attention mechanism to form image features, which provides more possibilities for subsequent research while achieving effective image classification.

[0044] The present invention provides a kind of image classification method based on self-attention mechanism, and the technical scheme adopted is:

[0045] 1) Construct a Transformer model including a self-attention mechanism, modify the model structure, and a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an image classification method based on a self-attention mechanism, and the method comprises the steps of constructing a Transform model containing the self-attention mechanism, and adding a classifier unit for an image classification task; processing the public data set ImageNet, and adjusting the original picture to a proper size; dividing the adjusted picture into sub-pictures with fixed sizes, connecting the sub-pictures, and performing dimension adjustment to obtain a picture embedding vector; performing two-dimensional position coding to obtain a two-dimensional position coding vector, and connecting the two-dimensional position coding vector with the picture embedding vector to serve as model input; and sending the connected vectors to a Transform model, extracting picture features, converting the vectors output by the model into probability representation through a classifier unit during final decoding, and completing image classification. By using the self-attention mechanism, the global information, namely the picture features extracted by the traditional convolutional neural network, can be effectively extracted from the picture, and the picture classification can be effectively completed based on the extracted features.

Description

technical field [0001] The invention relates to an image classification technology, in particular to an image classification method based on a self-attention mechanism. Background technique [0002] Image classification is a pattern classification problem. Its goal is to divide different images into different categories and achieve the minimum classification error. Its typical method is to extract the features of the image, and assign the classification label to the image based on the features. Image classification tasks have experienced decades of development from traditional methods to deep learning-based methods. The current method is mainly based on the convolutional neural network structure to extract image features, and on the basis of the network, operations such as deepening the number and depth of the model and improving the convolution method are adopted. The change of methods has made the performance of basic image classification tasks close to saturation, and t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/045G06F18/241Y02T10/40
Inventor 杨木润赵闯
Owner 沈阳雅译网络技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products