Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Convolutional neural network INT8 quantification method, system and device and storage medium

A technology of convolutional neural network and quantization method, which is applied in the field of systems, equipment and storage media, and INT8 quantization method of convolutional neural network, which can solve problems such as high requirements for image data distribution, unfavorable hardware underlying design, and weak expression ability , to achieve the effect of improving quantization accuracy, reducing model correction time, and reducing accuracy loss

Pending Publication Date: 2020-10-13
SUZHOU KEDA TECH
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Although some convolutional neural network quantization methods have appeared in the prior art, such as the quantization methods of Google and Cambrian, these methods generally adopt linear quantization, and the expression ability of smaller values ​​in parameters and input and output is not strong. prone to loss of accuracy
Moreover, there are often floating-point numbers in the model inference of the existing methods, which is not conducive to the underlying design of the hardware
In addition, the model correction in the existing scheme requires more pictures to determine the coefficients of the convolutional layer, and has higher requirements on the distribution of picture data. For example, the existing TensorRT quantization requires thousands of pictures for quantization.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convolutional neural network INT8 quantification method, system and device and storage medium
  • Convolutional neural network INT8 quantification method, system and device and storage medium
  • Convolutional neural network INT8 quantification method, system and device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The same reference numerals denote the same or similar structures in the drawings, and thus their repeated descriptions will be omitted.

[0065] Such as figure 1 As shown, in an embodiment of the present invention, the convolutional neural network INT8 quantization method includes the following steps:

[0066] S100: Obtain parameter quantization coefficients of the convolutional layer, and perform parameter nonlinear mapping on the parameters of the convolutional layer to obtain quantized parameters;

[0067] S200: Input the corrected image data into the convo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a convolutional neural network INT8 quantification method, system and device and a storage medium, and the method comprises the steps: obtaining a parameter quantification coefficient of a convolution layer, carrying out the parameter nonlinear mapping of a parameter of the convolution layer, and obtaining a quantified parameter; obtaining an input quantization coefficient of the convolution layer, and establishing an input quantization function of the convolution layer, the input quantization function being used for performing nonlinear mapping on input data of the convolution layer to obtain quantized input data; and obtaining an output quantization coefficient of the convolution layer, and establishing an output quantization function of the convolution layer according to the quantized parameter and the input quantization function, the output quantization function being used for performing nonlinear mapping on output data of the convolution layer to obtain quantized output data. By the adoption of the method and device, off-line nonlinear quantization is conducted on model parameters, input and output, pure integer operation of the whole model is achieved,and quantization precision is improved.

Description

technical field [0001] The present invention relates to the technical field of data processing, in particular to a convolutional neural network INT8 quantization method, system, device and storage medium. Background technique [0002] Today's neural network algorithms have great potential and amazing recognition rates in the visual field, and have huge applications in many fields. With the rise of mobile phones, the processing power of mobile phones has become stronger and stronger. The size of a few hundred megabytes is still insufficient. Most neural network model training is 32-bit floating-point numbers, and based on such models, in most cases, they cannot be well ported to mobile terminals, so model compression is particularly important. [0003] Although some convolutional neural network quantization methods have appeared in the prior art, such as the quantization methods of Google and Cambrian, these methods generally adopt linear quantization, and the expression abi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/08G06N3/045
Inventor 徐超杨冬梅艾佳楠章勇曹李军
Owner SUZHOU KEDA TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products