Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Low bit width bias cross-border processing method in quantitative reasoning process

A technology of reasoning process and processing method, which is applied in the field of low-bit-width bias out-of-bounds processing in the quantitative inference process, and can solve problems such as affecting the quantitative inference accuracy of models, model inference errors, and bias out-of-bounds.

Pending Publication Date: 2022-07-01
合肥君正科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in many low-bit models, in order to ensure the calculation speed in the process of model quantization and reasoning, the feature and bias calculation results are stored in int16-bit width, so that the bias will inevitably appear out of bounds
[0003] In particular, when quantizing float 32bit to a low bit, due to the need to ensure the calculation speed after quantization, the network model needs to be calculated on int16. Due to the addition of corresponding quantization operation methods, when merging quantization parameters, the bias is susceptible to various Influenced by factors, the bias is prone to cross-border situations, which in turn affects the quantitative reasoning accuracy of the model, causing reasoning errors in the model during the reasoning process

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Low bit width bias cross-border processing method in quantitative reasoning process
  • Low bit width bias cross-border processing method in quantitative reasoning process
  • Low bit width bias cross-border processing method in quantitative reasoning process

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] In order to understand the technical content and advantages of the present invention more clearly, the present invention will now be further described in detail with reference to the accompanying drawings.

[0034] This application belongs to the processing method of deep neural network based on low-bit (4bit, 5bit) training and prone to bias out-of-bounds in the forward reasoning process. The output results on the inference library are consistent with the output results of the network training framework. It is a processing method to prevent bias out-of-bounds in low-bit quantization bit width and ensure the correct forward inference of the model.

[0035] Based on the low-bit quantization operation process to ensure the accuracy of model inference and reduce the loss of accuracy in the process of model inference, the gamma value of the batchnorm channel is processed separately to avoid the problem of bias outliers in the batchnorm during the weight merging process. The...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The method aims at eliminating abnormal values existing when bias are combined in the low-bit quantization process of the model, and it is ensured that the model reasoning precision and the training network framework are kept consistent. The invention provides a low bit wide bias border crossing processing method in a quantitative reasoning process, and the method comprises deep learning convolution floating point type float operation, and comprises the following steps: S1, inputting floating point type data; s2, convolution is carried out, and weight merging is carried out; s3, carrying out batch standardization, combining related parameters in a quantization process, independently processing a gamma value of a batchnorm channel, and carrying out a derivation formula on the batchnorm in a specific quantization process as follows: assuming quantization calculation of an ith layer: limiting the gamma value to 0.1 or more in an actual training process, and ensuring that the bias does not cross a boundary; and S4, outputting a floating point type result.

Description

technical field [0001] The invention relates to the technical field of image processing, in particular to a low-bit-width bias out-of-bounds processing method in a quantization inference process. Background technique [0002] In the prior art, as the number of layers stacked in the convolutional neural network model increases, the number of weight parameters of the network model also increases. The main purpose of low-bit quantization of the model is to reduce the model storage volume and feature output. Save the bit width, which can speed up the operation. However, in many low-bit models, in order to ensure the operation speed during the model quantization inference process, the feature and bias operation results are stored in an int16-bit width, so that the bias out-of-bounds situation will inevitably occur. [0003] In particular, when the float 32bit is quantized to low bits, the network model needs to be operated on int16 because of the need to ensure the operation spe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N5/04G06N3/04G06N3/08
CPCG06N5/045G06N3/04G06N3/08
Inventor 周飞飞
Owner 合肥君正科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products