Fixed-point neural network model quantification device and method

A neural network model and model technology, applied in the field of artificial intelligence, can solve problems such as the inability to completely avoid rounding errors in model operations, immature quantification technology, and the inability to effectively guarantee model accuracy, so as to improve execution efficiency and transparency, Good precision, easy to use effect

Active Publication Date: 2020-03-27
陈子祺 +1
View PDF8 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, on the one hand, the existing model quantization technology is not mature enough. While the performance of the model is improved, the accuracy of the model cannot be effectively guaranteed.
On the other hand, most of the existing model quantization devices only accelerate some specific operators (such as convolution operators, matrix multiplication operators), but there are still a large number of floating-point intermediate values ​​in the calculation process, which This kind of semi-integer quantization still cannot completely avoid the rounding error in the model operation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fixed-point neural network model quantification device and method
  • Fixed-point neural network model quantification device and method
  • Fixed-point neural network model quantification device and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In order to make the purpose, technical solution and advantages of the application clearer, the embodiments of the application will be described in detail below in conjunction with the accompanying drawings. It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined arbitrarily with each other.

[0056] According to the first aspect of the present application, the flow of a fixed-point neural network model quantification method is as follows:

[0057] Verification stage: the verification graph model is a directed acyclic graph, and the multi-input graph model is converted into a single-input model; among them, the graph model has no repeated naming symbols, and the useless parameters of the model are removed. If the verification graph model does not meet the standard, Then the error information can be fed back to the user; among them, the relevant verification of the operator can al...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a fixed-point neural network model quantification method and device. The method comprises the following steps: a verification stage: a verification graph model is a directed acyclic graph, and a multi-input graph model is converted into a single-input model; a preparation stage: the graph model is converted equivalently to facilitate subsequent quantification; a scaling stage: all samples are input and executed in the floating point model, statistics is conducted on output of all operators in the model, and possible output domain values of the operators in all the samples are predicted according to characteristics of output data; and a quantization stage: fixed-point conversion is performed on the operator according to topological sorting by the model. According tothe method, the storage and calculation overhead of the model can be effectively reduced, the uncertainty caused by rounding errors in floating point operation is eliminated, and the high efficiency,transparency and safety of the deep neural network model are improved.

Description

technical field [0001] The present application relates to a fixed-point neural network model quantification device and method, which are suitable for the technical field of artificial intelligence. Background technique [0002] Deep neural network models are widely used in machine vision tasks such as image classification and object detection, and have achieved great success. However, due to the limitations of storage space and power consumption, the storage and computation of neural network models on embedded chips and specially designed neural network chips is still a great challenge. At the same time, the design of existing neural network models usually only considers the accuracy rate, but does not consider the reproducibility and consistency of the operation, and the operation results may be inconsistent under different architectures or even the same computing environment. Affected by this, the application of neural network algorithms in fields with higher security req...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/063G06N3/04
CPCG06N3/063G06N3/045Y02D10/00
Inventor 陈子祺田甲
Owner 陈子祺
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products