Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A convolutional neural network quantization circuit and quantization method

A convolutional neural network and circuit technology, applied in the field of artificial intelligence data processing, can solve the problems of equipment power consumption and running speed not meeting the requirements, new networks cannot be applied and verified, hindering algorithm optimization network progress and other problems, achieving deployment and operational reliability, reduced storage capacity and bandwidth requirements, deployment and operational assurance effects

Active Publication Date: 2020-11-03
INSPUR GROUP CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Due to the limitation of mobile terminal processing and storage level development, the artificial neural network with doubled depth and size can only be run in processing machines with large-scale computing resources, and the power consumption and running speed of the device cannot meet the requirements.
End-to-end transplantation cannot be performed, and thus cluster deployment cannot be performed
Some new networks suitable for various scenarios cannot be applied and verified, which hinders the optimization of algorithms and the progress of networks to a certain extent

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A convolutional neural network quantization circuit and quantization method
  • A convolutional neural network quantization circuit and quantization method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The present invention will be further described below in conjunction with specific examples.

[0030] A convolutional neural network quantization circuit includes an original parameter pool, a comparator array, a quantization parameter calculation unit, an arithmetic operation unit, a fine-tuning unit and an activation unit.

[0031] The original parameter pool is used to store the original parameter data required for the calculation of each layer of the convolutional neural network, including each channel data and bias data of all convolution kernels of each layer, all expressed in a signed real number data format;

[0032] The comparator array is used to perform statistical operations on the data in the original parameter pool, and iteratively compares to obtain the maximum and minimum values ​​of the parameters of each layer of the convolutional neural network;

[0033] The quantization parameter calculation unit is used to perform arithmetic operations on the maximu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a convolutional neural network quantization circuit and a quantization method, belonging to the technical field of artificial intelligence data processing, including an original parameter pool, a comparator array, a quantization parameter calculation unit, and an arithmetic operation unit. The original parameter pool is used for storing The original parameter data required for each layer of convolutional neural network calculations, including each channel data and bias data of all convolution kernels in each layer; the comparator array is used to perform statistical operations on the data in the original parameter pool, and iteratively compares to obtain The maximum value and the minimum value of the parameters of each layer of the convolutional neural network; the quantization parameter calculation unit is used to perform arithmetic operations on the maximum value and the minimum value to obtain the parameters used for model quantization; the arithmetic operation unit is used for The model is quantized, and the obtained results are expressed in integer format with the number of bits specified by the unsigned bit. The invention can reduce the power consumption of the system through quantization, so that the deployment and operation of the convolutional neural network on the terminal can be more reliably guaranteed.

Description

technical field [0001] The invention relates to the technical field of artificial intelligence data processing, in particular to a convolutional neural network quantization circuit and a quantization method. Background technique [0002] As an important direction in the development trend of artificial intelligence, convolutional neural network has been in a state of intense development. Various new models and new algorithms emerge in an endless stream, continuously injecting new impetus into this field. Among them, the increase in the depth and scale of the network model is the main development direction. In the process of continuous improvement in accuracy, the deployment and implementation of the neural network is facing great challenges. [0003] Due to the limitations of the development of mobile terminal processing and storage levels, the artificial neural network whose depth and size have doubled can only be run on processing machines with large-scale computing resour...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/06G06N3/04
CPCG06N3/063G06N3/045
Inventor 王子彤姜凯于治楼
Owner INSPUR GROUP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products