Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Quantification method of convolutional neural network based on knowledge distillation

A technology of convolutional neural network and quantization method, applied in neural learning method, biological neural network model, neural architecture, etc., to achieve the effect of increasing the degree of compression, compressing storage requirements and computing requirements, and high compression ratio

Pending Publication Date: 2020-12-01
MOMENTA SUZHOU TECH CO LTD
View PDF2 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a quantization method of convolutional neural network based on knowledge distillation, which solves the binarization problem of depth separable convolution, and then designs a more suitable model optimization for binarized convolutional neural network method, and applied to the ResNet series network, compared with the optimization method in the prior art, it can greatly improve the network accuracy of the convolutional neural network based on the ResNet series after binarization, and it is actually used for visual scenes of classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Quantification method of convolutional neural network based on knowledge distillation
  • Quantification method of convolutional neural network based on knowledge distillation
  • Quantification method of convolutional neural network based on knowledge distillation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention more clear, the following will clearly and completely describe the technical solutions of the embodiments of the present invention in conjunction with the drawings of the embodiments of the present invention. Apparently, the described embodiments are some, not all, embodiments of the present invention. Based on the described embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention. Unless otherwise defined, the technical terms or scientific terms used herein shall have the usual meanings understood by those skilled in the art to which the present invention belongs.

[0039] The words "comprising" or "comprising" and other similar words used in the patent application specification and claims of the present invention mean that...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a quantification method of a convolutional neural network based on knowledge distillation. The invention relates to the technical field of convolutional neural networks. The optimal cutting proportion and quantization digit of each network weight layer of the convolutional neural network is obtained by using a dynamic search mode; the trained convolutional neural network issequentially subjected to sparsification, knowledge distillation fine adjustment training by adopting a teacher network, layer-by-layer quantification and knowledge distillation fine adjustment training by using the teacher network again, so that the storage requirement and the calculation requirement of the convolutional neural network are greatly compressed on the premise of keeping the networkprecision. In the quantitative compression process of the convolutional neural network, the teacher network is used for knowledge distillation-based fine adjustment training, different quantization precisions can be adopted for different network weight layers of the convolutional neural network, and the network precision cannot be excessively lost, so that the compression degree of the convolutional neural network can be increased in the quantization process.

Description

technical field [0001] The invention relates to the technical field of convolutional neural networks, in particular to a quantification method of convolutional neural networks based on knowledge distillation. Background technique [0002] At present, a large number of visual application scenarios at home and abroad use convolutional neural network (Convolutional Neural Network) for feature extraction, including online image classification, recognition, detection services, face recognition, re-identification, security monitoring and other application scenarios. Compared with the traditional scheme, the solution based on convolutional neural network has the advantages of high precision and strong generalization ability, but it requires a large amount of calculation, high hardware requirements, and high storage requirements, which makes the popularization and use of the scheme subject to limit. The general convolutional neural network uses floating-point numbers for related op...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/082G06N3/045
Inventor 吴梓恒胡杰曾梦泽
Owner MOMENTA SUZHOU TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products