Compression method and system for deep convolutional neural network

A convolutional neural network and neural network technology, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve problems such as model accuracy drop, network accuracy drop, and connection impact

Pending Publication Date: 2020-09-01
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] However, the usual structured pruning method has the following disadvantages: 1) When removing redundancy, unimportant connections are directly removed, resulting in a large drop in network accuracy; 2) The pruning method based on sparse constraints, in the entire In the process of model training, a fixed penalty is always added to all connections, so that important connections are affected in the process of learning sparseness, which leads to a decrease in model accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Compression method and system for deep convolutional neural network
  • Compression method and system for deep convolutional neural network
  • Compression method and system for deep convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0072] Preferred embodiments of the present invention are described below with reference to the accompanying drawings. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the protection scope of the present invention.

[0073] The purpose of the present invention is to provide a compression method for deep convolutional neural networks, taking into account the importance of the convolution kernel, by applying progressive sparse constraints to unimportant filters, and adding regularization terms to the loss function of network training , to obtain the optimized loss function; use the threshold iterative algorithm and the back propagation algorithm to jointly solve the problem, obtain the updated parameters of the deep convolutional neural network to be compressed, and then establish a convolutional neural network model with a filter sparse form, and use the str...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a compression method and system for a deep convolutional neural network, and the method comprises the steps: an unimportant filter in a to-be-compressed deep convolutional neural network is determined according to a filter importance selection mode and/or a model compression rate; progressive sparse constraints are applied to the unimportant filters and added into a loss function of network training as regular terms, and an optimized loss function is obtained; according to the regular term, a threshold iterative algorithm and a back propagation algorithm are adopted for combined solving, and updating parameters of the to-be-compressed deep convolutional neural network are obtained; a convolutional neural network model with a filter sparse form based is obtained onthe optimization loss function and the updating parameters; and the convolutional neural network model is pruned with the filter sparse form by using a structured pruning algorithm to obtain a compressed convolutional neural network model with relatively high network precision.

Description

technical field [0001] The invention relates to the technical fields of convolutional neural networks and artificial intelligence, in particular to a compression method and system for deep convolutional neural networks. Background technique [0002] With the continuous development of deep neural networks in the field of artificial intelligence, such as computer vision, speech recognition, and natural language processing, researchers from all walks of life have achieved more intelligent results by deploying artificial intelligence-related algorithms into actual products. Effect, and then promote the arrival of a new round of artificial intelligence research climax. [0003] However, when deep neural networks are deployed in practical applications, they require huge storage space and high computing resource consumption, making it difficult to apply them in some mobile terminals and embedded devices. Therefore, deep neural network model compression and acceleration methods hav...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06N3/04
CPCG06N3/084G06N3/045
Inventor 胡卫明刘雨帆阮晓峰李兵李扬曦
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products