Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Pruning method and system based on Crossbar architecture

A pruning and scheme technology, applied in the direction of neural architecture, neural learning methods, biological neural network models, etc., can solve the problems of compression ratio limitation, waste of hardware resources, and inability to use sparsity, so as to improve compression ratio and reduce resource occupation , the effect of reducing waste

Active Publication Date: 2020-08-07
SHANGHAI JIAO TONG UNIV
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In some existing technologies, the weight of each column in each Crossbar is used as the basic pruning granularity, and the remaining columns are left-shifted and filled. This method finally obtains the pruning result. On the one hand, the algorithm only uses The sparsity in the column direction of the neural network weight matrix cannot be used, but the sparsity in the row direction cannot be used, so its compression ratio will be limited
On the other hand, after the algorithm performs left shift and complement operation on the remaining weights in the pruning results, most of the units in some Crossbars are idle, wasting hardware resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pruning method and system based on Crossbar architecture
  • Pruning method and system based on Crossbar architecture
  • Pruning method and system based on Crossbar architecture

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0075] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Apparently, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts all belong to the protection scope of the present invention.

[0076] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0077] image 3 It is a schematic flow chart of the pruning method based on the Crossbar architecture of the present invention. Such as image 3 As shown, the pruning method based on the Crossba...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a pruning method and system based on a Crossbar architecture. The method comprises the following steps: carrying out structural pruning on a to-be-processed weight matrix of acurrent layer of a neural network without considering an architecture factor to obtain a first weight matrix, wherein the weight matrix is a weight matrix of the neural network; determining a pruningscheme according to the size of the first weight matrix and the size of a crossbar, wherein the pruning scheme comprises block-based structured pruning, crossed array row pruning and crossed array column pruning; pruning the first weight matrix according to the pruning scheme to obtain a second weight matrix; and mapping the second weight matrix to a crossbar array in an accelerator, and accelerating the neural network through the accelerator. The resource occupation of Crossbar can be reduced, and the waste of hardware resources can be reduced.

Description

technical field [0001] The present invention relates to the field of network acceleration, in particular to a pruning method and system based on Crossbar architecture. Background technique [0002] At present, deep neural network (DNN) is widely used in image classification, target detection, semantic segmentation, speech recognition and other fields. However, due to the large weight size and computational load of mainstream neural networks, many neural network accelerators are designed to use the parallelism of the calculation of the convolutional layer and the fully connected layer in the neural network to speed up the reasoning process of the network. One of them is A typical accelerator is a vector-matrix multiplication accelerator based on the Crossbar architecture. [0003] figure 1 It is an architectural diagram of a typical neural network accelerator based on a crossbar architecture. The chip includes a plurality of small tiles (Tile), and each tile (Tile) is compo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06N3/04
CPCG06N3/082G06N3/045Y02D10/00G06N3/0464G06N3/065
Inventor 蒋力褚超群
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products