LSTM (Long Short-Term Memory) forward direction operation accelerator based on FPGA (Field Programmable Gate Array)

An accelerator and hardware accelerator technology, applied in complex mathematical operations, instruments, calculations, etc., can solve problems such as limited promotion, long programming development cycle, and inability to modify circuit logic, and achieve the effect of improving computing performance.

Inactive Publication Date: 2018-11-06
SUZHOU INST FOR ADVANCED STUDY USTC
View PDF4 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But ASIC also has disadvantages: the programming development cycle is long, and it cannot be modified after the circuit logic design and manufacture are completed, and if the logic needs to be modified, it must be redesigned
Nowadays, the research on neural network algorithms is constantly innovating, so the poor design flexibility of ASIC limits its promotion in the field of scientific research

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • LSTM (Long Short-Term Memory) forward direction operation accelerator based on FPGA (Field Programmable Gate Array)
  • LSTM (Long Short-Term Memory) forward direction operation accelerator based on FPGA (Field Programmable Gate Array)
  • LSTM (Long Short-Term Memory) forward direction operation accelerator based on FPGA (Field Programmable Gate Array)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0046] figure 1 It is a working schematic diagram of the LSTM neural network forward computing accelerator system based on FPGA, which is divided into software side and hardware side. In the specific application tasks performed on the software side, when it is necessary to use the LSTM neural network for forward computing, upload the computing tasks to It is completed on the FPGA hardware accelerator, and the calculation results of the hardware are sent back to the software side for subsequent operations. According to the parameter scale of the LSTM network, different hardware accelerators are used: when the on-chip storage resources of the FPGA are sufficient to cache all LSTM parameters, a single DMA mode hardware accelerator is used, and all parameters are cached on the chip at one time; when the LSTM parameter scale exceeds When the on-chip storage capacity of the FPGA is used, the dual DMA mode hardware accelerator is used, and some parameters required for each calculatio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a LSTM (Long Short-Term Memory) forward direction operation accelerator based on an FPGA (Field Programmable Gate Array), and works by a hardware and software coordination pattern. A hardware part contains three types of accelerator designs: a single-DMA (Direct Memory Access) pattern LSTM neural network forward direction algorithm accelerator, a double-DMA pattern LSTM neural network forward direction algorithm accelerator and a spare LSTM neural network forward direction algorithm accelerator. The accelerator is used for accelerating a LSTM network forward direction calculation part and comprises a matrix-vector multiplication module, an Element-wise operation module and an activation function module. The single-DMA pattern accelerator has a good operation effecton the aspects of performance and an energy efficiency ratio. The double-DMA pattern accelerator and the sparse network accelerator have the good effect on the aspect of the energy efficiency ratio, and in addition, more on-chip storage resources of the FPGA can be saved.

Description

technical field [0001] The invention relates to the field of computer hardware acceleration, in particular to an FPGA-based LSTM forward operation accelerator and a design method thereof. Background technique [0002] Long short-term memory (LSTM) network is currently the most mainstream variant of recurrent neural network (RNNs), which solves the long-term dependence problem faced by traditional recurrent neural network algorithms in practical tasks. Based on the characteristics of the LSTM network, it is widely used in the fields of speech recognition, machine translation and image processing. Since the LSTM neural network contains a large number of parameters, the operation process is a calculation-intensive operation. Using a general-purpose hardware platform such as a CPU to execute the LSTM algorithm does not perform well in terms of computing performance and energy consumption. Therefore, using dedicated hardware accelerators to improve the computing performance of L...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/16G06F7/523
CPCG06F7/523G06F17/16
Inventor 李曦周学海王超张奕玮
Owner SUZHOU INST FOR ADVANCED STUDY USTC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products