Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

A method and a device for improving the computing performance of a GEMM

A technology of computing performance and algorithms, applied in the computer field, can solve problems such as poor results

Active Publication Date: 2019-03-12
XFUSION DIGITAL TECH CO LTD
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For the detection stage of deep learning, the existing methods to improve the computing performance of GEMM are not effective.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and a device for improving the computing performance of a GEMM
  • A method and a device for improving the computing performance of a GEMM
  • A method and a device for improving the computing performance of a GEMM

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The method and device for improving the computing performance of GEMM disclosed in this application can be applied in the process of detecting targets using deep convolutional neural networks. Such as figure 1 As shown, after completing the training of the deep convolutional neural network, the deep convolutional neural network is used to detect the acquired objects. For example, a face image is obtained through an image acquisition device, and as a target, a trained deep convolutional neural network is used to detect the face image, and a detection result indicating the identity of the face is obtained.

[0025] Among them, in the process of detecting the target, the deep convolutional neural network converts the convolution calculation into GEMM calculation. During the research process, the applicant found that the existing methods for improving the performance of GEMM calculations are aimed at large-scale matrices. Therefore, they should be used in the process of tr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application provides a method and apparatus for improving the computational performance of a GEMM, to the method comprises steps of obtaining the parameters of multiplication of the general matrix-matrix and GEMM calculation to be optimized and querying the target parameters from the parameters calculated by the at least one historical GEMM, and the target parameters are the parameterssatisfying a preset relationship with the parameters calculated by the GEMM calculation to be optimized. According to the preset correspondence between the parameters and the optimization method, anoptimization method corresponding to the objective parameters is determined. And optimizing the GEMM calculation to be optimized by using the optimization method corresponding to the objective parameters. Wherein the parameters calculated by the GEMM are determined based on the size of the matrix participating in the GEMM calculation. Because the characteristics of the matrices involved in the GEMM computation are used as the basis for the optimization of the GEMM computation, the performance of the GEMM computation can be improved even if the matrices are small in size or irregular in shape in the process of target detection using depth convolution neural network.

Description

technical field [0001] The present application relates to the computer field, in particular to a method and device for improving GEMM computing performance. Background technique [0002] General Matrix-matrix Multiplication (GEMM) calculations are generally calculations for dense matrices and are widely used in deep learning. Deep learning is a method based on representation learning of data in machine learning. [0003] With the development of deep learning, deep convolutional neural network has become the most widely used network structure, and it is widely used in the fields of image and speech. The core algorithm of the deep convolutional neural network is convolution calculation, and the current mainstream implementation method is to convert convolution calculation into GEMM calculation. Studies have shown that GEMM calculations occupy most of the computing resources of deep convolutional neural networks. It can be seen that the performance of GEMM calculations direct...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/16G06N3/04
CPCG06F17/16G06N3/045
Inventor 齐霁张邵敏贾海鹏
Owner XFUSION DIGITAL TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products