Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for realizing heterogeneous many-core of sparse matrix-vector multiplication based on domestic SW26010 processors

A sparse matrix and implementation method technology, applied in the direction of electrical digital data processing, instruments, machine execution devices, etc., can solve the problem of unbalanced load access bandwidth utilization sparse matrix, etc., to improve the utilization rate of memory access bandwidth and realize load The effect of balancing and reducing the total memory access

Active Publication Date: 2017-05-31
INST OF SOFTWARE - CHINESE ACAD OF SCI
View PDF10 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This patented technology allows for better resource management by dividing tasks into smaller parts based upon their characteristic values instead of relying solely on previous techniques like assigning or randomizing resources. By doing that, system efficiency increases while reducing unnecessary workload. Additionally, the spaware approach uses sparse data elements without storing them locally has lower latency than traditional approaches due to its location within an image processing unit (GPU). Overall, these technical improvements help optimize both hardware usage and software applications running at GPUs.

Problems solved by technology

This patented technical solution describes various techniques for efficiently performing computations involving multiply data streams called sparsity or sparse matrices. One technique uses algorithms like convolution decomposition, random field approximation, linear programming, etc., while another method includes decomposable multidimensional mapping and advanced modulation systems. Additionally, certain methods aim at reducing memory access latency without sacrificializing thread execution capabilities. Overall, both types of solutions provide benefits over conventional approaches due to their ability to optimize resource allocation and minimize interference during computation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for realizing heterogeneous many-core of sparse matrix-vector multiplication based on domestic SW26010 processors
  • Method for realizing heterogeneous many-core of sparse matrix-vector multiplication based on domestic SW26010 processors
  • Method for realizing heterogeneous many-core of sparse matrix-vector multiplication based on domestic SW26010 processors

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention will be described in detail below in conjunction with examples.

[0044] like figure 1 As shown, the implementation process of the main core version of SpMV is as follows:

[0045] (1) Carry out cyclic calculation for each row of the sparse matrix, first obtain the current row number and judge, if the current row number is less than the total number of rows of the sparse matrix, proceed to the next step;

[0046] (2) Traverse all the sparse non-zero elements in each row, obtain the value information of the current non-zero element and the column subscript information through array access, and obtain the value of the vector x according to the column subscript information, multiply the two together and obtain Accumulate to get the calculation result of the current row;

[0047] (3) Assign the calculation result to the vector y.

[0048] like figure 2 Shown, the specific realization of SpMV of the present invention is as follows:

[0049] (1) Ca...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for realizing heterogeneous many-core of sparse matrix-vector multiplication based on domestic SW26010 processors. As the non-zero elements of a sparse matrix are distributed very irregularly, two different static and dynamic task partitioning methods are designed in the method, so as to adapt to different sparse matrices; a set of dynamic and static cache mechanism is provided, so as to promote the memory access hit rate of a vector x; a set of self-adapting optimization method is provided, specific to the sparse matrix, the optimal execution parameters can be dynamically selected, so as to promote the running performance. According to the method disclosed by the invention, 16 sparse matrices in a Matrix Market matrix set are adopted to conduct test, the running edition SpMV has about 10 times of acceleration to the utmost extent compared with the single main core of a domestic SW processor, and the average speed-up ratio is 6.51.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Owner INST OF SOFTWARE - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products