Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Method for data correlation in parallel solving process based on cloud elimination equation of GPU (Graph Processing Unit)

A data correlation and equation technology, applied in complex mathematical operations and other directions, can solve the problems of destroying data continuity, destroying the natural continuity of grids, etc., to avoid low efficiency, ensure correctness, and eliminate data correlation.

Inactive Publication Date: 2012-06-20
NAT UNIV OF DEFENSE TECH
View PDF5 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The discretization of the grid in the second method destroys the natural continuity between the grids to a certain extent, thus destroying the continuity of the data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for data correlation in parallel solving process based on cloud elimination equation of GPU (Graph Processing Unit)
  • Method for data correlation in parallel solving process based on cloud elimination equation of GPU (Graph Processing Unit)
  • Method for data correlation in parallel solving process based on cloud elimination equation of GPU (Graph Processing Unit)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] figure 1 It is a schematic diagram of the relationship between grid, block, warp and thread in CUDA. In the figure, the thread grid grid is a three-dimensional structure of all thread sets triggered when a kernel call is started, and is divided into multiple three-dimensional structure thread blocks composed of the same number of threads in three dimensions. In a block, 32 consecutive Threads form a warp, which is the basic unit of management, scheduling, and execution.

[0038] figure 2 It is a two-dimensional schematic diagram of eliminating data correlation, where the blank area is the grid currently processed in parallel by multiple threads, and the shaded area is the shadow area of ​​these grids. The execution speed of the two threads in the left picture is different, resulting in the accumulation of current density values ​​to the same grid at the same time; the SIMT mechanism in the right picture makes the threads synchronized, and accumulates the current dens...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for data correlation in the parallel solving process based on a cloud elimination equation of a GPU (Graph Processing Unit) and aims at increasing the reusability and access efficiency of the data. The invention has the technical scheme that the data correlation between every two warp interior threads is eliminated by using a parallel mechanism of an SIMT (Single-Instruction Multiple-Thread); constructing a warp block by grids processed by 32 threads in a warp; determining an organization mode of the warp block; restricting the three dimensionality of the block and the grids according to the capacity of a shared memory; carrying out discretization on a whole simulation area with the warp block as a basic unit; dividing a global task into 8 groups; avoiding the data correlation between every two warp blocks in each group; starting kernel calling for 8 times; and finishing update of the current density of 1 / 8 grid in the whole simulation area. According to the method disclosed by the invention, the condition that no multiple threads are used for updating the current density of the same grid at the same time can be ensured; the data correlation between adjacent grids is eliminated; the reusability and high-efficiency access of the data are realized; and the operation speed of a CUDA (Compute Unified Device Architecture) program is increased.

Description

technical field [0001] The invention relates to a method for eliminating data correlation existing in the process of parallel solving of cloud equations in the field of computer high-performance computing, and in particular to a method for eliminating data correlation existing in the process of parallel solving of cloud equations by using the SIMT architecture in GPU. Background technique [0002] CUDA (Compute Unified Device Architecture) is a development environment and software system launched by NVIDIA for general computing on its GPU (Graph Processing Unit). The program running on the GPU is called kernel (kernel function), where the threads in the kernel are organized in the form of grid (thread grid), each grid is composed of several blocks (thread blocks), and each block is composed of several Thread (thread), where grid and block are three-dimensional structures composed of threads. When the kernel is scheduled for execution, the threads in the corresponding grid a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/11
Inventor 廖湘科杨灿群石志才王锋易会战黄春赵克佳陈娟吴强
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products