A Parallel Discrete Update Method for Heterogeneous Many-Core Processors

A parallel discrete update method for heterogeneous many-core processors. It addresses the drawbacks of the producer-consumer approach, which is inconvenient for automatic compiler transformation, reduces program readability and portability, and increases the programming burden. The method aims to preserve readability and portability, avoid adding to the user's burden, and facilitate automatic transformation.

Active Publication Date: 2022-07-12
JIANGNAN INST OF COMPUTING TECH

AI Technical Summary

Problems solved by technology

[0004] The producer-consumer approach has two disadvantages: 1) The source code must be rewritten to fit the producer-consumer model, which turns a simple array discrete update loop in the original code into a complex producer-consumer code segment. This makes automatic transformation by the compiler inconvenient and reduces the readability and portability of the program. 2) The slave cores in the slave core group must be divided proportionally into producers and consumers. The ratio has to be determined by weighing factors such as core processing power and computing load, or obtained by running tests, which increases the programming burden.

Examples

Embodiment

[0040] Embodiment: a parallel discrete update method for heterogeneous many-core processors, comprising the following steps:

[0041] S1. Divide the discrete update data area, whose addresses are contiguous, into blocks, and map the data blocks to the slave cores so that each slave core is responsible for updating one data block, as follows:

[0042] In the discrete update data area with contiguous addresses, the data units are addressed contiguously in the main memory space. An update operation on a data unit in this area takes out the value of the data unit, performs an operation on it (for example, adding a value), and finally writes the result back to the same data unit. The update operations on the data units in the area are discrete.

[0043] S11. The discrete update data area with contiguous addresses is divided into blocks as follows: assuming that the number of slave cores is N,...
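The block mapping in S1 can be sketched as follows. The text of S11 is cut off above, so the block-size rule used here (ceiling division of the element count by the number of slave cores) is an assumption, and the names `block_size`, `owner_of`, and `block_range` are hypothetical helpers, not identifiers from the patent.

```python
def block_size(num_elements: int, num_cores: int) -> int:
    """Elements per block. Ceiling division is an ASSUMPTION; the patent's
    own rule is truncated in the source text."""
    return -(-num_elements // num_cores)


def owner_of(index: int, num_elements: int, num_cores: int) -> int:
    """Slave core responsible for the data unit at `index`."""
    if not 0 <= index < num_elements:
        raise IndexError(index)
    return index // block_size(num_elements, num_cores)


def block_range(core: int, num_elements: int, num_cores: int) -> range:
    """Half-open index range of the block owned by `core` (may be empty)."""
    size = block_size(num_elements, num_cores)
    start = min(core * size, num_elements)
    return range(start, min(start + size, num_elements))
```

Because the data area is contiguous, finding the owner of a data unit needs only one integer division, and every data unit falls into exactly one block.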

Abstract

The invention discloses a parallel discrete update method for heterogeneous many-core processors, comprising the following steps: S1. divide the discrete update data area with contiguous addresses into blocks, and map the data blocks to slave cores so that each slave core is responsible for updating one data block; S2. each slave core classifies the discrete update requests generated during instruction execution into requests to be processed by the slave core itself and requests to be processed by other slave cores, and the corresponding processing function is inserted where each discrete update request is generated; S3. discrete update requests to be processed by the slave core itself are handled according to the processing flow of program segment func1; S4. discrete update requests to be processed by other slave cores are handled according to the processing flow of program segment func2. The invention can achieve parallel discrete update with higher performance, and can handle discrete updates of large data that cannot fit in LDM space.
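Steps S2 to S4 can be illustrated with a small sequential simulation. This is only a sketch under stated assumptions: the visible text does not say what func1 and func2 actually do, so the bodies below (apply locally, or forward to the owning core's inbox) are hypothetical stand-ins, and the block mapping reuses the assumed ceiling-division rule. Real slave cores would run concurrently and exchange requests through hardware mechanisms that are not modeled here.

```python
from collections import defaultdict


def owner_of(index, num_elements, num_cores):
    # ASSUMED mapping: contiguous blocks of ceil(num_elements / num_cores).
    size = -(-num_elements // num_cores)
    return index // size


def func1(data, index, delta):
    """Hypothetical stand-in for func1: a core updates a unit in its own block."""
    data[index] += delta


def func2(inboxes, owner, index, delta):
    """Hypothetical stand-in for func2: forward the request to the owning core."""
    inboxes[owner].append((index, delta))


def parallel_discrete_update(data, requests_per_core):
    """Simulate the slave cores one after another.

    requests_per_core[c] lists the (index, delta) requests generated by core c.
    """
    num_cores = len(requests_per_core)
    inboxes = defaultdict(list)
    # S2: each core classifies the requests it generates.
    for core, requests in enumerate(requests_per_core):
        for index, delta in requests:
            owner = owner_of(index, len(data), num_cores)
            if owner == core:
                func1(data, index, delta)            # S3: own request
            else:
                func2(inboxes, owner, index, delta)  # S4: request for another core
    # Each owner applies the requests that were forwarded to it.
    for owner in range(num_cores):
        for index, delta in inboxes[owner]:
            func1(data, index, delta)
    return data
```

In this sketch every data unit is written only by the core that owns its block, so no two cores ever write the same unit.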

Description

Technical field

[0001] The invention belongs to the technical field of computer compilation and optimization, and in particular relates to a parallel discrete update method for heterogeneous many-core processors.

Background

[0002] There are two types of discrete updates. The first is write-then-read, which means there is a dependence between loop iterations; such loops usually cannot be parallelized. The second involves only a write dependence; it can be parallelized as long as the atomicity of the write operation is guaranteed.

[0003] On heterogeneous many-core platforms, the producer-consumer model is usually used to solve the discrete update problem: the slave cores are divided into two groups, computing cores (producers) and update cores (consumers). A producer is responsible for computing the discrete update requests (intermediate calculation results), determining which consumer each update request belongs to, and sending it to the correspondin...
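The two kinds of discrete update described in [0002] can be illustrated with a toy example. The function names and the loops are illustrative assumptions, not code from the patent; `lost_update` replays one fixed interleaving of two workers by hand to show why the write must be atomic.

```python
def histogram_update(counts, indices):
    """Write-only dependence: each iteration only accumulates into counts[idx],
    so the iterations may run in any order if each += is atomic."""
    for idx in indices:
        counts[idx] += 1
    return counts


def chained_update(values, indices):
    """Write-then-read dependence: iteration i reads the element that
    iteration i-1 has just written, so the order of iterations matters."""
    for i in range(1, len(indices)):
        values[indices[i]] = values[indices[i - 1]] + 1
    return values


def lost_update(counts, idx):
    """Two workers perform a non-atomic read-modify-write on the same element."""
    a = counts[idx]      # worker A reads the old value
    b = counts[idx]      # worker B reads the same old value
    counts[idx] = a + 1  # worker A writes back
    counts[idx] = b + 1  # worker B overwrites it: one increment is lost
    return counts
```

Reordering the input of `histogram_update` leaves the result unchanged, while reversing the input of `chained_update` changes it; `lost_update` ends with a single increment even though two were issued.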

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F16/23
CPC: G06F16/23, Y02D10/00
Inventor: 姜小成, 尤洪涛, 毛兴权, 顾龙, 张立博, 孙俊
Owner: JIANGNAN INST OF COMPUTING TECH