Method and device for submitting OpenCL kernel

A technology involving kernels and local memory, applied in multiprogramming arrangements, program control design, and instruments, that achieves the effect of improving efficiency.

Pending Publication Date: 2021-03-02
中国船舶集团有限公司 +1

AI Technical Summary

Problems solved by technology

The OpenCL standard only specifies a general programming framework; the specific computation methods must still be designed by each hardware manufacturer.

Examples

Embodiment Construction

[0028] The present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments, but these embodiments should not be construed as limiting the present invention.

[0029] As shown in Figure 1, the present invention provides a method for submitting an OpenCL kernel, comprising the following steps:

[0030] S1. After the kernel is created in the host program, the kernel information is passed to the kernel submission module of OpenCL.

[0031] S2. Check whether the work-item dimension satisfies the condition for conversion from one dimension to two dimensions, and if so, perform the conversion.

[0032] Optionally, as shown in Figure 2, step S2 includes the following sub-steps:

[0033] S21. Determine the work-item dimension and the value of GlobalWorkSize (the number of global work items). If both conditions hold, namely the dimension is 1 and the value of GlobalWorkSize is greater than MaxGlobalWorkSize ...
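The S21 condition can be illustrated with a minimal sketch in plain C. The source text is truncated before it says how the conversion is performed, so the split used here (first dimension set to MaxGlobalWorkSize, second dimension set to the rounded-up quotient) is only an assumption, and `nd_range`, `needs_2d_conversion`, and `convert_1d_to_2d` are hypothetical names rather than part of the OpenCL API or of the patent.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical container for an ND-range; not an OpenCL API type. */
typedef struct {
    unsigned dim;        /* number of work-item dimensions in use (1 or 2) */
    size_t   global[2];  /* global work size per dimension */
} nd_range;

/* S21 condition as stated in the text: dimension is 1 AND
 * GlobalWorkSize is greater than MaxGlobalWorkSize. */
static int needs_2d_conversion(const nd_range *r, size_t max_global_work_size)
{
    return r->dim == 1 && r->global[0] > max_global_work_size;
}

/* ASSUMED split (not taken from the source): width = MaxGlobalWorkSize,
 * height = ceil(total / width). The 2D range may then cover more items
 * than the original 1D range, so a kernel launched this way would need
 * to skip indices >= the original total. */
static nd_range convert_1d_to_2d(nd_range r, size_t max_global_work_size)
{
    assert(max_global_work_size > 0);
    if (!needs_2d_conversion(&r, max_global_work_size))
        return r;  /* condition not met: leave the range unchanged */

    size_t total = r.global[0];
    nd_range out;
    out.dim = 2;
    out.global[0] = max_global_work_size;
    /* Round up without risking overflow in (total + width - 1). */
    out.global[1] = total / max_global_work_size
                  + (total % max_global_work_size != 0);
    return out;
}
```

Under this assumed split, a 1D range of 10 work items with a per-dimension limit of 4 becomes a 4 x 3 range. The sketch does not check whether the second dimension itself exceeds a device limit.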


Abstract

The invention discloses a method for submitting an OpenCL kernel. The method comprises the following steps: 1, passing the kernel information to the OpenCL kernel submission module; 2, converting the work-item dimension from one dimension to two dimensions; 3, checking whether the OpenCL kernel uses local memory, and allocating storage space if it does; 4, checking the size of the local work group; and 5, placing the command containing the OpenCL kernel information in a buffer and submitting it to the hardware device. The invention further discloses a device for submitting an OpenCL kernel. The device comprises a kernel information creation and submission module, a work-item dimension conversion module, a local memory variable module, a local work-group module, and an OpenCL worker thread module. While conforming to the OpenCL standard framework, the invention distributes work tasks to the hardware in a reasonable way and submits the OpenCL kernel to the hardware for execution, which improves the efficiency of parallel operation. The method and device can be widely applied in the field of high-performance parallel computing.
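Step 4 of the abstract (checking the local work-group size) is not detailed in the available text. As a rough illustration only, the sketch below applies the constraints that the OpenCL 1.x specification places on `clEnqueueNDRangeKernel`: each local size must divide the matching global size evenly, and the product of the local sizes must not exceed the maximum work-group size. The function name `local_size_is_valid` is hypothetical, and the patent's own check may differ. OpenCL 2.0 and later can relax the divisibility rule.

```c
#include <assert.h>
#include <stddef.h>

/* Returns 1 if the local work size is acceptable under the OpenCL 1.x
 * rules described above, 0 otherwise. Hypothetical helper, not an
 * OpenCL API function and not the patent's actual procedure. */
static int local_size_is_valid(unsigned dim,
                               const size_t *global,
                               const size_t *local,
                               size_t max_work_group_size)
{
    assert(dim >= 1 && dim <= 3);
    size_t items = 1;  /* running product of local sizes */

    for (unsigned i = 0; i < dim; ++i) {
        /* Local size must be non-zero and divide the global size. */
        if (local[i] == 0 || global[i] % local[i] != 0)
            return 0;
        /* Equivalent to items * local[i] > max, without overflow. */
        if (local[i] > max_work_group_size / items)
            return 0;
        items *= local[i];
    }
    return 1;
}
```

For example, a 64 x 64 global range with an 8 x 8 local size passes when the limit is 64 work items per group, while a 16 x 8 local size fails because it needs 128.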

Description

Technical field

[0001] The invention relates to the field of high-performance parallel computing, in particular to a method and device for submitting an OpenCL kernel.

Background

[0002] In recent years, high-performance parallel computing technology has developed rapidly, and OpenCL, as a general standard for parallel computing, has been widely adopted by major chip manufacturers.

[0003] OpenCL was created to solve the problem of parallel computing across hardware with different architectures. The OpenCL standard allows developers to focus on the design and optimization of algorithms without attending to the programming language of each platform or the details of the underlying hardware, which improves development efficiency.

[0004] Generally speaking, an OpenCL kernel is a function executed on the hardware. Each computing unit in the computing device executes the same function, but on different data, so as to achieve the purpose of ...
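The idea that every computing unit runs the same function on different data can be illustrated with a sequential emulation in plain C. This is only a sketch: real OpenCL kernels are written in OpenCL C and obtain their index from `get_global_id`, whereas here the index is passed in as a parameter, and the names `vec_add_work_item` and `run_1d_range` are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Stand-in for a kernel body: one invocation handles one element,
 * selected by its global id. */
static void vec_add_work_item(size_t gid,
                              const float *a, const float *b, float *out)
{
    out[gid] = a[gid] + b[gid];
}

/* Sequential emulation of a 1D launch: the same function is invoked
 * once per global id. On real hardware these invocations would run
 * in parallel across the computing units. */
static void run_1d_range(size_t global_work_size,
                         const float *a, const float *b, float *out)
{
    assert(a && b && out);
    for (size_t gid = 0; gid < global_work_size; ++gid)
        vec_add_work_item(gid, a, b, out);
}
```

Calling `run_1d_range(4, a, b, out)` on two four-element arrays fills `out` with their element-wise sums, one element per emulated work item.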

Application Information

IPC(8): G06F9/50
CPC: G06F9/5027
Inventors: 许世文, 樊宇, 解文华, 张坤
Owner: 中国船舶集团有限公司