Convolutional neural network parallel processing method based on OpenCL
A convolutional neural network and parallel processing technology, applied in the field of parallel processing of convolutional neural networks, can solve problems such as high model complexity and slow running speed, and achieve the effect of overcoming convolution operations, improving running speed, and overcoming complex structures
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0035] The present invention will be further described below in conjunction with the accompanying drawings.
[0036] Refer to attached figure 1 , to further describe the specific steps of the present invention.
[0037] Step 1, obtain the reorganization matrix of the image data matrix.
[0038] Read image data matrix in host memory.
[0039] Create an image data matrix cache object in the GPU global memory, and transfer the image data matrix from the host memory to the image data matrix cache object.
[0040] According to the size of the convolution kernel and the number of channels of the image data matrix, the number of rows of the reorganization matrix is calculated, and the image data matrix is combined and rearranged in parallel to obtain a two-dimensional reorganization matrix. The size of the reorganization matrix is K×N.
[0041] The specific steps of the parallel combination rearrangement are as follows:
[0042] Step 1, according to the following formula, c...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com