A Modeling Method of Collective Communication Functions for Parallel Programs
A technology of collective communication and modeling method, applied in the field of collective communication function modeling, can solve the problems of inaccurate acquisition of communication time data, cost a lot of time and money, etc., to save time and money, and improve the accuracy rate.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment approach 1
[0026] Specific implementation mode one: combine figure 1 To illustrate this embodiment, the specific process of a method for modeling collective communication functions of a parallel program in this embodiment is as follows:
[0027] Step 1, measure the collective communication function N times under the experimental platform, and obtain the communication time data of the collective communication function under different degrees of parallelism and data volume;
[0028] The value range of N is 1000-10000;
[0029] Step 2: Use the artificial neural network based on the BP (Error Back Propagation) backpropagation algorithm to fit the communication time data of the collective communication function under different degrees of parallelism and data volume, and obtain the neural network model of the corresponding communication function.
specific Embodiment approach 2
[0030] Specific embodiment two: the difference between this embodiment and specific embodiment one is: use the artificial neural network based on BP (Error Back Propagation) backpropagation algorithm in the described step 2 to set communication function in different degrees of parallelism and data volume The following communication time data are fitted to obtain the neural network model of the corresponding communication function; the specific process is:
[0031] The process of BP backpropagation algorithm is:
[0032] The forward propagation process first receives the input signal, and reaches the output layer through the weights and activation functions between neurons layer by layer to obtain the output value after the current iteration;
[0033] Calculate the error of this round of iteration according to the error definition method;
[0034] According to certain rules, the error is backpropagated from the output layer to the input layer, and the weight is adjusted layer ...
specific Embodiment approach 3
[0047] Specific implementation mode three: the difference between this implementation mode and specific implementation mode one or two is: the data preprocessing in the described step 21; the specific process is:
[0048] The communication time data under different parallelism and data volume is the communication time measured by changing the data volume under a certain parallelism degree or the communication time measured by changing the parallelism degree under a certain data volume;
[0049] The communication time data under different degrees of parallelism and data volume are aggregated and uneven, which has a great impact on training. Therefore, before transmitting the communication time data under different degrees of parallelism and data volume to network training, it is necessary to Shuffle the communication time data under different parallelism and data volume;
[0050]The parameter value update of the neurons in the first hidden layer is proportional to the input val...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com