
Model training method, device and equipment based on pipeline parallelism

A model training and pipeline technology, applied in the field of machine learning, that improves parallelism, preserves the training effect, and increases equipment utilization.

Active Publication Date: 2021-07-27
ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
11 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

Traditional model parallelism can only keep one working machine running at any given time, while the others sit idle.
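The idle-worker problem can be illustrated with a small simulation. This is a minimal sketch, not taken from the patent: it assumes each working machine holds one model slice, that every forward or backward pass over a slice takes exactly one time step, and that a single sample is processed at a time. The function names are hypothetical.

```python
def naive_model_parallel_timeline(num_workers):
    """List (time_step, worker, phase) tuples for one training sample.

    Assumption: the sample passes forward through workers 0..K-1 and then
    backward through workers K-1..0, one unit time step per pass.
    """
    timeline = []
    t = 0
    for worker in range(num_workers):
        timeline.append((t, worker, "F"))  # forward pass on this slice
        t += 1
    for worker in reversed(range(num_workers)):
        timeline.append((t, worker, "B"))  # backward pass on this slice
        t += 1
    return timeline


def utilization(timeline, num_workers):
    """Fraction of (worker, time step) slots in which a worker is busy."""
    total_steps = max(t for t, _, _ in timeline) + 1
    return len(timeline) / (total_steps * num_workers)


if __name__ == "__main__":
    tl = naive_model_parallel_timeline(4)
    print(utilization(tl, 4))
```

Under these assumptions exactly one worker is active in every time step, so utilization falls to 1/K for K working machines.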



Detailed Description of Embodiments

[0049] The embodiments of this specification provide a model training method, apparatus, device and storage medium based on pipeline parallelism.

[0050] To help those skilled in the art better understand the technical solutions in this specification, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some, not all, of the embodiments of the present application. All other embodiments obtained by persons of ordinary skill in the art on the basis of the embodiments of this specification, without creative effort, shall fall within the scope of protection of this application.

[0051] In one or more embodiments of this specification, a worker refers to a device, a process, etc. used to perform model training, generally a single worker (may be called a worker), or a working machine co...


Abstract

The embodiment of the invention discloses a model training method based on pipeline parallelism. The method comprises: segmenting a to-be-trained model according to its normal training order to obtain a plurality of sub-slices; dividing the sub-slices that come first in the normal training order into a first group to form a first sub-model, and dividing the remaining sub-slices into a second group to form a second sub-model; distributing the sub-slices of the first sub-model to different working parties, and likewise distributing the sub-slices of the second sub-model to different working parties; allocating training time blocks to the sub-slices on the working parties to which they are distributed, according to an adjusted training order in which, for a single round of training samples, the following are executed in sequence: the forward propagation of the first sub-model in the current period, the backward propagation of the first sub-model in the previous period, the forward propagation of the second sub-model in the current period, and the backward propagation of the second sub-model in the current period; and training the to-be-trained model in parallel on the plurality of working parties according to the training time blocks.
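The segmentation, grouping, and adjusted ordering described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation. All function names are hypothetical, and several details are assumptions that the abstract does not state: slices are cut evenly, the i-th slice of each sub-model is placed on working party i, and the very first period skips the "previous period" backward pass because no previous period exists. How training time blocks are actually sized and allocated is not described in the abstract and is not modeled here.

```python
def split_into_slices(layers, num_slices):
    """Cut an ordered list of layers into contiguous sub-slices (even split assumed)."""
    size, extra = divmod(len(layers), num_slices)
    slices, start = [], 0
    for i in range(num_slices):
        end = start + size + (1 if i < extra else 0)
        slices.append(layers[start:end])
        start = end
    return slices


def group_and_assign(slices, num_front):
    """Form the two sub-models and place their slices on working parties.

    Assumption: slice i of each sub-model goes to working party i, so the
    slices of any one sub-model always sit on different working parties.
    """
    first, second = slices[:num_front], slices[num_front:]
    assignment = {}
    for name, group in (("first", first), ("second", second)):
        for party, sub_slice in enumerate(group):
            assignment.setdefault(party, []).append((name, sub_slice))
    return first, second, assignment


def period_order(period):
    """Adjusted order for one period: F1(now), B1(previous), F2(now), B2(now)."""
    steps = [("first", "forward", period)]
    if period > 0:  # assumption: no previous period exists for period 0
        steps.append(("first", "backward", period - 1))
    steps.append(("second", "forward", period))
    steps.append(("second", "backward", period))
    return steps


if __name__ == "__main__":
    slices = split_into_slices(list(range(8)), 4)  # 8 layers, 4 sub-slices
    _, _, assignment = group_and_assign(slices, 2)
    print(assignment)
    print(period_order(2))
```

The point of the ordering is that the first sub-model's backward pass lags by one period, which frees its working parties to start the next forward pass instead of waiting.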

Description

Technical field

[0001] This specification relates to the field of machine learning, and in particular to a model training method, device and equipment based on pipeline parallelism.

Background

[0002] In recent years, deep learning has greatly advanced artificial intelligence technology and achieved good results in many fields. At the same time, the parameter scale of models keeps increasing, and its growth rate has far exceeded the growth rate of hardware processing performance.

[0003] To resolve this contradiction, model parallelism has gradually attracted the attention of researchers. Model parallelism divides a large-scale model into multiple model slices and distributes them to different working machines, so that large-scale models can be trained or used for inference. The traditional model parallel technology can only ensure that one working machin...


Application Information

IPC(8): G06N3/04, G06N3/08
CPC: G06N3/084, G06N3/045
Inventor: 杨鹏程, 张晓明, 杨明, 魏宏
Owner: ALIPAY (HANGZHOU) INFORMATION TECH CO LTD