Deep learning model training method, working node and parameter server

A technology for model training and working nodes, applied in the field of deep learning, which can solve problems such as low model training efficiency

Pending Publication Date: 2020-12-01
LYNXI TECH CO LTD
View PDF4 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a deep learning model training method, working nodes, and parameter servers to solve the problem of low model training efficiency in the process of using multiple working nodes to train the same deep learning model in the related art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning model training method, working node and parameter server
  • Deep learning model training method, working node and parameter server
  • Deep learning model training method, working node and parameter server

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0034] In the training process of large-scale deep learning models, in order to speed up model convergence and improve training efficiency, and considering that the total number of samples may be large (it is impossible to use all sample data for model iteration), the small batch gradient descent method is usually used for model training. . Among them, when using the small batch gradient descent method for model training, each iteration uses batch size...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a deep learning model training method, a working node and a parameter server. The deep learning model training method applied to the working node comprises thesteps that a first statistical parameter sent by a parameter server is received, and the first statistical parameter is determined by the parameter server according to historical training data of a target layer of a target model; when the target layer is trained based on target batch training samples, target statistical parameters of the target layer are obtained, and the target statistical parameters are statistical parameters of the target batch training samples; and an actual statistical parameter of the target layer is determined based on the first statistical parameter and the target statistical parameter, batch standardization is performed on the target batch training samples based on the actual statistical parameter, and the target statistical parameter is sent to the parameter server. According to the embodiment of the invention, the training efficiency of the deep learning model can be improved.

Description

technical field [0001] The invention relates to the technical field of deep learning, in particular to a deep learning model training method, a working node and a parameter server. Background technique [0002] With the development of information technology, the use of deep learning models for training to use the trained models to predict target data has been more and more widely used. In order to further improve the accuracy of the trained models, the training sample The number is also increasing, which results in the complexity of training and longer training time. [0003] In related technologies, usually multiple working nodes can be used to train the same model. For example, different working nodes are responsible for training different training layers in the same model. At this time, the next training layer needs to wait for the training of the previous training layer to complete. Being able to perform a training process whose waiting time greatly increases the overal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N20/00
CPCG06N20/00
Inventor 徐茂轩吴臻志
Owner LYNXI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products