Deep learning job operation method and device

A technology of deep learning and operation method, applied in the computer field, can solve the problems of inconvenient expansion and maintenance, lack of standards, and incomplete implementation, and achieve the effect of facilitating expansion and maintenance

Inactive Publication Date: 2020-02-11
INSPUR SUZHOU INTELLIGENT TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, there are priority options for deep learning jobs (jobs) when scheduling in Kube-bath, while Kubeflow contains a variety of operators, and each job will have a priority setting, but there has never been a unified The standard, and it has not been fully implemented, which leads to great inconvenience in the process of subsequent expansion and maintenance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning job operation method and device
  • Deep learning job operation method and device
  • Deep learning job operation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0056] Optionally, in another embodiment of the present invention, an implementation after step S203 further includes:

[0057] Sort each deep learning job and pending deep learning jobs in the running queue according to their running priority.

[0058] In the specific implementation process of the embodiment of the present invention, after adding the deep learning job to be processed in the running queue, all the deep learning jobs in the running queue are reordered according to the size of the running priority, and the running deep learning job runs After the end, the deep learning job with the highest priority in the run queue will be run first.

[0059] Optionally, in another embodiment of the present invention, an implementation of the method for running a deep learning job further includes:

[0060] The information of the deep learning job in the running strategy and the scheduling policy corresponding to each type of deep learning job are updated every preset time.

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an operation method and device for deep learning jobs. The method comprises the steps that deep learning job information to be processed is acquired; wherein the to-be-processeddeep learning job information comprises the type of the to-be-processed deep learning job; matching in a preset operation strategy according to the type of the deep learning job to be processed to obtain a scheduling strategy corresponding to the type of the deep learning job to be processed; wherein the scheduling strategy comprises operation priorities of the deep learning jobs; and finally, running the deep learning job to be processed according to the running priority of the deep learning job. Deep learning homework and an operation strategy are separated; the two parts are independent; only when the deep learning job needs to be operated, a scheduling strategy corresponding to the type of the deep learning job is obtained through matching in a preset operation strategy; wherein the scheduling strategy comprises operation priorities of the deep learning jobs, so that the purpose of unifying priority strategies of all operators to facilitate subsequent extension and maintenance isachieved.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to a method and device for running a deep learning job. Background technique [0002] With the rapid development of computer technology, more and more applications of Kubernetes, Kubernetes is an open source platform for automatic deployment, expansion and operation and maintenance of container clusters, which can be abbreviated as K8s; Kube-bath is a batch of K8s The processing scheduler provides a mechanism for applications that want to run K8s batch jobs; kubeflow is a glue project that combines many support for machine learning, such as model training, hyperparameter training, model deployment, etc., into containers Deploy in a standardized way to provide high availability and convenient expansion of each system in the entire process. Users who deploy kubeflow can use it to perform different machine learning and deep learning tasks. Different types of deep learning jo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/50G06N3/063
CPCG06F9/5038G06N3/063
Inventor 李铭琨
Owner INSPUR SUZHOU INTELLIGENT TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products