Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cloud robot task scheduling method and system based on parallel reinforcement learning

A technology of reinforcement learning and task scheduling, applied to control/regulation systems, instruments, two-dimensional position/channel control, etc., can solve problems such as time increase, unacceptable too much time overhead, and difficulty in obtaining the optimal strategy for the overall problem , to achieve the effect of shortening the learning time and precise optimal strategy

Inactive Publication Date: 2019-04-23
BEIJING WUZI UNIVERSITY
View PDF9 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This kind of episodic tasks can be solved by table solving algorithm, but as the state space expands, the time required to complete the learning increases rapidly, and it is unacceptable for too much time overhead in practical applications; and the approximate solution method uses the finite state space Effective promotion of experience, when encountering unknown situations, generalize similar situations from previously encountered situations. The key lies in the generalization of the problem, and it is difficult to obtain the optimal strategy for the overall problem.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cloud robot task scheduling method and system based on parallel reinforcement learning
  • Cloud robot task scheduling method and system based on parallel reinforcement learning
  • Cloud robot task scheduling method and system based on parallel reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0049] In order to obtain an accurate table solution for the optimal strategy and reduce time overhead as much as possible, the embodiment of the present invention utilizes the parallel computing resources of the cloud platform to propose a cloud robot task scheduling strategy based on parallel reinforcement learning. The scheduling center divides the complex problem into several sub-problems, and the scheduling strategy allocates computing nodes, and the compu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a cloud robot task scheduling method and system based on parallel reinforcement learning. The cloud robot task scheduling method includes the steps that original problems are loaded in a scheduling center, and the original problems are divided into a plurality of sub-problems; the sub-problems and a plurality of computational nodes are matched by the scheduling center; the sub-problems are sent to the computational nodes matched with the sub-problems by the scheduling center; the computational nodes carry out parallel reinforcement learning on the sub-problems and feed learning results of the sub-problems back to the scheduling center; and the scheduling center judges whether the original problems have converged or not according to the learning results of the sub-problems and preset convergence conditions, and if the original problems have converged, optimal strategies of the original problems are output. The method can obtain the precise optimal strategies and reduce the time cost.

Description

technical field [0001] The present invention relates to a cloud robot task scheduling strategy based on parallel reinforcement learning, which belongs to the field of machine learning, relates to the combination and use of cloud robots, distributed computing, and reinforcement learning, and specifically relates to a cloud robot task scheduling based on parallel reinforcement learning methods and systems. Background technique [0002] In recent years, robots have entered a period of rapid development, and the rise in labor costs has given rise to the need to use machines to replace manpower. At present, because the capabilities of robots, especially the level of intelligence and expectations, are far apart, the application of commercial robots is mainly concentrated in the fields of large-scale repetitive production such as automobiles and electronic equipment. With the widespread use of cloud computing, whether it is renting public clouds or deploying local clouds, it provi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/02
CPCG05D1/0221
Inventor 唐恒亮薛菲刘涛董晨刚
Owner BEIJING WUZI UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products