Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Reinforced learning based robot joint motion control method and system

A technology of robotic joints and reinforcement learning, applied in the direction of program control manipulators, manipulators, manufacturing tools, etc., can solve problems such as only consideration, value function influence, error accumulation, etc., to reduce influence, avoid accumulation, and compensate accurately and efficiently. Effect

Active Publication Date: 2019-04-12
XIAMEN UNIV
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantage of this method is that the errors will continue to accumulate, and slight changes in the strategy will have a great impact on the value function
The other is a method based on strategy search, also known as actor-only. This method directly improves the strategy. This method performs well in solving continuous state problems and is widely used in the field of robotics. However, only the current strategy is considered in the training process. Or several adjacent strategies, therefore, there is a local optimal problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Reinforced learning based robot joint motion control method and system
  • Reinforced learning based robot joint motion control method and system
  • Reinforced learning based robot joint motion control method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0050] The purpose of the present invention is to provide a robot joint motion control method and system based on reinforcement learning, which has the characteristics of small error and high efficiency.

[0051] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0052] ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a reinforced learning based robot joint motion control method and system. The method comprises the following steps: obtaining to-be-operated track of a robot terminal; calculating position increment within each interpolation period of the robot joint according to the to-be-operated track of the robot terminal and a robot inverse kinematic model; determining position increment compensation within each interpolation period of the robot joint according to a policy network; taking sum of given position increment within each interpolation period and position increment compensation as motion parameters of the robot joint, inputting the motion parameters into a robot to obtain practical motion amount within each interpolation period of the robot joint; performing real-timetraining update on a value network according to the given position increment and the practical motion amount; after operation of the to-be-operated track is accomplished, performing training update on the policy network according to parameters updated according to each interpolation period, of the value network; and controlling motion, in next to-be-operated track, of the robot joint by adoptingthe updated policy network. The reinforced learning based robot joint motion control method has the characteristics of being small in errors and high in efficiency.

Description

technical field [0001] The invention relates to the field of robot control, in particular to a robot joint motion control method and system based on reinforcement learning. Background technique [0002] There is a problem of trajectory deviation at the end of the robot during operation, and reinforcement learning can be used to compensate and reduce the deviation. There are currently two main types of reinforcement learning applications in this field. One is the method based on the value function, also known as critic-only, which is to derive the corresponding optimal strategy by observing and evaluating the performance of the system. The disadvantage of this method is that the errors will continue to accumulate, and slight changes in the strategy will have a great impact on the value function. The other is a method based on strategy search, also known as actor-only. This method directly improves the strategy. This method performs well in solving continuous state problems ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): B25J9/16
CPCB25J9/1602B25J9/163B25J9/1664
Inventor 刘暾东贺苗吴晓敏高凤强王若宇
Owner XIAMEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products