Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Mechanical arm motion planning method based on deep reinforcement learning

A technology of reinforcement learning and motion planning, applied in manipulators, program-controlled manipulators, manufacturing tools, etc., can solve the problems of large nonlinear function approximators, difficult to deal with high-dimensional continuous action space, instability and other problems

Pending Publication Date: 2020-09-11
NANJING UNIV +1
View PDF5 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Before DQN, it was widely believed that learning value functions using large nonlinear function approximators was difficult and unstable
The algorithm combines deep learning with reinforcement learning, and can use a function-like approximator to learn the value function in a stable manner. In order to minimize the correlation between samples, the network uses samples in the buffer for offline training, but it is difficult to cope with High-dimensional continuous action space
The Actor-Critic method can handle continuous action space problems, but the use of random strategies makes it difficult for the network to converge

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mechanical arm motion planning method based on deep reinforcement learning
  • Mechanical arm motion planning method based on deep reinforcement learning
  • Mechanical arm motion planning method based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments.

[0037] This embodiment provides a method for motion planning of a robotic arm based on deep reinforcement learning, taking a 6-DOF robotic arm as an example for illustration, specifically including the following steps:

[0038] Step 1. The image acquisition device acquires an environment image before the movement of the manipulator. The environment image includes the manipulator in the initial state, moving target points and intermediate obstacles to obtain the initial planning space. The simplified schematic diagram is shown in 1.

[0039] Step 2, according to the collected environmental image, use the target segmentation algorithm to separate the forbidden area ( figure 1 Middle gray area), the working area is the movement space and target position of the end of the manipulator except the forbidden area in the planning space, and the initial...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a mechanical arm motion planning method based on deep reinforcement learning. The method comprises the following steps of 1, acquiring an environment image once before a mechanical arm moves, wherein the environment image comprises the mechanical arm in an initial state, a moving target point and an intermediate obstacle; 2, according to the acquired environment image, utilizing the target partitioning algorithm to separate a prohibited area, a working area and a target position from one another, and reconstructing a planning space; 3, dividing the reconstructed planning space into three-dimensional grid spaces, and establishing binarized grid spaces; 4, solving the corresponding analytical solution of each joint of the mechanical arm under known terminal coordinates by utilizing the robot inverse kinematics, and determining the relative positional relationships between the mechanical arm and the planned space boundary, the prohibited area boundary and the moving target under the global coordinate system; and 5, planning the motion strategy for the mechanical arm and acquiring the optimal motion strategy, so that the mechanical arm moves to the target position at the minimum cost under the premise of avoiding the obstacle.

Description

technical field [0001] The invention relates to a dynamic intelligent planning method based on reinforcement learning, in particular to a motion planning method of a 6-axis cooperative mechanical arm. Background technique [0002] The traditional trajectory planning of the manipulator mostly adopts the method of trajectory interpolation, combined with high-order polynomials to smooth the position, speed and acceleration of the manipulator. In this method, the motion scheme of the manipulator is relatively fixed, and it is impossible to actively avoid the environment. of obstacles. As the performance requirements of the robotic arm in the industrial field are getting higher and higher, and the robotic arm is required to complete more interactive or dynamic complex space tasks, the traditional method is no longer applicable. [0003] In recent years, reinforcement learning methods have been increasingly used in robot control tasks. Traditional reinforcement learning methods s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): B25J9/16
CPCB25J9/1666B25J9/163B25J9/1697
Inventor 辛博傅汇乔陈春林程旭马晶
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products