Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A robot out-of-sequence workpiece grasping method based on deep inverse reinforcement learning

A reinforcement learning and robotics technology, applied in manipulators, manufacturing tools, program-controlled manipulators, etc., can solve the problems of low efficiency, hidden dangers, long cycle, etc., to achieve the effect of low efficiency, meet the needs of industrial production, and long cycle

Active Publication Date: 2020-10-16
ZHEJIANG UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, traditional reinforcement learning methods have great limitations when solving high-dimensional state and action space problems. Under the condition of limited samples and computing units, the ability to express complex functions is limited, and the performance in practical applications is often not ideal.
At the same time, the traditional deep reinforcement learning algorithm needs to provide a large amount of data for training. During the training process, the robot needs to continue to grasp and try and make mistakes, so that it is possible to obtain a stable grasping ability.
This training method has a long cycle and low efficiency, and there are safety hazards in the actual training process, which often cannot meet the needs of industrial production applications.
Moreover, in the actual multi-step reinforcement learning process, the design of the reward function is very difficult

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A robot out-of-sequence workpiece grasping method based on deep inverse reinforcement learning
  • A robot out-of-sequence workpiece grasping method based on deep inverse reinforcement learning
  • A robot out-of-sequence workpiece grasping method based on deep inverse reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention will be further described below in conjunction with drawings and embodiments.

[0042] as attached figure 1 Shown, specific embodiment of the present invention and its implementation process are as follows:

[0043] The depth camera adopts a binocular vision sensor, which is set directly above the object to be captured, and can take pictures of the object to be captured and output point cloud data. The robot is a six-axis industrial robot, which is set on a horizontal plane. The method of the present invention needs to train the point cloud classification network, the position generation network and the attitude generation network first, and the grasping pose estimation can only be implemented after the network training is completed.

[0044]In this method, the tool center point of the robot is set as the clamping midpoint of the end effector, the x-axis direction of the tool center point is the forward direction of the end effector, the z-axis d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a robot out-of-order workpiece grabbing method based on deep reverse reinforcement learning. A visual sensor shoots image information o a workpiece to be grabbed, and the information is transmitted to an information processing unit; the information processing unit generates three-dimensional point cloud of target workpiece extracting for the shot image; the treated three-dimensional point cloud data are input to deep reverse reinforcement learning, and a robot motion path is calculated; and the robot carries out workpiece grabbing according to the motion path obtained through calculation. The industrial production needs can be met, little expert demonstration data need to be demonstrated, grabbing programming of specific workpieces can be rapidly achieved, and the defects that the deep reinforcement learning. method strategy is poor in adaptation, limited in grabbing capacity, long in training period, low in efficiency and the like are overcome.

Description

technical field [0001] The invention relates to a robot workpiece grasping method belonging to artificial intelligence, in particular to a robot out-of-sequence workpiece grasping method based on deep reverse reinforcement learning. Background technique [0002] As one of the world's top five industrial robot consumers, China's installation volume increased to 36.0% of the world in 2018. A total of 138,000 industrial robots were installed, a year-on-year increase of 59%. The consumption volume has exceeded the sum of Europe and the United States. Intelligent manufacturing is the main direction of Made in China 2025, and there is a huge demand for intelligent industrial robots. The application of robots for handling and loading and unloading accounts for more than two-thirds, and the added value brought by intelligent upgrading is obvious. [0003] With the development of artificial intelligence deep learning, it has begun to study the deep learning of workpiece vision based...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): B25J9/16
CPCB25J9/1602B25J9/1664B25J9/1697
Inventor 傅建中王郑拓徐月同杨波
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products