Double-robot force/position multivariate data driving method based on reinforcement learning

A technology of reinforcement learning and multivariate data, applied to manipulators, program-controlled manipulators, manufacturing tools, etc., to solve parameter optimization problems, improve dexterity, and avoid errors

Inactive Publication Date: 2021-09-24
GUANGZHOU INST OF ADVANCED TECH CHINESE ACAD OF SCI
View PDF7 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When the environment is unknown, force control is insufficient to generate the desired strength for uncertainties in the environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Double-robot force/position multivariate data driving method based on reinforcement learning
  • Double-robot force/position multivariate data driving method based on reinforcement learning
  • Double-robot force/position multivariate data driving method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] In order to make the above objects, features, and advantages of the present invention, the technical solutions of the present invention will be described in detail below with reference to the accompanying drawings and specific examples. It should be noted that the described embodiments are merely the embodiments of the present invention, not all of the embodiments, based on the embodiments in the present invention, and those of ordinary skill in the art without creative labor premise Other embodiments belong to the scope of the invention.

[0048] Dual robots in the same station region synergistic, handling and flip, need to study the interaction of robots and the environment, and most common interactive control methods are force-position control. When the environment is unknown, the force / bit control is not sufficient to generate the uncertainty in the environment, and the force / bit control is required, and it is necessary to estimate its expected value.

[0049] Machi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a double-robot force / position multivariate data driving method based on reinforcement learning. A master robot employs an ideal position element control strategy, learns an expected position through a reinforcement learning algorithm, and feeds back an actual position to the expected position, and the purpose is to generate an optimal force when the robot interacts with an environment, so that the position error is minimized; and a slave robot employs a damping PD control strategy suitable for an unknown environment based on the force element control strategy of position deviation of the master robot, learns expected acting force through the reinforcement learning algorithm, and is driven to approach the minimum force of an expected reference point. The master robot and the slave robot respectively learn the expected position and the expected acting force through the reinforcement learning algorithm, and both adopt a proportional differential control rate to set respective differential coefficients (kp) and proportionality coefficients (kd). According to the method, the flexibility of double-machine cooperation can be improved, the parameter optimization problem in force / position control is solved, and large errors in the transient state are avoided.

Description

Technical field [0001] The present invention relates to a multi-robot synergistic control technology, and in particular to a two-machine manpower / bit multivariate data driving method based on an intensive learning. Background technique [0002] With the continuous change of the processing volume of the steel / aluminum, the work environment, some work is only difficult to bear on the single robot, and the synergy between multiple robots is required to be completed. Multi-machine collaborative homework has replaced a single machine and become a research hotspot for building intelligent production lines. Multi-robot systems have the characteristics of data redundancy, robustness, robustness, and better data redundancy, better data redundancy, better system space distribution, better data redundancy, and better data redundancy, robustness. Synergism between multiple robots can be reliably completed complex tasks such as high-precision operations and high-efficiency machining of si...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): B25J9/16
CPCB25J9/1633B25J9/1682B25J9/1661
Inventor 张弓侯至丞杨文林吕浩亮徐征吴月玉李亚锋杨根
Owner GUANGZHOU INST OF ADVANCED TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products