Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Mechanical arm imitation learning method based on meta-action hierarchical generalization

A learning method and meta-action technology, applied in the field of imitation learning of robotic arms, can solve the problems of a large number of labor costs and high cost of expert teaching, and achieve the effects of improving accuracy, avoiding repeated training, and improving training efficiency

Active Publication Date: 2022-07-01
ZHEJIANG UNIV OF TECH
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the problem that imitation learning requires a large amount of expert teaching data in the strategy training process, and repeated manual teaching requires a lot of labor costs, and the cost of expert teaching is too high in some special environments, the present invention provides a method based on meta-action analysis. layer generalization method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mechanical arm imitation learning method based on meta-action hierarchical generalization
  • Mechanical arm imitation learning method based on meta-action hierarchical generalization
  • Mechanical arm imitation learning method based on meta-action hierarchical generalization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0038] Step S1: Move the manipulator by the hand of the expert, record the state of the manipulator joints with multiple heterogeneous sensors, and obtain the expert teaching data set, which is expressed as φ(s, a), where s represents the state data of each joint of the manipulator, including Spatial pose, moment, direction angle, etc.; a represents the robotic arm action mapped by the current state, recorded in a sparse matrix;

[0039] Step S2: Input the collected expert teaching data set φ(s, a) figure 1 In the generalization system in , the expert teaching dataset is decomposed by clustering, and the decomposed meta-action set is expressed as τ(A 1 , A 2 , …, A k ), where A 1 represents the first meta-action, and the subscript k represents the number of meta-actions included in the expert teaching;

[0040] Step S3: Calculate the weight of each meta action in the whole teaching action {π 1 , π 2 , …, π k }, where the weight coefficient π k One-to-one correspondence...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A mechanical arm imitation learning method based on meta-action hierarchical generalization comprises the steps that an expert teaching data set is obtained and expressed as phi (s, a), s represents state data of all joints of a mechanical arm and comprises the spatial pose, the torque and the direction angle, and a represents the mechanical arm action mapped by the current state; the expert teaching data set is decomposed in a clustering mode, and a decomposed meta-action set is obtained; calculating the weight {pi1, pi2,..., pik} of each element action in the whole teaching action, wherein the weight coefficient pik is in one-to-one correspondence with the element action; according to the weight coefficient of each element action, generalization is carried out according to different proportions, and a generalization action is generated; sub-actions are randomly selected from the generalized meta-actions tau (A1, A2,..., Ak) to be combined, generalization teaching psi is obtained, the sub-actions are arranged according to expert teaching, and target actions identical to original expert teaching are formed; and inputting the generalization teaching psi into a convolutional neural network for supervised learning to obtain an execution strategy of the target action. According to the invention, the training efficiency and the accuracy of meta-actions are improved.

Description

Technical field [0001] The invention belongs to the field of robotic arm imitation learning, and specifically relates to a robotic arm imitation learning method based on hierarchical generalization of meta-actions. Background technique [0002] With the increasing use of intelligent robots and robotic arms in fields such as medical assistance, industrial manufacturing, express sorting, etc., although the traditional teaching programming method can achieve imitation learning and output of simple actions in specific scenarios, its learning The process requires the acquisition of a large number of manual teaching data sets, resulting in a large amount of manual labor redundancy; at the same time, the stability and robustness of the system working under continuous action need to be improved. In the invention patent CN 111983922 A, Lei Qujiang, Li Xiuhao and others from the Institute of Advanced Technology, Chinese Academy of Sciences in Guangzhou disclosed a robot demonstration ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): B25J9/16B25J9/22
CPCB25J9/16B25J9/1628B25J9/0081B25J9/161
Inventor 张文安姜国栋付明磊刘锦元刘安东杨旭升史秀纺仇翔滕游周叶剑吴麒胡佛
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products