Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Quadruped robot gait control method based on reinforcement learning and CPG controller

A quadruped robot and gait control technology, applied in the field of robotics, can solve the problems of robot overturning, large change range, low reproducibility, etc., and achieve the effect of stable torso, prevention of overturning, and small change range

Pending Publication Date: 2020-05-29
SHENZHEN GRADUATE SCHOOL TSINGHUA UNIV
View PDF5 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, using this method has great limitations: 1. It is difficult to further modify and intervene in the leg motion planning after setting up the controller
2. Since only the timing problem is considered, the setting of the joint motion of the robot is not necessarily reasonable, resulting in a large change in the pitch angle and lateral angle of the robot torso during the motion process, which does not meet the requirements for smooth motion. robot capsize
But there are still some deficiencies: 1. The training neural network is too large and the reproducibility is low; these robot motion control networks are often trained by a large number of engineers and researchers for several months to achieve the effect. Reward and punishment functions are also extremely complex and difficult to transfer and reproduce; 2. The footed robots trained by existing reinforcement learning algorithms have abandoned a characteristic of footed robots: timing
The movement of the legs according to a certain sequence is a characteristic of mammals, and it is also a necessary condition for mammals to show faster and stronger ground adaptability. Robots controlled by reinforcement learning algorithms that remove the sequence will have poor adaptability The problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Quadruped robot gait control method based on reinforcement learning and CPG controller
  • Quadruped robot gait control method based on reinforcement learning and CPG controller
  • Quadruped robot gait control method based on reinforcement learning and CPG controller

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The present invention will be further described in detail below in combination with specific embodiments and with reference to the accompanying drawings. It should be emphasized that the following description is only exemplary and not intended to limit the scope of the invention and its application.

[0043] The design concept of the present invention is to assign an oscillator to each leg of a quadruped robot, and each oscillator has a strict phase relationship; for each leg, a reinforcement learning neural network is deployed to make the neural network The position autonomously plans a fast movement trajectory, so that the foot end of the leg reaches the target position as soon as possible.

[0044]Due to the development of the field of artificial intelligence, the theories in the field of deep learning and reinforcement learning have begun to be applied to robot kinematics planning. The characteristics of reinforcement learning, that is, there is no need to give a sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a quadruped robot gait control method based on reinforcement learning and a CPG controller. The quadruped robot gait control method comprises the steps that S1, building a single-leg model and an overall model of a quadruped robot; S2, constructing an actor neural network and a reviewer neural network to train the single-leg model; S3, determining a relative sequential relationship between legs of the quadruped robot and constructing a controller model according to leg movement characteristics; and S4, constructing a foot end motion trail model of the quadruped robot from an output signal of the controller model through mapping transformation, and taking the output of the foot end motion trail model as a gait control signal of the quadruped robot to drive the quadruped robot. According to the method, it is guaranteed that legs of the quadruped robot move strictly according to the time sequence, the gait characteristic of mammals is achieved, meanwhile, it is guaranteed that the trunk of the robot is stable in the moving process, the pitch angle and the lateral angle change range is small, and overturning is effectively prevented; the leg movement has self-adaptability, and a complex control and planning algorithm does not need to be artificially designed.

Description

technical field [0001] The invention relates to the technical field of robots, in particular to a method for controlling the gait of a quadruped robot based on reinforcement learning and a CPG controller. Background technique [0002] In the field of gait control of quadruped robots, the central rhythm controller CPG is one of the simplest and most classic control methods. First, a suitable rhythm controller is constructed to assign a controller to each leg of the quadruped robot, so that the robot Exercise according to a certain sequence, and complete simple actions such as forward or backward. However, using this method has great limitations: 1. It is difficult to further modify and intervene in the leg motion planning after the controller is set. 2. Since only the timing problem is considered, the setting of the joint motion of the robot is not necessarily reasonable, resulting in a large change in the pitch angle and lateral angle of the robot torso during the motion pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G05D1/02G05D1/08
CPCG05D1/0246G05D1/0223G05D1/0221G05D1/0891
Inventor 刘厚德于天宁王学谦梁斌朱晓俊高学海
Owner SHENZHEN GRADUATE SCHOOL TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products