Method for controlling mobile robot through automatic establishment of abstract action

A mobile robot and motion control technology, applied in the direction of non-electric variable control, control/adjustment system, two-dimensional position/channel control, etc., can solve the problems of increasing number of features, decreasing algorithm performance, increasing algorithm complexity, etc., to achieve The effect of accelerated learning

Active Publication Date: 2016-08-17
SUZHOU UNIV
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, when solving large-scale problems with big data, the performance of traditional reinforcement learning algorithms will drop sharply, because reinforcement learning needs to continuously interact with the environment through robots, so when the problem scale expands, the number of features in the environment will increase exponentially, Therefore, the complexity of the algorithm will rise sharply, and the robot needs a lot of time and experience to interact with the environment to obtain a good strategy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for controlling mobile robot through automatic establishment of abstract action
  • Method for controlling mobile robot through automatic establishment of abstract action
  • Method for controlling mobile robot through automatic establishment of abstract action

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The present invention will be further described below in conjunction with accompanying drawing and embodiment:

[0043] Example: refer to figure 1 , 2 , 5, a method for a mobile robot to automatically create an abstract action, specifically comprising the following steps:

[0044] 1) Initialize the module, set the operating environment of the robot, the reward value for reaching the goal, and the step size parameter and discount factor parameter for updating the value. The operating environment includes the boundary of the environment, the starting position of the task, the end position, and the environment The position of the fixed obstacle, each position is regarded as a state;

[0045] 2) The robot updates all state-action evaluation values ​​according to the update formula of the Q learning module , and save the set of learned experience trajectories , a trajectory can be expressed as ,in Indicates the trajectory The jth state of , and then for the obtai...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for controlling a mobile robot through automatic establishment of abstract action. The method comprises the steps of 1, initialization; 2, updating all state-action assessed values according to a Q-learning module updating formula, saving an experiment track, and removing the states in repeated state intervals in the experiment track; 3, calculating the diversity density values of residual states; 4, executing step 5 if the diversity density value of a certain state reaches a preset threshold value, and repeating step 2, 3 and 4 otherwise; 5, taking states with diversity density values larger than the threshold value as sub-goal points, establishing an abstract action module, setting the start state, strategy and stop condition of the abstract action, adding the abstract action module to a selectable action set of the robot, and controlling the mobile robot with the updated selectable action set. By the adoption of the method, the abstract action can be effectively established, the robot can conduct hierarchical reinforcement learning by means of the established abstract action, and then the learning speed of the robot in a complicated environment is increased.

Description

technical field [0001] The invention relates to a robot control method, which belongs to the field of machine learning, in particular to a method for controlling a mobile robot by using abstract actions, especially a method for the mobile robot to automatically create abstract actions to realize movement control. Background technique [0002] With the continuous expansion of the application field of robots, the tasks faced by robots are becoming more and more complex. Although many algorithms can pre-design and program the possible behaviors of robots, when faced with complex tasks, designers often cannot make effective predictions for the rapidly changing external environment, so they cannot make reasonable predictions about the behavior of robots. Therefore, autonomous robots that can perceive the environment and complete tasks by taking relevant actions have become the focus of research. [0003] Reinforcement Learning (Reinforcement Learning), represented by path planni...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/02
CPCG05D1/0221G05D2201/0217
Inventor 朱斐伏玉琛刘全陈冬火金海东
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products