A Pedestrian Sensing and Obstacle Avoidance Method for Service Robots Based on Deep Reinforcement Learning

A service robot and reinforcement learning technology, applied in the field of service robot pedestrian perception and obstacle avoidance based on deep reinforcement learning, can solve problems such as difficult convergence, difficult pedestrian obstacle avoidance mechanism modeling, and slow convergence

Active Publication Date: 2021-05-11
SHANGHAI JIAOTONG UNIV
View PDF10 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In view of the above-mentioned defects of the prior art, the technical problem to be solved by the present invention is to overcome the problem existing in the prior art that it is difficult to model the obstacle avoidance mechanism of pedestrians, and to overcome the end-to-end The training method is usually difficult to converge, or the convergence is very slow

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Pedestrian Sensing and Obstacle Avoidance Method for Service Robots Based on Deep Reinforcement Learning
  • A Pedestrian Sensing and Obstacle Avoidance Method for Service Robots Based on Deep Reinforcement Learning
  • A Pedestrian Sensing and Obstacle Avoidance Method for Service Robots Based on Deep Reinforcement Learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The following describes several preferred embodiments of the present invention with reference to the accompanying drawings, so as to make the technical content clearer and easier to understand. The present invention can be embodied in many different forms of embodiments, and the protection scope of the present invention is not limited to the embodiments mentioned herein.

[0049] In the drawings, components with the same structure are denoted by the same numerals, and components with similar structures or functions are denoted by similar numerals. The size and thickness of each component shown in the drawings are shown arbitrarily, and the present invention does not limit the size and thickness of each component. In order to make the illustration clearer, the thickness of parts is appropriately exaggerated in some places in the drawings.

[0050] Such as figure 1 , figure 2 , image 3 , Figure 4 and Figure 5 As shown, the present invention proposes a service ro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a service robot pedestrian perception obstacle avoidance method based on deep reinforcement learning, and relates to the fields of deep learning and service robot obstacle avoidance. The method is in the training phase: First, the ORCA algorithm is used to generate training data. Then, randomly generate experimental scenarios, use the initialized reinforcement learning model to interact with the environment to generate new training data, and integrate them into the original training data. Finally, use the SGD algorithm to train the network on the new training data to obtain the final network model. In the execution stage of this method: the state of the surrounding pedestrians is obtained through the lidar, the predicted state is calculated according to the trained model and the reward function, and the action that obtains the maximum reward is selected as the output and executed. The invention has strong real-time and adaptability, and can make the robot follow the pedestrian's right-going rule in the pedestrian environment, plan an efficient, safe and natural path, and improve the intelligence and sociality of the service robot.

Description

technical field [0001] The invention relates to the field of deep learning and obstacle avoidance of service robots, in particular to a pedestrian-aware obstacle avoidance method for service robots based on deep reinforcement learning. Background technique [0002] With the increase in labor costs, robots have begun to replace human workers in various fields, especially in the field of public services, such as takeaway robots, express delivery robots, shopping guide robots, etc. The scenes faced by these robots generally have many highly dynamic obstacles, such as pedestrians. How to enable service robots to navigate autonomously in a pedestrian environment and avoid pedestrian obstacles efficiently, safely and naturally has become a key issue that limits the wider application of service robots. In the pedestrian environment, the adaptability of the traditional obstacle avoidance algorithm becomes poor, and sometimes it will show unsafe behaviors such as sudden stop and sha...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G05D1/02G06N3/04
CPCG05D1/0231G05D2201/02G06N3/045
Inventor 赵忠华鲁兴龙曹一文晏懿琳
Owner SHANGHAI JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products