Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

PID locomotive automatic driving optimal control method based on reinforcement learning

An optimized control and automatic driving technology, applied in non-electric variable control, two-dimensional position/course control, vehicle position/route/altitude control, etc., can solve the problem of affecting PID control performance, unable to use large delay systems, Difficulty in parameter setting and other problems, to achieve the effect of improving the optimization effect and reducing the difficulty of manual design

Active Publication Date: 2018-04-20
TSINGHUA UNIV
View PDF2 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The classic control algorithm and the improved PID control algorithm are easy to implement and have good robustness, but there are difficulties in parameter tuning; intelligent control algorithms also have their own characteristics, such as fuzzy control, which is difficult to establish an accurate model but can be controlled according to experience. The control effect is better , but the design of fuzzy rules is too dependent on manual design and cannot be applied to systems with large delays; expert systems can make better use of expert experience and knowledge, but there are deficiencies in knowledge acquisition relying on manual work and weak reasoning ability
Because various intelligent control algorithms have their own characteristics and have the possibility of complementarity, the integrated intelligent control algorithm is combined according to the advantages of different intelligent control algorithms, but it still cannot completely avoid the shortcomings of the combined intelligent control algorithm itself.
Such as fuzzy predictive control, although the control effect is improved, but the fuzzy rules still need to be manually designed
[0004] Although there have been a lot of theoretical research results on the control algorithm, most of the locomotive control algorithms currently in use still use the PID control algorithm, but the limitations of the PID control algorithm itself determine that the control performance cannot be optimal.
In addition, due to the complex and changeable characteristics of the locomotive operating environment, it will also affect the performance of PID control

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • PID locomotive automatic driving optimal control method based on reinforcement learning
  • PID locomotive automatic driving optimal control method based on reinforcement learning
  • PID locomotive automatic driving optimal control method based on reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The present invention will be described in detail below with reference to the accompanying drawings and the embodiments thereof, but the protection scope of the present invention is not limited to the scope described in the embodiments.

[0029] In order to make the present invention clearer, the present invention is described in detail below.

[0030] The first embodiment of the present invention provides a PID locomotive automatic driving control method based on reinforcement learning, its processing process is as follows figure 1 shown, including:

[0031] Step S101, obtaining state information such as the speed difference between the actual running speed and the optimal speed of the locomotive, and current line information.

[0032] The LKJ (locomotive running monitoring device) on the locomotive can record the actual running speed of the locomotive, so the actual running speed of the locomotive can be obtained from the LKJ.

[0033] The optimal speed of the locom...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a PID locomotive automatic driving optimal control method based on reinforcement learning. The PID locomotive automatic driving optimal control method comprises the steps of: regarding current route information and a speed difference between a locomotive actual running speed and an optimal speed as input of reinforcement PID at first, acquiring a group of optimal PID controlparameters through reinforcement learning, realizing PID control according to the optimal PID control parameters provided by a reinforcement learning module, providing a control quantity, and controlling operation of a locomotive. The PID locomotive automatic driving optimal control method applies reinforcement learning to PID parameter adjustment, can interact with the environment by means of reinforcement learning, has the self-learning capability, can realize the PID control having the optimal parameter group, improves the optimization effect, and reduces manual design difficulty.

Description

technical field [0001] The invention relates to locomotive operation optimization manipulation technology, in particular to a PID (proportional, integral, differential) locomotive automatic driving optimization control method based on reinforcement learning based on reinforcement. Background technique [0002] The optimal control of locomotive automatic driving is one of the core functions of the locomotive automatic control system. Its main function is to calculate the corresponding control input quantity based on the optimal speed curve trajectory, based on the actual motion state and combined with the following control algorithm, and convert the control input quantity It acts on the actual operation process of the locomotive to complete the speed control. The locomotive automatic driving system needs to achieve indicators such as punctuality, comfort and energy-saving operation. The optimal generation of the optimal speed curve is the guarantee for the locomotive automat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G05D1/02
CPCG05D1/0221G05D1/0276
Inventor 黄晋卢莎赵曦滨高跃夏雅楠
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products