Unmanned ship reinforcement learning controller structure with data drive and design method thereof

An unmanned ship and controller technology, applied in the direction of adaptive control, general control system, control/regulation system, etc., can solve the problem of poor robustness, cumbersome establishment and setting of controller parameters, and no adaptive learning ability. And other issues

Active Publication Date: 2019-05-31
DALIAN MARITIME UNIVERSITY
View PDF6 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The controller design method based on the unmanned ship model has the advantages of interpretability and intuition, and has been widely studied and applied. However, for complex and changeable environments or complex controlled objects, the precise unmanned ship mathematical model The establishment and setting of controller parameters will be very cumbersome and not conducive to the adjustment of parameters
In addition, traditional controllers have fixed parameters and do not have adaptive learning capabilities. Even if there are some adaptive parameter adjustment methods, they are often limited by the expressiveness and robustness of artificially established unmanned ship models and environmental models. poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unmanned ship reinforcement learning controller structure with data drive and design method thereof
  • Unmanned ship reinforcement learning controller structure with data drive and design method thereof
  • Unmanned ship reinforcement learning controller structure with data drive and design method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The structure of a data-driven unmanned ship reinforcement learning controller related to the present invention is as follows: figure 1 shown. The present invention will be further described below with regard to a specific unmanned ship tracking a moving target simulation as an example.

[0049] A data-driven design method for the reinforcement learning controller structure of unmanned ships satisfies formulas (1)-(9), and the specific parameters are as follows:

[0050] In this embodiment, the unmanned ship is an underactuated unmanned ship, that is, the lateral velocity control input component τ v is 0, the longitudinal velocity control input component τ u Divided into seven gears [-10 -4 -2 0 2 4 10], the input component of yaw rate control τ r Divided into [-5 -1 01 2 5] six levels.

[0051] The tracking target is a moving target, and the initial pose state of the target is:

[0052]

[0053] longitudinal speed horizontal drift speed v r =0, yaw angular ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an unmanned ship reinforcement learning controller structure with data drive and a design method thereof. The controller structure comprises an unknown information extraction module, a prediction model generation module, a reward function module and a moving horizon optimization module. The unmanned ship reinforcement learning controller structure with data drive and the design method thereof are based on the data drive with no need for accurate mathematical modeling for a controlled unmanned ship. The controller only employs the unknown information extraction module tocollect control input and output state data information of the unmanned ship and extract a dynamics unknown function, the prediction model generation module reconstructs the extraction information toobtain a prediction model, and the controller does not depend on accurate manual modeling of the unmanned ship. The unmanned ship reinforcement learning controller structure with data drive and the design method thereof do not need to design different controllers for the two levels of the kinematics and the dynamics. Through a prediction model and a set reward function, the control input is subjected to moving horizon optimization to achieve an optimal control effect. The unmanned ship reinforcement learning controller structure with data drive and the design method thereof can be suitable for an all-drive unmanned ship and an under-actuated unmanned ship.

Description

technical field [0001] The invention relates to the technical field of unmanned ship motion control, in particular to a data-driven unmanned ship reinforcement learning controller structure and design method. Background technique [0002] Intelligent unmanned ships are the trend of ship development. In the 21st century, with the rapid development of new concepts and new technologies such as big data and artificial intelligence, the level of ship intelligence has also been continuously improved, and the development and application of intelligent unmanned ships have also had technological support. In the key technical field of intelligent unmanned ships, motion control technology is the premise and basis for realizing autonomous navigation of unmanned ships. [0003] Aiming at the problem of unmanned ship motion control, there are already some feasible technical solutions. For example, patent CN107015562A proposes a control method for underactuated surface ships that meets t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G05B13/04
Inventor 彭周华古楠王丹吕光颢刘陆
Owner DALIAN MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products