Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Fixed-wing unmanned aerial vehicle cluster control collision avoidance method and device based on deep reinforcement learning

A reinforcement learning and unmanned aerial vehicle technology, applied in vehicle position/route/height control, control/adjustment system, non-electric variable control, etc., can solve the problem of unmanned aerial vehicle collision, and the rotorcraft control strategy cannot be directly applied Fixed-wing UAV swarm control, UAV large collision risk and other issues

Active Publication Date: 2020-10-30
NAT UNIV OF DEFENSE TECH
View PDF7 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the prior art, UAV swarm control solutions based on reinforcement learning are usually mainly aimed at rotor UAVs. Unlike rotor UAVs, due to the non-holonomic constraints of the flight dynamics of fixed-wing UAVs, fixed The swarm control of wing UAVs is more complicated, and the control strategy suitable for rotorcraft cannot be directly applied to the swarm control of fixed-wing UAVs.
[0004] Some practitioners proposed to use deep reinforcement learning method to solve the fixed-wing UAV cluster control problem, but this research is still in its infancy, and it is realized by simplifying the problem. Generally, it is assumed that UAVs fly at different altitudes. The collision problem between UAVs is not considered, but in some practical application scenarios, UAV formations need to fly at the same altitude to perform tasks, and avoiding collisions between UAVs is a problem that must be considered. The control method is used to realize the cluster control of fixed-wing UAVs, and there will be a greater risk of collision between UAVs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fixed-wing unmanned aerial vehicle cluster control collision avoidance method and device based on deep reinforcement learning
  • Fixed-wing unmanned aerial vehicle cluster control collision avoidance method and device based on deep reinforcement learning
  • Fixed-wing unmanned aerial vehicle cluster control collision avoidance method and device based on deep reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] The present invention will be further described below in conjunction with the accompanying drawings and specific preferred embodiments, but the protection scope of the present invention is not limited thereby.

[0059] Such as figure 1 , 2 As shown, the steps of the fixed-wing UAV cluster control collision avoidance method based on deep reinforcement learning in this embodiment include:

[0060] S1. Model training: establish a UAV kinematics model for generating UAV state data and D3QN (Dueling Double Deep Q-Network, competing dual Q-Network) for outputting UAV control instructions, and use various The historical interaction data during the interaction process between the wingman and the environment updates the network parameters of D3QN, and trains to form a D3QN model. -Leader aircraft joint state), according to the acquired state information (environmental state) of the wingman himself and the adjacent wingman, conduct situation assessment to assess the risk of col...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a fixed-wing unmanned aerial vehicle cluster control collision avoidance method and device based on deep reinforcement learning. The method comprises the steps of S1, establishing an unmanned aerial vehicle kinematic model and D3QN, updating network parameters by using historical interaction data in the interaction process of each wing plane and the environment; training toform a D3QN model, constructing a joint state between the wing planes and a lead plane according to the environment state in the interaction process, performing situation assessment construction to obtain a local map, and inputting the local map into the D3QN model to obtain control instruction output of each wing plane; S2, obtaining state information in real time by each wing plane to form a joint state between the current wing plane and the lead plane, carrying out situation assessment in real time to construct a local map, and inputting the joint state constructed in real time and the local map into the D3QN network model to obtain a control instruction of each wing plane. The method has the advantages that the implementation method is simple, the expandability is good, cluster control over the fixed-wing unmanned aerial vehicles can be achieved, and meanwhile, collision is avoided.

Description

technical field [0001] The invention relates to the technical field of fixed-wing unmanned aerial vehicle cluster control, in particular to a fixed-wing unmanned aerial vehicle cluster control collision avoidance method and device based on deep reinforcement learning. Background technique [0002] With the continuous development of UAV system technology, UAVs have been widely used in various military operations and civilian tasks such as disaster search and rescue, geographic surveying and mapping, and military reconnaissance. In recent years, the application style of drones has gradually changed from single platform to multi-platform, and is developing in the direction of clustering. Although UAVs have made great progress in terms of operational autonomy in recent years, it is still a big challenge to efficiently and conveniently control UAV swarms in dynamic environments. [0003] At present, UAV swarm control methods can be divided into two categories: rule-based methods...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G05D1/10
CPCG05D1/104Y02T10/40
Inventor 闫超相晓嘉王菖吴立珍黄依新刘兴宇兰珍
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products