Multi-agent reinforcement learning method for collaborative decision-making of multiple combat units

A combat unit and multi-agent technology, applied in the direction of instruments, character and pattern recognition, computer components, etc., can solve the problems of low coordination of multi-combat unit decision-making and difficulty in obtaining valuable training samples, so as to increase diversity and Difficulty of battle, improvement of combat decision-making ability, and effect of synergistic effects

Pending Publication Date: 2022-04-15
CHINA ACAD OF LAUNCH VEHICLE TECH
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem of the present invention is: to overcome the deficiencies of the prior art, to propose a multi-agent enhanced learning method for multi-combat unit collaborative decision-making, and to solve the red-blue party game against multi-combat unit decision-making coordination existing in the prior art Low performance, difficult to obtain valuable training samples, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-agent reinforcement learning method for collaborative decision-making of multiple combat units
  • Multi-agent reinforcement learning method for collaborative decision-making of multiple combat units
  • Multi-agent reinforcement learning method for collaborative decision-making of multiple combat units

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The present invention proposes a multi-agent enhanced learning method for multi-combat unit collaborative decision-making, such as figure 1 As shown, the steps include:

[0047] The first step is to establish a multi-agent reinforcement learning model for the red-blue game confrontation scenario, and realize intelligent collaborative decision-making modeling for multiple combat units.

[0048] The construction process of the multi-agent reinforcement learning model is as follows:

[0049] (1.1) Build a red and blue game confrontation scene;

[0050] (1.2) Analyze the task characteristics and decision points in the red and blue game confrontation scene, and determine the state space of the task decision point;

[0051] The specific method of state space design is as follows:

[0052] Before the modeling of the decision model, the overall situation information of the game confrontation scene and the local observation information of the combat unit are used as the stat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A multi-agent reinforcement learning method for collaborative decision-making of multiple combat units comprises the following steps that a multi-agent reinforcement learning model is established for a red-blue square game confrontation scene, and intelligent collaborative decision-making modeling for the multiple combat units is achieved; the number of effective training samples is increased by adopting a post-event target conversion method, and optimization convergence of a multi-agent reinforcement learning model is achieved; constructing a reward function by taking a team global task reward as a benchmark and taking a specific action reward of each combat unit as feedback information; and generating a plurality of opponent strategies according to different combat schemes, and training the multi-agent reinforcement learning model through massive simulation game confrontation by using a reward function. The method solves the problems that in the prior art, red and blue square game confrontation multi-combat-unit decision collaboration is low, and valuable training samples are difficult to obtain.

Description

technical field [0001] The invention belongs to the field of artificial intelligence technology game confrontation, and relates to a multi-agent enhanced learning method. Background technique [0002] Multi-agent deep reinforcement learning combines the collaborative ability of multi-agents with the decision-making ability of reinforcement learning to solve the collaborative decision-making problem of clusters and multi-units. It is an emerging research hotspot and application direction in the field of machine learning, which covers many algorithms. , rules, and frameworks, and are widely used in real-world fields such as automatic driving, energy distribution, formation control, track planning, routing planning, and social problems. It has extremely high research value and significance. Relevant foreign research institutions have carried out some preliminary basic technology research on multi-agent deep reinforcement learning, and domestic research on this technology, esp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/2415G06F18/214
Inventor 李博遥郑本昌路鹰黄虎惠俊鹏陈海鹏王振亚李君阎岩范佳宣李丝然何昳頔张佳任金磊吴志壕刘峰范中行张旭辉赵大海韩特肖肖
Owner CHINA ACAD OF LAUNCH VEHICLE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products