Multi-agent reinforcement learning method for collaborative decision-making of multiple combat units

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A combat unit and multi-agent technology, applied in the direction of instruments, character and pattern recognition, computer components, etc., can solve the problems of low coordination of multi-combat unit decision-making and difficulty in obtaining valuable training samples, so as to increase diversity and Difficulty of battle, improvement of combat decision-making ability, and effect of synergistic effects

Pending Publication Date: 2022-04-15

CHINA ACAD OF LAUNCH VEHICLE TECH

View PDF0 Cites 9 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The technical problem of the present invention is: to overcome the deficiencies of the prior art, to propose a multi-agent enhanced learning method for multi-combat unit collaborative decision-making, and to solve the red-blue party game against multi-combat unit decision-making coordination existing in the prior art Low performance, difficult to obtain valuable training samples, etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0046] The present invention proposes a multi-agent enhanced learning method for multi-combat unit collaborative decision-making, such as figure 1 As shown, the steps include:

[0047] The first step is to establish a multi-agent reinforcement learning model for the red-blue game confrontation scenario, and realize intelligent collaborative decision-making modeling for multiple combat units.

[0048] The construction process of the multi-agent reinforcement learning model is as follows:

[0049] (1.1) Build a red and blue game confrontation scene;

[0050] (1.2) Analyze the task characteristics and decision points in the red and blue game confrontation scene, and determine the state space of the task decision point;

[0051] The specific method of state space design is as follows:

[0052] Before the modeling of the decision model, the overall situation information of the game confrontation scene and the local observation information of the combat unit are used as the stat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A multi-agent reinforcement learning method for collaborative decision-making of multiple combat units comprises the following steps that a multi-agent reinforcement learning model is established for a red-blue square game confrontation scene, and intelligent collaborative decision-making modeling for the multiple combat units is achieved; the number of effective training samples is increased by adopting a post-event target conversion method, and optimization convergence of a multi-agent reinforcement learning model is achieved; constructing a reward function by taking a team global task reward as a benchmark and taking a specific action reward of each combat unit as feedback information; and generating a plurality of opponent strategies according to different combat schemes, and training the multi-agent reinforcement learning model through massive simulation game confrontation by using a reward function. The method solves the problems that in the prior art, red and blue square game confrontation multi-combat-unit decision collaboration is low, and valuable training samples are difficult to obtain.

Description

technical field [0001] The invention belongs to the field of artificial intelligence technology game confrontation, and relates to a multi-agent enhanced learning method. Background technique [0002] Multi-agent deep reinforcement learning combines the collaborative ability of multi-agents with the decision-making ability of reinforcement learning to solve the collaborative decision-making problem of clusters and multi-units. It is an emerging research hotspot and application direction in the field of machine learning, which covers many algorithms. , rules, and frameworks, and are widely used in real-world fields such as automatic driving, energy distribution, formation control, track planning, routing planning, and social problems. It has extremely high research value and significance. Relevant foreign research institutions have carried out some preliminary basic technology research on multi-agent deep reinforcement learning, and domestic research on this technology, esp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06K9/62

CPCG06F18/2415G06F18/214

Inventor 李博遥郑本昌路鹰黄虎惠俊鹏陈海鹏王振亚李君阎岩范佳宣李丝然何昳頔张佳任金磊吴志壕刘峰范中行张旭辉赵大海韩特肖肖

Owner CHINA ACAD OF LAUNCH VEHICLE TECH

Features

Generate Ideas
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Multi-agent reinforcement learning method for collaborative decision-making of multiple combat units

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology