Intelligent agent control method and device based on reinforcement learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of reinforcement learning and control methods, applied in neural learning methods, machine learning, biological neural network models, etc., can solve problems such as limited algorithm effects, and achieve the effect of simplifying the strategy learning process and facilitating the expansion of quantity and types

Pending Publication Date: 2021-01-12

天津(滨海)人工智能军民融合创新中心

View PDF6 Cites 9 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, in a large-scale multi-agent environment, especially in an environment with many types of agents, the effect of the algorithm is still limited.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0054] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0055] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0056] In the field of multi-agent cooperative control, in order to effectively simplify the strategy learning process and reduce the complexity of the interaction between agents, the p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to an intelligent agent control method and device based on reinforcement learning. The method comprises the steps of obtaining current local observation of an intelligent agent;taking the current local observation of the intelligent agent as the input of a reinforcement learning model, and obtaining the current execution action of the intelligent agent output by the reinforcement learning model; controlling an intelligent agent to execute the current execution action of the intelligent agent; according to the technical scheme provided by the invention, the strategy learning process in a large-scale multi-intelligent system can be effectively simplified, the number and types of intelligent agents are easy to expand, and the method has potential value in large-scale real world application.

Description

technical field [0001] The invention relates to the field of multi-agent cooperative control, in particular to an agent control method and device based on reinforcement learning. Background technique [0002] In recent years, the rapid development of deep reinforcement learning has made researchers interested in multi-agent reinforcement learning, hoping that it can solve complex and large-scale problems, such as vehicle autonomous driving, resource allocation, swarm robots, human machine interaction, etc. At present, certain research results have been achieved in multi-agent reinforcement learning, such as communication and natural language processing, multi-agent games, traffic control, social dilemmas, etc. At the same time, more and more researchers began to pay attention to the research of large-scale multi-agent reinforcement learning. In large-scale multi-agent systems, a large number of agents with different goals and complex interactions between agents pose great ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06N3/08G06N3/04G06N20/00

CPCG06N3/084G06N3/049G06N20/00G06N3/045Y02T10/40

Inventor 史殿习姜浩薛超康颖金松昌郝锋秦伟

Owner 天津(滨海)人工智能军民融合创新中心

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Intelligent agent control method and device based on reinforcement learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology