Nonlinear multi-agent consistency method based on state observation and experience pool

A state observation and multi-agent technology, applied in the computer field, can solve problems such as difficult to satisfy, achieve the effect of simplifying the problem scene and reducing the complexity of the problem

Pending Publication Date: 2022-08-02
CHONGQING UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the iterative process of neural network training, the stability and convergence of the training process usually requires the continuous excitation condition to be satisfied during the training process, but this requirement is difficult to meet in practice.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Nonlinear multi-agent consistency method based on state observation and experience pool
  • Nonlinear multi-agent consistency method based on state observation and experience pool
  • Nonlinear multi-agent consistency method based on state observation and experience pool

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The embodiments of the present invention are described below through specific specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the contents disclosed in this specification. The present invention can also be implemented or applied through other different specific embodiments, and various details in this specification can also be modified or changed based on different viewpoints and applications without departing from the spirit of the present invention. It should be noted that the drawings provided in the following embodiments are only used to illustrate the basic idea of ​​the present invention in a schematic manner, and the following embodiments and features in the embodiments can be combined with each other without conflict.

[0031] Among them, the accompanying drawings are only used for exemplary description, and represent only schematic diagrams, not physical drawings, and should not be ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a nonlinear multi-agent consistency method based on state observation and an experience pool, and belongs to the field of computers. Firstly, under the scene that a follower cannot acquire the state of a leader, a backstepping method and a dynamic surface control method are used for designing a full-dimensional observer for each agent to observe the state of the leader. Afterwards, the consistency problem is converted into the optimal tracking problem of each follower to the respective leader state observer. In the next step, a non-quadratic cost function is defined for processing aiming at asymmetric saturation constraint on control input, then an optimal control problem under a new augmented system is defined, a reinforcement learning strategy algorithm is used for iterative solution, and the stability and optimality of a solution under strategy iteration are analyzed. For the difficulty that the optimal control HJB equation is difficult to directly solve, the method utilizes the good approximation property of the neural network, and an act-critic framework is used for solving.

Description

technical field [0001] The invention belongs to the field of computers, and relates to a non-linear multi-agent consistency method based on state observation and experience pool. Background technique [0002] At present, there are more and more application scenarios of multi-agent systems, and they play an increasingly important role, such as multi-robot formation, traffic control, smart grid, and multi-robot SLAM. The primary goal of achieving multi-agent coordinated control is the multi-agent consistency problem. In addition, it is usually accompanied by reaching certain conditions, such as minimizing the energy used in the process of reaching the consensus, and the time of the consensus process being as short as possible. For single-agent systems, many optimal control methods have been proposed. In the optimal cooperative control problem in the multi-agent scenario, the cooperative optimal control is generally realized by solving the CHJB equation. However, due to the n...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F30/27
CPCG06F30/27Y02D10/00
Inventor 陈刚赖鑫黄毅卿胡彬蒲嫦莉颜小力曾元
Owner CHONGQING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products