Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cargo transportation system based on multi-agent reinforcement learning

A technology for reinforcement learning and cargo transportation, applied in the field of multi-agent systems, can solve problems such as inability to solve convergence problems, affect system performance and efficiency, slow down system convergence speed, etc. Effect

Active Publication Date: 2020-04-10
SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI
View PDF6 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But in fact, in a multi-agent environment, due to differences in distance, speed and other factors, for a single freight agent, there may be some unnecessary information or even interference information in all freight agent information, and the communication information If the amount is too large, it may slow down the convergence of the system and affect the performance and efficiency of the entire system
In addition, in the current research on multi-intelligence reinforcement learning, it cannot solve the convergence problem when the number of freight agents is large.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cargo transportation system based on multi-agent reinforcement learning
  • Cargo transportation system based on multi-agent reinforcement learning
  • Cargo transportation system based on multi-agent reinforcement learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to facilitate the understanding of the present invention, the present invention will be described more fully below with reference to the associated drawings. Preferred embodiments of the invention are shown in the accompanying drawings. However, the present invention can be embodied in many different forms and is not limited to the embodiments described herein. On the contrary, these embodiments are provided to make the understanding of the disclosure of the present invention more thorough and comprehensive.

[0042] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field of the invention. The terms used herein in the description of the present invention are for the purpose of describing specific embodiments only, and are not intended to limit the present invention.

[0043] In the cargo transportation system based on multi-agent reinforcement learnin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a cargo transportation system based on multi-agent reinforcement learning. The cargo transportation system comprises cargo transportation agents, a grouping module and a modelconstruction module. The grouping module is used for acquiring the position coordinates of each freight intelligent agent and grouping all freight intelligent agents according to a dynamic grouping algorithm to obtain at least one freight intelligent agent group; the model construction module is used for carrying out weight division on the freight intelligent agents in each freight intelligent agent group through an implicit weight endowing algorithm, and carrying out implicit coordination control on a plurality of freight intelligent agents in each freight intelligent agent group; a neural network is constructed by adopting a centralized reviewer mode of a multi-agent depth deterministic strategy gradient algorithm, an optimized path of multiple freight agents is generated through a neural network, and the freight agents in the freight agent group bypass obstacles according to the optimized path and reach landmarks. The system can process a large number of freight intelligent agentswith large communication information amount, and is good in performance, high in efficiency and low in cost.

Description

technical field [0001] The invention belongs to the technical field of multi-agent systems, in particular to a cargo transportation system based on multi-agent reinforcement learning. Background technique [0002] With the development of artificial intelligence, communication and information technology, the research of multi-agent has been a research hotspot that many people pay attention to in recent years. Multi-agent systems can be widely used in public facilities detection, disaster environment investigation, military reconnaissance, storage and handling and other fields, and have been widely used in both military and civilian applications. In the process of cargo transportation, it is a very important issue to enable multiple freight agents to intelligently plan routes to reach multiple different locations to place goods, because this can speed up the efficiency of freight transportation and reduce labor costs. Now It is also increasingly becoming a research focus. Am...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06Q10/08G06N3/04G06N3/08
CPCG06Q10/083G06Q10/0833G06Q10/08355G06N3/08G06N3/045
Inventor 姜元爽宁立张涌冯圣中
Owner SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products