Crowd evacuation simulation method and system based on deep reinforcement learning algorithm
A technology of reinforcement learning and simulation methods, applied in neural learning methods, design optimization/simulation, computing, etc., can solve the problems of high algorithm complexity, slow algorithm convergence speed, and algorithm difficulty in obtaining results, etc., to improve the convergence speed, The effect of optimizing the results and reducing the amount of calculation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0044] In this embodiment, a crowd evacuation simulation method based on a deep reinforcement learning algorithm is disclosed, including:
[0045] Perform initial setting of parameters in the evacuation scene simulation model according to the scene information and crowd parameter information;
[0046] Group the crowd and divide the leaders within the group;
[0047] Each leader selects the best exit as the evacuation target, and uses the improved multi-agent depth deterministic strategy gradient algorithm for global path planning to obtain the optimal evacuation path;
[0048] Ordinary pedestrians in the group follow the movement of the leader in the group.
[0049] Further, each group of leaders is regarded as an agent, and the improved multi-agent deep deterministic policy gradient algorithm is used for global path planning, including:
[0050] Set the movable direction and current position of the agent;
[0051] Set the reward and return mechanism of the Critic network, an...
Embodiment 2
[0115] In this embodiment, a crowd evacuation simulation system based on a strategy-optimized deep reinforcement learning algorithm is disclosed, including:
[0116] The initialization setting module performs initialization setting of parameters in the evacuation scene simulation model according to the scene information and crowd parameter information;
[0117] The leader selection module in the group realizes the grouping of people and selects the leader in the group;
[0118] In the evacuation simulation module, each leader selects the best exit as the evacuation target, uses the improved multi-agent depth deterministic strategy gradient algorithm to perform global path planning, and obtains the optimal evacuation path. Ordinary pedestrians in the group follow the movement of the leader in the group .
Embodiment 3
[0120] An electronic device, comprising a memory, a processor, and computer instructions stored in the memory and run on the processor, when the computer instructions are run by the processor, the crowd evacuation simulation based on the deep reinforcement learning algorithm disclosed in Embodiment 1 is completed steps described in the method.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com