Strategy selection method based on Actor-Critic framework in deep reinforcement learning
A technology of reinforcement learning and program selection, applied in the field of reinforcement learning, can solve problems such as increasing the complexity of the training process, and achieve the effect of increasing the search ability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0026] In order to describe the present invention more specifically, the technical solutions of the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0027] In this embodiment, the specific plan for the LunarLander-v2 mission is described. The goal of LunarLander-v2 is to simulate and control a lunar lander to complete the landing mission, so that it can land in the specified location area at a speed close to 0, and the input state s is 8 A continuous variable represents the position, velocity, angle, angular velocity and ground contact state of the lander. The output action a is the value range of the set A={1,2,3,4}, and the numbers in A represent four kinds of action behaviors : 1 → do nothing, 2 → ignite the left steering engine, 3 → ignite the main engine, 4 → ignite the right steering engine. Such as figure 1 As shown, the overall technical solution can be realized through the following specific...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com