Deep reinforcement learning model robustness enhancement method based on information bottleneck
A technology of reinforcement learning and information bottleneck, applied in the field of robustness enhancement of deep reinforcement learning model based on information bottleneck, can solve problems such as failure of decision-making process and unfavorable results, and achieve the effect of resisting influence
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0048] The solutions of the present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.
[0049]The robustness enhancement method of the deep reinforcement learning model based on the information bottleneck limits the state information in the deep reinforcement learning by setting the information bottleneck, and encodes the state information in the transfer tuple through an encoder. First, code the state observed in the environment, and input it into the policy network after coding, interact with the environment according to the actions of the policy network, and get the state of the next round, and then code the state to continuously interact with the environment to realize the strategy network. train. The training of the encoder is realized by adding a regular term to the loss term of the original deep reinforcement learning algorithm. Since the encoder and the policy network interact with each other, the idea of ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com