Feature filtering defense method for deep reinforcement learning model
The method concerns reinforcement learning models and is applied in the field of deep learning. It addresses problems such as non-normalization, increased training time, and failure to converge, and achieves the effect of improving training efficiency.
Embodiment Construction
[0038] To make the objectives, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below with reference to the accompanying drawings and embodiments. The specific embodiments described here serve only to explain the present invention and do not limit its scope of protection.
[0039] The following embodiments take a game environment as an example, in which the agent interacts with the environment and establishes a relationship with the environment's state. The object of the defense is the deep reinforcement learning model; reinforcement learning is generally formalized as a Markov Decision Process (MDP). In the interactive environment, the observed state s of the environment is collected, the agent takes an action a, a reward value R is given promptly according to the change in the environment state s, and the current state, action, rewa...
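The interaction loop described in [0039] can be illustrated with a minimal sketch. It is not the patent's implementation: `ToyGameEnv`, `collect_transitions`, and the `Transition` field names are hypothetical, the toy game stands in for the unspecified game environment, and storing the next state alongside the state, action, and reward is an assumption based on common replay-buffer practice, since the source text is truncated at that point.

```python
from collections import deque, namedtuple

# Field names are illustrative. next_state is an assumption: the source
# paragraph is truncated after "state, action, rewa...".
Transition = namedtuple("Transition", ["state", "action", "reward", "next_state"])


class ToyGameEnv:
    """Hypothetical 1-D game: positions 0..size-1, reaching the last one ends the episode."""

    def __init__(self, size=5):
        self.size = size
        self.pos = 0

    def reset(self):
        """Start a new episode and return the initial observed state s."""
        self.pos = 0
        return self.pos

    def step(self, action):
        """Apply action a (0 = left, 1 = right); return (next state, reward R, done)."""
        move = 1 if action == 1 else -1
        self.pos = min(max(self.pos + move, 0), self.size - 1)
        done = self.pos == self.size - 1
        reward = 1.0 if done else 0.0  # reward given according to the state change
        return self.pos, reward, done


def collect_transitions(env, policy, buffer, max_steps=100):
    """Run one episode, saving each (s, a, R, s') tuple into the buffer."""
    s = env.reset()
    for _ in range(max_steps):
        a = policy(s)                  # agent chooses action a from state s
        s_next, r, done = env.step(a)  # environment changes and returns reward R
        buffer.append(Transition(s, a, r, s_next))
        s = s_next
        if done:
            break
    return buffer
```

For example, `collect_transitions(ToyGameEnv(), lambda s: 1, deque(maxlen=1000))` runs one episode with a policy that always moves right and fills the buffer with the resulting transitions. A bounded `deque` is used here so that old experience is discarded automatically once the buffer is full; that choice is also an assumption, not something the source specifies.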