Model training method based on reinforcement learning and related device
A reinforcement learning model training technique, applied in the computer field, that addresses the problem that a large number of environment interactions reduces the training efficiency of reinforcement learning models.
Embodiment Construction
[0091] The embodiments of the present application provide a model training method based on reinforcement learning and related devices, which can be applied to a system or program in a terminal device that includes a reinforcement-learning-based model training function. A preset reinforcement learning model and multiple target reinforcement learning models are obtained, where the preset reinforcement learning model is associated with the target reinforcement learning models. A target sample is then input into the preset reinforcement learning model, and iterative calculation is performed in the reinforcement learning environment to obtain a sample set. N experience samples are extracted from the sample set and combined with the target reinforcement learning models to establish a regularized Anderson objective function. The combined Bellman residual indicated by the Anderson objective function is then adjusted to obtain the Anderson coefficient vector, and the loss function is then determined based on t...
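The coefficient step described above can be illustrated with a minimal sketch. It assumes the regularized Anderson objective takes the commonly used form: minimize ||D a||^2 + reg * ||a||^2 subject to the entries of a summing to 1, where column i of D holds the Bellman residuals of the i-th target model on the N experience samples. That form, the function name anderson_coefficients, the parameter reg, and the random placeholder data are assumptions for illustration only; the excerpt does not state the exact objective or how the coefficients enter the loss function.

```python
import numpy as np

def anderson_coefficients(residuals, reg=1e-3):
    """Solve min_a ||D a||^2 + reg * ||a||^2  subject to  sum(a) = 1.

    residuals: (N, m) array; column i holds the Bellman residuals of the
    i-th target model on N experience samples (assumed layout).
    Returns the (m,) Anderson coefficient vector.
    """
    D = np.asarray(residuals, dtype=float)
    m = D.shape[1]
    # Regularized Gram matrix; reg > 0 keeps it positive definite.
    gram = D.T @ D + reg * np.eye(m)
    # Closed-form solution of the constrained problem:
    # a = gram^{-1} 1 / (1^T gram^{-1} 1)
    z = np.linalg.solve(gram, np.ones(m))
    return z / z.sum()

# Placeholder data standing in for Bellman residuals of m target models
# evaluated on N experience samples (hypothetical values).
rng = np.random.default_rng(0)
N, m = 32, 4
bellman_residuals = rng.normal(size=(N, m))

alpha = anderson_coefficients(bellman_residuals, reg=1e-2)
combined_residual = bellman_residuals @ alpha  # combined Bellman residual
```

Because each single-model choice (a unit coefficient vector) satisfies the constraint, the regularized objective at alpha is never worse than relying on any one target model alone.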