Recommendation method, device, electronic device and storage medium based on action pruning
A recommendation method based on action pruning, applied in the computer field, addresses problems such as the slow convergence of reinforcement learning. It achieves the effects of accelerating convergence, improving learning efficiency, and improving user experience.
Embodiment Construction
[0038] To make the purpose, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.
[0039] Traditional reinforcement learning algorithms converge slowly in practical applications and can only be applied to learning tasks with small-scale action spaces. To address this, the embodiment of the present invention provides a new technique in the field of reinforcement learning, namely action pruning. Before each decision of the agent, the candidate a...
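The general idea of pruning candidate actions before each decision can be illustrated with a minimal sketch. The description above is truncated, so the actual pruning rule of the invention is not known here. The sketch assumes, purely for illustration, that pruning keeps the top-k candidates under a hypothetical relevance score and that the agent then chooses epsilon-greedily among the survivors. The names `prune_actions`, `select_action`, `relevance`, and `keep` are all assumptions, not terms from the patent.

```python
import random
from typing import Callable, Dict, List, Sequence


def prune_actions(
    candidates: Sequence[int],
    relevance: Callable[[int], float],  # hypothetical scoring function
    keep: int,
) -> List[int]:
    """Return the `keep` highest-scoring candidates (assumed pruning rule)."""
    ranked = sorted(candidates, key=relevance, reverse=True)
    return ranked[:keep]


def select_action(
    q_values: Dict[int, float],
    candidates: Sequence[int],
    epsilon: float,
    rng: random.Random,
) -> int:
    """Epsilon-greedy choice restricted to the pruned candidate set."""
    if rng.random() < epsilon:
        return rng.choice(list(candidates))  # explore within the pruned set
    return max(candidates, key=lambda a: q_values.get(a, 0.0))  # exploit


# Toy usage: 1000 items, pruned to 10 before the agent decides.
all_items = list(range(1000))
relevance = lambda item: -abs(item - 500)  # toy score: closeness to item 500
pruned = prune_actions(all_items, relevance, keep=10)

q_values = {a: 0.0 for a in all_items}
q_values[503] = 1.0  # pretend the agent has learned that item 503 is good
chosen = select_action(q_values, pruned, epsilon=0.0, rng=random.Random(0))
```

In this toy setting the agent evaluates 10 actions per decision instead of 1000, which is the kind of reduction that could plausibly speed up convergence in a large action space. How the real method scores and removes candidates would have to be taken from the full patent text.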