Neural network model training method and device, equipment and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of neural network model and training method, applied in the direction of biological neural network model, neural learning method, neural architecture, etc., can solve the problems of lack of team awareness, value deviation of intelligent agents, low simulation level of intelligent agents, etc., and achieve joint action The effect of excellent and high agent simulation level

Pending Publication Date: 2022-04-05

TENCENT TECH (SHENZHEN) CO LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Therefore, the model optimization method will cause the values of the agent to deviate, and lack of team awareness, resulting in a low simulation level of the agent

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0125] The embodiment of the present application provides a neural network model training method, device, device, and storage medium, which trains the network model used in reinforcement learning in an environment where multiple agents collaborate, and encourages agents in the same camp to perform The action that maximizes its own value and the value of the team makes the joint action of the team better and achieves a higher level of agent simulation.

[0126] The terms "first", "second", "third", "fourth", etc. (if any) in the specification and claims of the present application and the above drawings are used to distinguish similar objects, and not necessarily Used to describe a specific sequence or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances such that the embodiments of the application described herein, for example, can be practiced in sequences other than those illustrated or described herein. Furthermore, the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a neural network model training method realized based on an artificial intelligence technology. The method comprises the following steps: acquiring first global information and first unit information at a first moment; on the basis of the first global information and the first unit information, action distribution of each agent at the first moment is obtained through a behavior prediction network; acquiring second global information and second unit information according to the first action distribution; determining reward information according to the first global information, the first unit information, the second global information and the second unit information; based on the global information and the unit information, obtaining a target own value and a target team value through a value network; and training the network according to the action distribution, the reward information, the target own value and the target team value. The invention further provides a device. The method encourages the agents under the same camp to execute the action capable of maximizing the own value and the team value, so that the combined action of the team is better, and a higher agent simulation level is achieved.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, and in particular to a training method, device, equipment and storage medium of a neural network model. Background technique [0002] With the development of Internet technology, there are more and more types of online games, and Multiplayer Online Battle Arena (MOBA) is one of them. In this type of game, players are usually divided into two or more opposing camps and compete with each other on a scattered game map, and each player controls a selected game character through the interface to fight against the opponent. [0003] The game character can not only be controlled by the player, but also can be controlled by an artificial intelligence (AI) model for battle. At present, the reinforcement learning algorithm is mainly used in the existing technology, so that the agent can learn strategies to maximize rewards or achieve specific goals in the process of interacting w...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): A63F13/67A63F13/58G06N3/04G06N3/08

Inventor 王伟轩邱福浩练振杰王亮韩国安

Owner TENCENT TECH (SHENZHEN) CO LTD

Who we serve

R&D Engineer
R&D Manager
IP Professional

Why Patsnap Eureka

Industry Leading Data Capabilities
Powerful AI technology
Patent DNA Extraction

Social media

Patsnap Eureka Blog

Learn More

PatSnap group products

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Neural network model training method and device, equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology