
Neural network model training method and device, equipment and storage medium

A neural network model training technology, applied in the fields of biological neural network models, neural learning methods, neural architectures, etc. It addresses the problems of agents lacking team awareness, agent values deviating, and low agent simulation levels, and achieves better joint actions and a higher agent simulation level.

Pending Publication Date: 2022-04-05
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

The existing model optimization method causes the agent's values to deviate and leaves the agent without team awareness, resulting in a low agent simulation level.



Embodiment Construction

[0125] The embodiments of the present application provide a neural network model training method, apparatus, device, and storage medium. They train the network model used for reinforcement learning in an environment where multiple agents collaborate, and encourage agents in the same camp to perform the actions that maximize both their own value and the team's value, so that the team's joint action is better and a higher level of agent simulation is achieved.

[0126] The terms "first", "second", "third", "fourth", etc. (if any) in the specification and claims of the present application and in the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It is to be understood that terms so used are interchangeable under appropriate circumstances, so that the embodiments of the application described herein can, for example, be practiced in sequences other than those illustrated or described herein. Furthermore, the ...


Abstract

The invention discloses a neural network model training method based on artificial intelligence technology. The method comprises the following steps: acquiring first global information and first unit information at a first moment; obtaining, through a behavior prediction network and based on the first global information and the first unit information, a first action distribution of each agent at the first moment; acquiring second global information and second unit information according to the first action distribution; determining reward information according to the first global information, the first unit information, the second global information and the second unit information; obtaining, through a value network and based on the global information and the unit information, a target own value and a target team value; and training the network according to the action distribution, the reward information, the target own value and the target team value. The invention further provides a device. The method encourages agents in the same camp to execute the actions that maximize both their own value and the team value, so that the joint action of the team is better and a higher agent simulation level is achieved.
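The abstract does not give formulas, so the following is only a minimal sketch of how an action distribution, reward information, an own value and a team value could be combined into one training loss. The `AgentStep` container, the `training_loss` function, the use of the camp's mean reward as the team-level target, and the `team_weight` and `value_coef` parameters are all illustrative assumptions, not details taken from the patent.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class AgentStep:
    """One agent's data for a single transition (hypothetical container)."""
    action_probs: List[float]  # action distribution from the behavior prediction network
    action: int                # index of the action that was executed
    reward: float              # reward derived from the first/second moment information
    own_value: float           # value network estimate of the agent's own value
    team_value: float          # value network estimate of the team value


def training_loss(steps: Sequence[AgentStep],
                  team_weight: float = 0.5,
                  value_coef: float = 0.5) -> float:
    """Scalar loss for one camp; the weighting scheme is an assumption."""
    n = len(steps)
    # Assumption: the team-level target is the mean reward of the camp.
    team_target = sum(s.reward for s in steps) / n

    policy_loss = 0.0
    value_loss = 0.0
    for s in steps:
        own_adv = s.reward - s.own_value        # how much better than the own-value estimate
        team_adv = team_target - s.team_value   # how much better than the team-value estimate
        # Blend both signals so an action is reinforced when it helps the agent and the team.
        advantage = (1.0 - team_weight) * own_adv + team_weight * team_adv
        policy_loss += -math.log(s.action_probs[s.action]) * advantage
        # Regress both value estimates toward their targets.
        value_loss += own_adv ** 2 + team_adv ** 2
    return policy_loss / n + value_coef * value_loss / n


# Two agents in the same camp, each with a uniform two-action distribution.
camp = [
    AgentStep([0.5, 0.5], action=0, reward=1.0, own_value=0.0, team_value=0.0),
    AgentStep([0.5, 0.5], action=1, reward=0.0, own_value=0.0, team_value=0.0),
]
loss = training_loss(camp)
```

In this sketch the second agent still receives a positive advantage through the team term even though its own reward is zero, which is one way to express the "team awareness" the abstract aims for. In a framework with automatic differentiation, the advantage would normally be treated as a constant when differentiating the policy term.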

Description

Technical field

[0001] The present application relates to the technical field of artificial intelligence, and in particular to a training method, device, equipment and storage medium of a neural network model.

Background technique

[0002] With the development of Internet technology, there are more and more types of online games, and Multiplayer Online Battle Arena (MOBA) is one of them. In this type of game, players are usually divided into two or more opposing camps and compete with each other on a scattered game map, and each player controls a selected game character through the interface to fight against the opponent.

[0003] The game character can not only be controlled by the player, but also can be controlled by an artificial intelligence (AI) model for battle. At present, the reinforcement learning algorithm is mainly used in the existing technology, so that the agent can learn strategies to maximize rewards or achieve specific goals in the process of interacting w...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): A63F13/67, A63F13/58, G06N3/04, G06N3/08
Inventor: 王伟轩, 邱福浩, 练振杰, 王亮, 韩国安
Owner TENCENT TECH (SHENZHEN) CO LTD