Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Strategy improvement method of search game tree on Go

A strategy, Go technology, applied in the field of strategy improvement of search game tree, can solve the problem of no reduction in search time, insufficient, etc.

Pending Publication Date: 2021-09-10
沈阳雅译网络技术有限公司
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the minimax search method and some other pruning algorithms can effectively improve the search efficiency, these methods are not enough for a deep search tree like Go, and the search time has not been reduced to a satisfactory result.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Strategy improvement method of search game tree on Go
  • Strategy improvement method of search game tree on Go
  • Strategy improvement method of search game tree on Go

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The present invention provides a kind of strategy improvement method of the search game tree on Go, comprises the following steps:

[0030] 1) Establish a search tree with the current state as the root node, the tree is established, and all other nodes are given implicitly;

[0031] 2) Select a child node of the root node to simulate. If there is a child node that has not been simulated, then randomly select a child from the child nodes of the root node for simulation; if all child nodes have been simulated at least once, then select The highest child node of the UCB tree;

[0032] 3) Simulate from the selected child node to the end of the leaf node; the simulation strategy combines uniform sampling and minimax strategy;

[0033] 4) Backpropagate the final result of the simulation to the root node, and the Q and N values ​​of all leaves on the path are updated;

[0034] 5) Repeat steps 1)-4) multiple times, and finally select the node with the highest utilization item...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a strategy improvement method for a search game tree on Go. The method comprises the following steps: establishing a search tree by taking a current state as a root node; selecting a child node of the root node for simulation, and if a child node is not simulated, randomly selecting a child from the child node of the root node for simulation; if all the child nodes are simulated for at least one time, selecting the child node with the highest UCB sub-tree; starting to simulate from the selected child node to the leaf node; combining the simulation strategy with uniform sampling and a minimum and maximum strategy; propagating the final simulation result reversely to the root node, and updating the Q value and the N value of the action value function of all leaves on the path; and repeating the steps for many times, and finally selecting the node with the highest utilization item score in the UCB. According to the invention, the improved algorithm is applied to the search strategy of the Go, the evaluation of GNUGo and CGOS is passed, and the final experimental result shows that the algorithm can improve the accuracy of game search in the Go.

Description

technical field [0001] The invention relates to a strategy improvement method for searching a game tree on Go, in particular to a strategy improvement method for searching a game tree on Go. Background technique [0002] Monte Carlo methods have a long history in numerical algorithms and have achieved notable success in various artificial intelligence game algorithms, especially in games of incomplete information such as Scrabble and bridge. However, its real success on the computer is through the recursive application of the Monte Carlo method in the tree building process, which is the main research content of MCTS. Go is one of the few classic games in which human players are far ahead of computer players, MCTS has played a huge role in closing this gap, and it now competes with the best human players on small boards, although MCT plays on standard The 19×19 chessboard is well below their level. Go is a difficult game for computers to play: it has a high branching factor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/22G06F16/2457G06F16/248
CPCG06F16/2246G06F16/2457G06F16/248
Inventor 宁义明杨木润赵闯
Owner 沈阳雅译网络技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products