alphaGO nature 论文.pdf


立即下载 甲基蓝
2025-02-01
game games perfect optimal function search tree moves position length
2.6 MB

4 8 4 | N A T U R E | V O L 5 2 9 | 2 8 J A N U A R Y 2 0 1 6
ARTICLE
doi:10.1038/nature16961
Mastering the game of Go with deep
neural networks and tree search
David Silver1*, Aja Huang1*, Chris J. Maddison1, Arthur Guez1, Laurent Sifre1, George van den Driessche1,
Julian Schrittwieser1, Ioannis Antonoglou1, Veda Panneershelvam1, Marc Lanctot1, Sander Dieleman1, Dominik Grewe1,
John Nham2, Nal Kalchbrenner1, Ilya Sutskever2, Timothy Lillicrap1, Madeleine Leach1, Koray Kavukcuoglu1,
Thore Graepel1 & Demis Hassabis1
All games of perfect information have an optimal value function, v*(s),
which determines the outcome of the game, from every board position
or state s, under perfect play by all players. These games may be solved
by recursively computing the optimal value function in a search tree
containing approximately bd possible sequences of moves, where b is
the game’s breadth (number of legal moves per position) and d is its
depth (game length). In


game/games/perfect/optimal/function/search/tree/moves/position/length/ game/games/perfect/optimal/function/search/tree/moves/position/length/
-1 条回复
登录 后才能参与评论
-->