alphaGO nature 论文.pdf

立即下载 作者: 甲基蓝
上传时间: 2025-05-11
关键词: game games perfect optimal function search tree moves position length
大小 2.6 MB
描述

   4 8 4  |  N A T U R E  |  V O L  5 2 9  |  2 8  J A N U A R Y  2 0 1 6
ARTICLE
doi:10.1038/nature16961
Mastering the game of Go with deep
neural networks and tree search
David Silver1*, Aja Huang1*, Chris J. Maddison1, Arthur Guez1, Laurent Sifre1, George van den Driessche1, 
Julian Schrittwieser1, Ioannis Antonoglou1, Veda Panneershelvam1, Marc Lanctot1, Sander Dieleman1, Dominik Grewe1,
John Nham2, Nal Kalchbrenner1, Ilya Sutskever2, Timothy Lillicrap1, Madeleine Leach1, Koray Kavukcuoglu1, 
Thore Graepel1 & Demis Hassabis1
All games of perfect information have an optimal value function, v*(s),
which determines the outcome of the game, from every board position
or state s, under perfect play by all players. These games may be solved
by recursively computing the optimal value function in a search tree
containing approximately bd possible sequences of moves, where b is
the game’s breadth (number of legal moves per position) and d is its
depth (game length). In

目录
game/games/perfect/optimal/function/search/tree/moves/position/length/ game/games/perfect/optimal/function/search/tree/moves/position/length/

-1 条回复

登录后才能参与评论