AlphaZero versus Stockfish

Joined
2/7/08
Messages
3,261
Points
123
In this paper, we apply a similar but fully generic algorithm, which we call AlphaZero, to the games of chess and shogi as well as Go, without any additional domain knowledge except the rules of the game, demonstrating that a general-purpose reinforcement learning algorithm can achieve, tabula rasa, superhuman performance across many challenging domains.

https://arxiv.org/pdf/1712.01815.pdf
 
Back
Top Bottom