SAI: a sensible artificial intelligence that plays with handicap and targets high scores in 9x9 Go

Metta C.
2020

Abstract

We develop a new framework for the game of Go that targets a high score, and thus perfect play. We integrate this framework into the Monte Carlo tree search and policy iteration learning pipeline introduced by Google DeepMind with AlphaGo. Training on 9×9 Go produces a superhuman Go player, which shows that the framework is stable and robust. We show that this player can play effectively with both positional and score handicaps. We develop a family of agents that can target high scores against any opponent, recover from very severe disadvantages against weak opponents, and avoid suboptimal moves.
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
ISBN: 978-1-64368-101-6
Keywords: Machine Learning, Reinforcement Learning, Game Theory
Files in this item:
File: 22_SAI a Sensible Artificial Intelligence that plays with handicaps and targets high score in 9x9 Go.pdf
Access: open access
Description: SAI: a Sensible Artificial Intelligence that plays with handicap and targets high scores in 9×9 Go
Type: Publisher's version (PDF)
License: Creative Commons
Size: 789.93 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/556463
Citations
  • PMC: not available
  • Scopus: 9
  • ISI: 7