Improved Policy Networks for Computer Go

Tristan Cazenave

Published: 2017, Last Modified: 30 Sept 2024ACG 2017EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Golois uses residual policy networks to play Go. Two improvements to these residual policy networks are proposed and tested. The first one is to use three output planes. The second one is to add Spatial Batch Normalization.