AlphaZero

About

(The project and its developers are not affiliated with Google)

This project is a replication of AlphaGo Zero described in Mastering the game of Go without human knowledge. To match the function of Alpha Zero described in Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm, the algorithms are implemented in a general framework which supports multiple board games (currently Go, mnk game, reversi).

Training

The game to train is specified in AlphaZero/config/game.yaml. The training can be started with

python -m AlphaZero.train.parallel.reinforcement

You may want to set the parameters of the system. You can do so by modifying the configuration files in AlphaZero/config.

<type_of_game>.yaml: options of game environment and the modules to be imported.
reinforce.yaml: learning parameters for reinforcement learning.
supervised.yaml: learning parameters for supervised learning.
rl_sys_config.yaml: the system settings of the trainer. The detailed explanation of each item is in Github Wiki.

Standalone Self Play Module

You can run the self play module on multiple computers.

python -m AlphaZero.train.parallel.selfplay <IP of master session>

You also need to set the system parameters including the port for connection in AlplaZero/config/rl_sys_config.yaml.

Go GUI Playing Interface

We use GoGUI for graphic UI. Although this project is not actively maintained now, our program theoretically supports all the Go visualization softwares using GTP.

Go to Program -> New Program to connect our program. Put python -m AlphaZero.gtp for command and the root directory of this project for working directory.

You can set the parameters of the player in AlphaZero/config/gtp.yaml. Only the first 4 items are important. You can also use command line arguments to override the settings in this file, which is useful when you want two players with different configuration.

You can hold matches between different programs.

gogui-twogtp -black "<command of black>" -white "<command of white>" -games <num of games> -size 19 -alternate -sgffile <dir for result> -auto

You can visualize the match.

gogui -program "gogui-twogtp -black \"<command of black>\" -white \"<command of white>\"" -computer-both

Remember to add backslash if necessary.

Command Line Playing Interface

The game to play is specified in AlphaZero/config/play_cmd.yaml. The game can be started with

python -m AlphaZero.play_cmd

Name		Name	Last commit message	Last commit date
Latest commit History 174 Commits
AlphaZero		AlphaZero
docs		docs
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
LICENSE-ROC		LICENSE-ROC
README.md		README.md
readthedocs.yml		readthedocs.yml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

AlphaZero

About

Training

Standalone Self Play Module

Go GUI Playing Interface

Command Line Playing Interface

About

Licenses found

Releases

Packages

Contributors 2

Languages

License

Licenses found

water-vapor/AlphaZero

Folders and files

Latest commit

History

Repository files navigation

AlphaZero

About

Training

Standalone Self Play Module

Go GUI Playing Interface

Command Line Playing Interface

About

Topics

Resources

License

Licenses found

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages