Allie is a GPT-2-like chess model that learns chess from human gameplay. It is deployed on Lichess.
- Python 3.11
- 8 GPUs for training (adjustable in configuration)
- 1 GPU for evaluation
- Approximately 60GB of storage space for datasets and model weights
- Clone the repository:

  ```
  git clone https://github.com/y0mingzhang/allie.git
  cd allie
  ```

- Install the package and its dependencies:

  ```
  pip install -e .
  ```
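  Before training, it is worth confirming that the environment sees the expected hardware. A minimal check, assuming PyTorch is installed as a dependency of the package:

  ```python
  # Quick environment check: Python version and visible CUDA devices.
  import sys

  import torch  # assumed to be installed by `pip install -e .`

  print(f"Python {sys.version_info.major}.{sys.version_info.minor}")  # expect 3.11
  print(f"CUDA available: {torch.cuda.is_available()}")
  print(f"Visible GPUs: {torch.cuda.device_count()}")  # 8 for training, 1 for evaluation
  ```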
- Download the training and evaluation datasets from Hugging Face. For a faster download, try:

  ```
  HF_HUB_ENABLE_HF_TRANSFER=1 huggingface-cli download --repo-type dataset --local-dir data "yimingzhang/allie-data"
  ```

- Place the downloaded content in a local directory named `data/`.
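  The downloaded splits are stored as JSONL, one record per line. A minimal sketch for peeking at the first record of the test split (the exact field names are a property of the dataset and are not assumed here):

  ```python
  # Inspect the first record of the annotated test split to see its schema.
  import json

  with open("data/lichess-2022-blitz-test/2022-test-annotated.jsonl") as f:
      record = json.loads(next(f))

  # Field names vary by dataset, so print the keys rather than assuming them.
  print(sorted(record.keys()))
  ```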
- Launch training (approximately 2 weeks on 8x A6000 GPUs):

  ```
  python src/modeling/main.py pretrain_config/medium.yaml
  ```

  Note: You can adjust gradient accumulation in the configuration file if training on fewer than 8 GPUs; see the sketch below.
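  The effective batch size is the product of the per-GPU batch size, the number of gradient accumulation steps, and the number of GPUs. A sketch of the arithmetic (the function and numbers below are illustrative; the actual key names in `pretrain_config/medium.yaml` may differ):

  ```python
  # Effective batch = per-GPU batch * gradient accumulation steps * #GPUs.
  # Training on fewer GPUs? Scale accumulation up so the product is unchanged.
  def grad_accum_steps(effective_batch: int, per_gpu_batch: int, n_gpus: int) -> int:
      assert effective_batch % (per_gpu_batch * n_gpus) == 0
      return effective_batch // (per_gpu_batch * n_gpus)

  # Hypothetical numbers: an effective batch of 512 with per-GPU batch 8
  # needs 8 accumulation steps on 8 GPUs, or 16 steps on 4 GPUs.
  print(grad_accum_steps(512, 8, 8))  # -> 8
  print(grad_accum_steps(512, 8, 4))  # -> 16
  ```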
- Download the Allie model weights from Hugging Face. For a faster download, try:

  ```
  HF_HUB_ENABLE_HF_TRANSFER=1 huggingface-cli download --repo-type dataset --local-dir models "yimingzhang/allie-models"
  ```

- Move the downloaded content to the `models/` directory in your local repository.
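  To confirm the weights landed where the evaluation scripts expect them, you can list the contents of `models/` (a generic sketch; the exact file layout depends on the Hugging Face repository):

  ```python
  # List everything under models/ to verify the checkpoint download.
  from pathlib import Path

  for path in sorted(Path("models").rglob("*")):
      if path.is_file():
          print(f"{path}  ({path.stat().st_size / 1e6:.1f} MB)")
  ```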
- Run evaluations:

  - For Allie-Policy:

    ```
    python src/evaluation/evaluate.py \
        --config pretrain_config/medium.yaml \
        --dataset data/lichess-2022-blitz-test/2022-test-annotated.jsonl \
        --decode policy \
        --output_file "allie-eval/allie-policy.json" \
        --quick
    ```

  - For Allie-Adaptive-Search, the time-adaptive MCTS variant:

    ```
    python src/evaluation/evaluate.py \
        --config pretrain_config/medium.yaml \
        --dataset data/lichess-2022-blitz-test/2022-test-annotated.jsonl \
        --decode adaptive-mcts \
        --output_file "allie-eval/allie-adaptive-search.json" \
        --quick
    ```

    Note: KV caching is not yet implemented for MCTS, so this decoding strategy runs very slowly.
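    Each run writes its results to the path given by `--output_file`. A minimal sketch for loading a finished run; the JSON schema is whatever `evaluate.py` emits, so this only inspects the top-level structure:

    ```python
    # Load a finished evaluation and inspect its top-level structure.
    import json

    with open("allie-eval/allie-policy.json") as f:
        results = json.load(f)

    # The schema is defined by evaluate.py; print the shape rather than assuming it.
    if isinstance(results, dict):
        print(sorted(results.keys()))
    else:
        print(type(results), len(results))
    ```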
  - For other supported inference algorithms (e.g., standard MCTS) and models, refer to `DECODE_STRATEGIES` in `src/evaluation/decode.py`.
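    To list every registered strategy name, you can print the registry's keys. This sketch assumes `DECODE_STRATEGIES` is a mapping keyed by the strings accepted by `--decode`, and that `src/` is importable (e.g., on `PYTHONPATH`):

    ```python
    # List the decoding strategies accepted by evaluate.py's --decode flag.
    # Assumes src/ is on PYTHONPATH so that evaluation.decode resolves.
    from evaluation.decode import DECODE_STRATEGIES

    for name in DECODE_STRATEGIES:
        print(name)  # e.g., "policy", "adaptive-mcts", ...
    ```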
- The `medium` folder contains a GPT-2-medium-like Allie model, which we use for evaluations in the paper.
- The `ablations` folder contains models trained for the ablation study in our paper, including:
  - Double parameters (`large`)
  - Half parameters (`small`)
  - Half compute (`short`)
  - Half training data (`half`)
MIT - see LICENSE.