katac4

An AlphaZero engine for Saiblo Connect4, featuring a pure Python implementation of key KataGo techniques.

Project Structure

katac4/
├── README.md        # This file
│
├── runs/            # TensorBoard logs
├── weights/         # Model checkpoints for each training run
│
├── docs/            # Documentation files
│   └── methods.md    # Overview of improvements over original AlphaZero
│
├── saiblo/          # Saiblo Connect4 submission package
│   ├── main.py       # Entry point
│   ├── game.py       # Standalone game environment (with zobrist hash)
│   └── search.py     # Monte Carlo Graph Search, used during online play
│
├── train.py         # Main training script
├── model.py         # Neural network model definition (b3c128nbt)
├── game.py          # Game environment
├── mcts.py          # Monte Carlo Tree Search, used during training
├── elo_eval.py      # Script for evaluating model ELO ratings
├── elo.json         # ELO ratings generated by elo_eval.py
├── elo_plot.py      # Plots ELO rating progression
├── export_model.py  # Converts model to TorchScript for deployment
├── benchmark.py     # Benchmarks model performance
└── human_play.py    # Allows human interaction with the model

Quick Start

Self-Play Training

Before starting self-play training, you may want to adjust a few parameters in train.py:

epochs: Total number of training epochs
epoch_size: Number of self-play games per epoch
parallel_games: Number of games run in parallel
num_gpus: Number of GPUs to utilize

Important

To exactly reproduce the most recent training run, do not change the default parameter values in the code.

To begin training, run:

python3 train.py

Model checkpoints will be saved to the weights/ directory. To monitor progress with TensorBoard:

tensorboard --logdir runs

ELO Evaluation

To evaluate model performance using ELO, edit the ai_list and weight_format variables in elo_eval.py as needed. Then run:

python3 elo_eval.py

Models will compete in randomized pairings, with results saved to elo.json. To visualize the rating progression:

python3 elo_plot.py

Note

The ELO evaluation process is computationally intensive. For reference, the most recent evaluation was based on approximately 70,000 games and required 2.5 days to complete on four RTX 4090 GPUs.

Submitting to Saiblo

Select your best-performing model checkpoint (typically the final one or the one with highest ELO), and set its path in export_model.py. Export it to TorchScript format by running:

python3 export_model.py

This will generate model.pt in the saiblo/ directory. To create your submission, zip only the files inside the saiblo/ folder—do not include the folder itself in the archive.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

katac4

Project Structure

Quick Start

Self-Play Training

ELO Evaluation

Submitting to Saiblo

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
docs		docs
runs/b3c128nbt_2025-05-24_20-47-22		runs/b3c128nbt_2025-05-24_20-47-22
saiblo		saiblo
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
benchmark.py		benchmark.py
elo.json		elo.json
elo_eval.py		elo_eval.py
elo_plot.py		elo_plot.py
export_model.py		export_model.py
game.py		game.py
human_play.py		human_play.py
mcts.py		mcts.py
model.py		model.py
requirements.txt		requirements.txt
train.py		train.py

License

GoodCoder666/katac4

Folders and files

Latest commit

History

Repository files navigation

katac4

Project Structure

Quick Start

Self-Play Training

ELO Evaluation

Submitting to Saiblo

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages