Go Attack

This repository contains code for studying the adversarial robustness of KataGo.

Read about our research here: https://arxiv.org/abs/2211.00241.

View our website here: https://goattack.far.ai/.

To run our adversary with Sabaki, see this guide.

Development / testing information

To clone this repository, run one of the following commands

# Via HTTPS
git clone --recurse-submodules https://github.com/AlignmentResearch/go_attack.git

# Via SSH
git clone --recurse-submodules git@github.com:AlignmentResearch/go_attack.git

You can run pip install -e .[dev] inside the project root directory to install all necessary dependencies.

To run a pre-commit script before each commit, run pre-commit install (pre-commit should already have been installed in the previous step). You may also want to run pre-commit install from engines/KataGo-custom to install that repository's respective commit hook.

Git submodules

Modifications to KataGo are not tracked in this repository and should instead be made to the AlignmentResearch/KataGo-custom repository. We use code from KataGo-custom in this repository via a Git submodule.

engines/KataGo-custom tracks the stable branch of the KataGo-custom repository.
engines/KataGo-raw tracks the master branch of https://github.com/lightvector/KataGo.

Individual containers

We run KataGo within Docker containers. More specifically:

The C++ portion of KataGo runs in the container defined by compose/cpp/Dockerfile.
The Python training portion of KataGo runs in the container defined at compose/python/Dockerfile.

The Dockerfiles contain instructions for how to build them individually. This is useful if you want to test just one of the docker containers.

A KataGo executable can be found in the /engines/KataGo-custom/cpp directory inside the container. To run a docker container, you can use a command like

docker run --gpus all -v ~/go_attack:/go_attack -it humancompatibleai/goattack:cpp

Docker compose

Within the compose directory of this repo are a few docker-compose .yml files that automate the process of spinning up the various components of training.

Each .yml file also has a corresponding .env that configures more specific parameters of the run ( e.g. what directory to write to, how many threads to use, batch size, where to look for other config files ).

Website and analysis notebooks

See AlignmentResearch/KataGoVisualizer.

Baseline attacks

In addition to the learned attacks, we also implement 5 baseline, hardcoded attacks:

Edge attack, which plays random vertices in the outermost available ring of the board
Random attack, which simply plays random legal moves
Pass attack, which always passes at every turn
Spiral attack, which deterministically plays the "largest" legal move in lexicographical order in polar coordinates (going counterclockwise starting from the outermost ring)
Mirror Go, which plays the opponent's last move reflected about the y = x diagonal, or the y = -x diagonal if they play on y = x. If the mirrored vertex is taken, then the policy plays the "closest" legal vertex by L1 distance.

You can test these attacks by running baseline_attacks.py with the appropriate --strategy flag (edge, random, pass, spiral, or mirror). Run python scripts/baseline_attacks.py --help for more information about all the available flags.

Name		Name	Last commit message	Last commit date
Latest commit History 252 Commits
.circleci		.circleci
.vscode		.vscode
ci		ci
compose		compose
configs		configs
controllers		controllers
engines		engines
kubernetes		kubernetes
openings		openings
plot		plot
sabaki		sabaki
scripts		scripts
src/go_attack		src/go_attack
tests		tests
.codespell.skip		.codespell.skip
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
setup.cfg		setup.cfg
setup.py		setup.py

License

AlignmentResearch/go_attack

Folders and files

Latest commit

History

Repository files navigation

Go Attack

Development / testing information

Git submodules

Individual containers

Docker compose

Website and analysis notebooks

Baseline attacks

About

Resources

License

Stars

Watchers

Forks

Languages