GitHub - gaetanserre/ray: A fork of the Ray framework that aims to implements two-players AlphaZero algorithm.

A fork of the Ray framework that aims to implements two-players AlphaZero algorithm

In ray, by default, the alpha-zero algorithm is for one-player game. Now, you can specify if your game is made for one or two players. You just have to set the mcts parameter is_two_players to True

Example:

    # === MCTS ===
    "mcts_config": {
        "puct_coefficient": 1.0,
        "num_simulations": 30,
        "temperature": 1.5,
        "dirichlet_epsilon": 0.25,
        "dirichlet_noise": 0.03,
        "argmax_tree_policy": False,
        "add_dirichlet_noise": True,
        "is_two_players": True,
    }

Example games

Some games along with their trained agent are implemented in the examples directory.

Connect2 (To win, connect two token of your color on a board mode of 1 row and 4 columns)
Tic Tac Toe
Connect4

For Connect2 and TicTacToe, their trained agent plays perfectly.

Ray provides a simple, universal API for building distributed applications.

Ray is packaged with the following libraries for accelerating machine learning workloads:

Tune: Scalable Hyperparameter Tuning
RLlib: Scalable Reinforcement Learning
Train: Distributed Deep Learning (beta)
Datasets: Distributed Data Loading and Compute

As well as libraries for taking ML and distributed apps to production:

Serve: Scalable and Programmable Serving
Workflows: Fast, Durable Application Flows (alpha)

There are also many community integrations with Ray, including Dask, MARS, Modin, Horovod, Hugging Face, Scikit-learn, and others. Check out the full list of Ray distributed libraries here.

Name		Name	Last commit message	Last commit date
Latest commit History 12,427 Commits
.buildkite		.buildkite
.github		.github
.gitpod		.gitpod
bazel		bazel
binder		binder
ci		ci
cpp		cpp
dashboard		dashboard
deploy		deploy
doc		doc
docker		docker
examples		examples
java		java
python		python
release		release
rllib		rllib
scripts		scripts
src		src
thirdparty		thirdparty
.bazelrc		.bazelrc
.clang-format		.clang-format
.clang-tidy		.clang-tidy
.editorconfig		.editorconfig
.flake8		.flake8
.gitignore		.gitignore
.gitpod.yml		.gitpod.yml
BUILD.bazel		BUILD.bazel
CONTRIBUTING.rst		CONTRIBUTING.rst
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
WORKSPACE		WORKSPACE
build-docker.sh		build-docker.sh
build.sh		build.sh
pylintrc		pylintrc
setup_hooks.sh		setup_hooks.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A fork of the Ray framework that aims to implements two-players AlphaZero algorithm

Example games

Ray provides a simple, universal API for building distributed applications.

About

Releases

Packages

Languages

License

gaetanserre/ray

Folders and files

Latest commit

History

Repository files navigation

A fork of the Ray framework that aims to implements two-players AlphaZero algorithm

Example games

Ray provides a simple, universal API for building distributed applications.

About

Resources

License

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages