self-play

Star

Here are 40 public repositories matching this topic...

uclaml / SPPO

Star

The official implementation of Self-Play Preference Optimization (SPPO)

deep-learning fine-tuning self-play large-language-models rlhf

Updated Jul 6, 2024
Python

opendilab / DI-engine

Star

OpenDILab Decision AI Engine

python reinforcement-learning impala reinforcement-learning-algorithms minigrid atari imitation-learning distributed-system drl inverse-reinforcement-learning r2d2 smac mujoco multiagent-reinforcement-learning pytorch-rl self-play model-based-reinforcement-learning exploration-exploitation distributed-reinforcement-learning offline-rl

Updated Jul 6, 2024
Python

opendilab / LightZero

Star

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Updated Jul 8, 2024
Python

cestpasphoto / alpha-zero-general

Star

A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available

python reinforcement-learning pytorch numba alphago the-little-prince splendor alphago-zero alphazero machikoro self-play minivilles santorini santorini-game

Updated Jun 14, 2024
Python

dellalibera / gym-backgammon

Star

Backgammon OpenAI Gym

game reinforcement-learning openai-gym artificial-intelligence gym backgammon td-learning td-gammon temporal-differencing-learning openai-gym-environment self-play gym-env backgammon-game gym-backgammon

Updated May 16, 2024
Python

uclaml / SPIN

Star

The official implementation of Self-Play Fine-Tuning (SPIN)

deep-learning fine-tuning self-play large-language-models

Updated May 8, 2024
Python

opendilab / DI-star

Star

An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.

deep-learning deep-reinforcement-learning league artificial-intelligence starcraft2 self-play reinforcment-learning

Updated May 6, 2024
Python

tobiasemrich / SchafkopfRL

Star

AI agents for the bavarian card game Schafkopf trained with reinforcement learning

reinforcement-learning pytorch card-game schafkopf ppo self-play imperfect-information-game

Updated Apr 23, 2024
Python

e-dong / space-war-rl

Star

Recreating Bill Seiler's 1985 version of Space War and training RL agents with Self-Play

machine-learning reinforcement-learning ai pygame self-play pygame-wasm

Updated Apr 5, 2024
Python

rlsn / AlphaYun

Star

Play Bor-Bor Zan strategically!

reinforcement-learning nash-equilibrium self-play best-strategy

Updated Apr 3, 2024
Python

cmubig / sorts

Star

Code base for Social Robot Tree Search (SoRTS).

mcts intent-prediction self-play social-navigation

Updated Mar 19, 2024
Python

jianzhnie / RLZero

Star

A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.

reinforcement-learning multi-agent mcts alpha-zero self-play muzero

Updated Mar 11, 2024
Python

AutumnCrocus / shadow_sim

Star

Emulator and AI of Shadowverse

emulator machine-learning cardgame simulator ai deep-learning deep dcg imitation-learning shadowverse self-play

Updated Oct 3, 2023
Python

dellalibera / td-gammon

Star

TD-Gammon implementation

game reinforcement-learning neural-network pytorch artificial-intelligence convolutional-neural-networks backgammon value-function temporal-differencing-learning self-play

Updated Sep 25, 2023
Python

riturajkaushik / self-learning-tic-tac-toe

Star

Donald Michie's MENACE approach to an unbeatable self-learning Tic-Tac-Toe AI game

game python machine-learning reinforcement-learning gameplay tic-tac-toe-game self-play

Updated Aug 28, 2023
Python

Jackory / RPBT

Star

Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)

competition ppo population-based-training self-play multi-agent-reinforcement-learning risk-sensitive-preferences reinforcment-learning

Updated May 22, 2023
Python

TARTRL / TARTRL

Star

基于PyTorch的分布式强化学习框架

reinforcement-learning robotics pytorch game-ai distributed-training ppo self-play multi-agent-reinforcement-learning

Updated Apr 5, 2023
Python

egrund / Self-Play_DQN_Project

Star

Using an DQN agent trained on Tic-Tac-Toe and Connect Four as a base for an dynamically balanced opponent. (Student Project)

deep-learning neural-network tensorflow deep-reinforcement-learning dqn deep-q-network self-play

Updated Apr 3, 2023
Python

inspirai / TimeChamber

Star

A Massively Parallel Large Scale Self-Play Framework

reinforcement-learning deep-reinforcement-learning multi-agent self-play isaac-gym

Updated Jan 9, 2023
Python

ankursharma-iitd / AlphaZero-for-Go

Star

Implementation of Alpha Go Zero - Reinforcement Learning Project, COL870 @iit-delhi

reinforcement-learning deep-learning deep-reinforcement-learning artificial-intelligence mcts policy-gradient monte-carlo-tree-search game-playing-agent alphago-zero self-play

Updated Nov 21, 2022
Python

Improve this page

Add a description, image, and links to the self-play topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the self-play topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

self-play

Here are 40 public repositories matching this topic...

uclaml / SPPO

opendilab / DI-engine

opendilab / LightZero

cestpasphoto / alpha-zero-general

dellalibera / gym-backgammon

uclaml / SPIN

opendilab / DI-star

tobiasemrich / SchafkopfRL

e-dong / space-war-rl

rlsn / AlphaYun

cmubig / sorts

jianzhnie / RLZero

AutumnCrocus / shadow_sim

dellalibera / td-gammon

riturajkaushik / self-learning-tic-tac-toe

Jackory / RPBT

TARTRL / TARTRL

egrund / Self-Play_DQN_Project

inspirai / TimeChamber

ankursharma-iitd / AlphaZero-for-Go

Improve this page

Add this topic to your repo