GitHub - BHUVAN-RJ/Alpha-zero: Implemented an AlphaZero-style reinforcement learning agent using neural networks and Monte Carlo Tree Search, trained via self-play to play Tic-Tac-Toe and Connect Four from scratch.

AlphaZero Implementation with Monte Carlo Tree Search for Tic-Tac-Toe and Connect Four Developed a simplified AlphaZero-style reinforcement learning framework integrating neural networks with Monte Carlo Tree Search (MCTS). Implemented self-play training to learn optimal strategies from scratch and applied the system to Tic-Tac-Toe and Connect Four, demonstrating policy and value network training, game state evaluation, and search-based decision making.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
MCTS.py		MCTS.py
README.md		README.md
alpha_zero.py		alpha_zero.py
alpha_zero_parallel.py		alpha_zero_parallel.py
connectfour.py		connectfour.py
export_gif.py		export_gif.py
game_visualization.py		game_visualization.py
mcts_parallel.py		mcts_parallel.py
model_6_ConnectFour.pt		model_6_ConnectFour.pt
optimizer_6_ConnectFour.pt		optimizer_6_ConnectFour.pt
res_blocks.py		res_blocks.py
training.py		training.py
valuation.py		valuation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages