GitHub - cachett/ReinforcementLearningHalite3: First try of reinforcement learning with Dueling Deep Q-Network for the AI competition Halite 3 launched by Two Sigma. My code have reach rank 400/6000 in the competition without any rule based decision, only reinforcement learning.

To see the reinforcement learning bot competiting go to: https://halite.io/user/?user_id=1141

Dueling Deep Q-Network with prioritized experience replay playing halite 3 which is an AI competition launched by Two Sigma. The DDQN algorithms has reached rank 400/6000 with no hard coded decision and only reinforcement learning. Train a model take about 24 hours. I am generating experiences with 6 bots playing in parallel and then I update the DDQN with the PER sampled from the experiences.

To start training:

python trainer-Para.py

To visualize progress wrt game score:

python display_progress.py

To making Bot1 playing against Bot2:

halite "python Bot1.py" "python Bot2.py"

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
__pycache__		__pycache__
docs		docs
hlt		hlt
DQN-PER-Para0.py		DQN-PER-Para0.py
DQN-PER-Para1.py		DQN-PER-Para1.py
DQN-PER-Para2.py		DQN-PER-Para2.py
DQN-PER-Para3.py		DQN-PER-Para3.py
DQN-PER-Para4.py		DQN-PER-Para4.py
DQN-PER-Para5.py		DQN-PER-Para5.py
First-Para-GoodMove.png		First-Para-GoodMove.png
MyBot.py		MyBot.py
PrioritizedExperienceReplayBuffer.py		PrioritizedExperienceReplayBuffer.py
README.md		README.md
SumTree.py		SumTree.py
display_progress.py		display_progress.py
halite.PNG		halite.PNG
halite.exe		halite.exe
my_model.h5		my_model.h5
my_model1800.h5		my_model1800.h5
my_model1900.h5		my_model1900.h5
my_model2000.h5		my_model2000.h5
my_model2100.h5		my_model2100.h5
my_model2200.h5		my_model2200.h5
progress.txt		progress.txt
test.py		test.py
trainer-Para.py		trainer-Para.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

cachett/ReinforcementLearningHalite3

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages