A DQN playing Pikomino

Dependencies:
keras, numpy

To play against the trained model:

$ ./play.py best_strategy.h5

You play first.
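
Internally, play.py presumably loads the saved Keras network from the given .h5 file. A minimal sketch of doing the same by hand, assuming only that best_strategy.h5 is a standard Keras model file:

    # Load the trained Q-network and inspect its shape.
    from keras.models import load_model

    model = load_model('best_strategy.h5')
    model.summary()  # expect a 237-unit input and a 12-unit output (see the training notes below)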

Example of a turn:

state: ([23, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35],[22],[25, 21, 24],[3, 0, 0, 0, 2, 0],[0, 0, 0, 1, 2, 0]) / total: 23
choose action [3, 9]: 

Content of the state:

  • the 1st array is the available tiles (the stash),
  • the 2nd array [22] is the tiles taken by the opponent (in that order),
  • the 3rd array [25, 21, 24] is the tiles you have already won (in that order),
  • the 4th array [3, 0, 0, 0, 2, 0] is the dice you have chosen: here you have 3 worms and 2 4's,
  • the 5th array [0, 0, 0, 1, 2, 0] is the dice that have just been rolled: here one 3 and two 4's,
  • the total 23 is the number of points in the chosen dice (3 * 5 + 2 * 4, a worm being worth 5 points); see the scoring sketch after this list.
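
As a sanity check on that total, here is a hypothetical helper (not part of this repo) that scores a chosen-dice count array, assuming the ordering [worms, 1's, ..., 5's] described above:

    # Hypothetical helper: score a chosen-dice count array ordered
    # [worms, 1's, 2's, 3's, 4's, 5's]; a worm is worth 5 points.
    def dice_points(chosen):
        worms, *faces = chosen
        return worms * 5 + sum(v * n for v, n in enumerate(faces, start=1))

    assert dice_points([3, 0, 0, 0, 2, 0]) == 23  # 3 worms * 5 + 2 fours * 4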

Actions:

  • actions below 6 mean choosing a die value and re-rolling: 0 to keep the worms, ..., 5 to keep the 5's, then roll again,
  • actions >= 6 mean choosing a die value and taking (or stealing) the corresponding tile; this ends the turn (see the decoding sketch after this list),
  • if no action is available, the turn (and a tile) is automatically lost.
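
The 12 indices therefore pair each of the 6 die values with the two outcomes. A small illustrative decoder (the function is hypothetical; the value ordering [worm, 1, ..., 5] matches the state arrays above):

    # Hypothetical decoder for the 12 action indices.
    FACES = ['worm', '1', '2', '3', '4', '5']

    def describe_action(a):
        if a < 6:
            return 'keep the ' + FACES[a] + ' dice, then roll again'
        return 'keep the ' + FACES[a - 6] + ' dice and take/steal the matching tile (ends the turn)'

    print(describe_action(3))  # keep the 3 dice, then roll again
    print(describe_action(9))  # keep the 3 dice and take/steal the matching tile (ends the turn)

In the example turn above, the legal actions were [3, 9]: both keep the rolled 3 (presumably the 4's cannot be chosen again, having already been kept), either re-rolling or ending the turn.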

To train a new model:

$ ./train.py  -e 5000 -s 500 -l4

Trains for 5000 episodes (5000 two-player games; the model plays both players).
Every 500 episodes, the model is evaluated and saved.
The model will have 4 hidden layers of 237 cells each: 237 is the width of the input layer (which represents the encoded state) and the default size of the hidden layers.
The output layer always has 12 cells (the Q-values of the 12 actions). A sketch of a network with this shape follows.
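
A minimal Keras sketch of such a network; only the 237 / 4 x 237 / 12 layout comes from the text above, while the activations, optimizer, and loss are assumptions:

    from keras.models import Sequential
    from keras.layers import Dense

    def build_model(input_width=237, hidden_layers=4, n_actions=12):
        # Dense Q-network: input_width -> hidden_layers x input_width -> n_actions.
        model = Sequential()
        model.add(Dense(input_width, activation='relu', input_shape=(input_width,)))
        for _ in range(hidden_layers - 1):
            model.add(Dense(input_width, activation='relu'))
        model.add(Dense(n_actions, activation='linear'))  # one Q-value per action
        model.compile(optimizer='adam', loss='mse')       # assumed DQN regression setup
        return model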
