Tic-tac-toe environment for the ES training process #4

Zeta36 · 2017-04-22T06:25:59Z

Using as base your code, I have developed a new Tic-tac-toe environment for the ES training process. As this game can be studied in full depth by a classical min-max tree, I've used this classic AI to play against our neural network model in the "step" phase and to return so the reward.

The last result is a model (a simple "Linear" one) that thanks to the evolutionary computation can reach a zero-perfect game against the classical AI brute force strategy.

My code is here: https://github.com/Zeta36/pytorch-es-tic-tac-toe

I simplified also your code a little and I removed thing I knew I was not going to need.

Thanks for your work, friend.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tic-tac-toe environment for the ES training process #4

Tic-tac-toe environment for the ES training process #4

Zeta36 commented Apr 22, 2017

Tic-tac-toe environment for the ES training process #4

Tic-tac-toe environment for the ES training process #4

Comments

Zeta36 commented Apr 22, 2017