Applying the deep learning techniques from AlphaGo to play tic-tac-toe
These are the code examples that go with my talk; the slides are in AlphaToe.pdf.
Besides the slides, the file script/policy_gradient.py is a good starting point for the project.