Tic-tac-toe Simple RL example of Tic-tac-toe. Train Training for 10000 games: lr AI win AI lose tie 0.1 7994 978 1028