- pycfr 21 A python implementation of Counterfactual Regret Minimization for poker
- strips 15 A python implementation of the STRIPS planning algorithm
- diffbot 13 A .NET library for the Diffbot Frontpage and Article APIs
- rl-tictactoe 9 A reinforcement learning agent for tic-tac-toe. Implements the example from Chapter 1 of Sutton and Barto.
- tstd0 8 An experiment with Thompson sampling and TD(0) on a grid world variant
Contributions in the last year 3 total Dec 1, 2014 – Dec 1, 2015
Longest streak 1 day December 29 – December 29
Current streak 0 days Last contributed