- pycfr 21 A python implementation of Counterfactual Regret Minimization for poker
- strips 16 A python implementation of the STRIPS planning algorithm
- diffbot 13 A .NET library for the Diffbot Frontpage and Article APIs
- rl-tictactoe 10 A reinforcement learning agent for tic-tac-toe. Implements the example from Chapter 1 of Sutton and Barto.
- tstd0 8 An experiment with Thompson sampling and TD(0) on a grid world variant
Contributions in the last year 2 total Feb 10, 2015 – Feb 10, 2016
Longest streak 1 day May 13 – May 13
Current streak 0 days Last contributed