Playground for prototyping and quick or simple experiment sketches
hebbian_conditioning contains two associative learning examples for modifying robot behaviour.
reinforcement_learning contains a TD0 agent that can learn V and Q functions with standard TD0, Q-Learning and SARSA update functions using either tables or MLPs for representing the functions.
infth_feature_relevance contains some attempts at measuring feature relevance with information theoretic methods or a linear regression probe