Playground for prototyping and quick experiment sketches
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
adversarial
data
graphs
.gitignore
.gitmodules
LICENSE
README.md
age-dist-german-parliament-population.py
common.py
dimstack.py
hebbian_conditioning.py
homeostasis.py
infth_feature_relevance.py
music_beats.py
music_features.py
music_features_print_list.py
pendulum.py
random_survival_prob.py
reinforcement_learning.py

README.md

playground

Playground for prototyping and quick or simple experiment sketches

hebbian_conditioning contains two associative learning examples for modifying robot behaviour.

reinforcement_learning contains a TD0 agent that can learn V and Q functions with standard TD0, Q-Learning and SARSA update functions using either tables or MLPs for representing the functions.

infth_feature_relevance contains some attempts at measuring feature relevance with information theoretic methods or a linear regression probe