v1.1.1
Changes
- increased readability of Q-learning agent's training loop
- added new hook for updating at the end of each training loop iteration
- removed bug in Q-learning agent when no TD error is recorded for failed transitions; now a NaN is recorded instead
- improvements to tests and docstrings