Add mountain car reward heatmap and reward-vs-time plotters #177

shwang · 2020-03-19T03:33:00Z

This moves the heatmap and reward-vs-time plotting code from my ipynb into a Python file.

Not sure if experiments/ is the best place to put this. Maybe imitation.scripts is better since the user might want to import these plotting functions into another file.

While translating the code from the notebook, I found some bugs related, yet again, to VecNormalize. In this case, it seems my reward functions might not actually have received normalized inputs.

So I'd expect heatmaps to look different upon a new run.
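
To illustrate the kind of bug described above, here is a minimal sketch of wrapping a reward function so that it always sees normalized observations. The names `make_normalized_reward_fn`, `obs_mean`, and `obs_var` are hypothetical and this is not the code from this PR; the clipping value and epsilon follow common VecNormalize-style defaults, but they are assumptions here.

```python
import numpy as np


def make_normalized_reward_fn(reward_fn, obs_mean, obs_var, clip=10.0, eps=1e-8):
    """Wrap `reward_fn` so it receives normalized observations (hypothetical helper)."""
    obs_mean = np.asarray(obs_mean, dtype=float)
    obs_std = np.sqrt(np.asarray(obs_var, dtype=float) + eps)

    def normalize(obs):
        # Standardize with fixed statistics, then clip, as a normalizing wrapper would.
        scaled = (np.asarray(obs, dtype=float) - obs_mean) / obs_std
        return np.clip(scaled, -clip, clip)

    def wrapped(obs, act, next_obs):
        # The inner reward function never sees raw observations.
        return reward_fn(normalize(obs), act, normalize(next_obs))

    return wrapped


def toy_reward(obs, act, next_obs):
    """Toy reward for illustration: the first coordinate of next_obs."""
    return next_obs[:, 0]


raw_obs = np.array([[-0.5, 0.0], [0.5, 0.02]])
acts = np.array([0, 2])
norm_reward = make_normalized_reward_fn(
    toy_reward, obs_mean=[-0.5, 0.0], obs_var=[0.25, 0.0004]
)
rewards = norm_reward(raw_obs, acts, raw_obs)
```

Calling `toy_reward` directly on `raw_obs` gives different values than `norm_reward`, which is why a heatmap built from unnormalized inputs can change once the inputs are normalized.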

codecov · 2020-03-19T03:41:23Z

Codecov Report

Merging #177 into master will increase coverage by 2.25%.
The diff coverage is 98.87%.

@@            Coverage Diff             @@
##           master     #177      +/-   ##
==========================================
+ Coverage   85.63%   87.89%   +2.25%     
==========================================
  Files          64       67       +3     
  Lines        4519     4683     +164     
==========================================
+ Hits         3870     4116     +246     
+ Misses        649      567      -82

Impacted Files                                  Coverage Δ
src/imitation/analysis/mountain_car_plots.py    98.07% <98.07%> (ø)
src/imitation/rewards/common.py                 100.00% <100.00%> (ø)
src/imitation/rewards/serialize.py              100.00% <100.00%> (ø)
src/imitation/util/reward_wrapper.py            89.58% <100.00%> (-0.22%) ⬇️
src/imitation/util/rollout.py                   95.70% <100.00%> (ø)
tests/test_density_baselines.py                 100.00% <100.00%> (ø)
tests/test_mountain_car_plots.py                100.00% <100.00%> (ø)
... and 11 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 96d9f8d...19c798e.

AdamGleave

Overall good. One change I'd suggest in your style is using explicit matplotlib Figure and Axes objects, rather than depending on the global state in matplotlib. This is more flexible and less error prone. Otherwise fairly minor comments.
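
As a sketch of the style suggested here, the function below draws on an explicit `Axes` and returns the `Figure`, instead of relying on pyplot's implicit current figure. The function name `plot_reward_heatmap`, the axis labels, and the fake data are assumptions made for illustration; they are not taken from this PR's code.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend (assumption, so this runs headless)
import matplotlib.pyplot as plt
import numpy as np


def plot_reward_heatmap(rewards, ax=None, title="reward"):
    """Draw a 2D `rewards` array as a heatmap on an explicit Axes; return its Figure."""
    if ax is None:
        # Create a fresh figure rather than drawing on whatever is "current".
        fig, ax = plt.subplots(figsize=(4, 3))
    else:
        fig = ax.figure
    mesh = ax.pcolormesh(rewards)
    fig.colorbar(mesh, ax=ax)
    ax.set_xlabel("position bin")
    ax.set_ylabel("velocity bin")
    ax.set_title(title)
    return fig


fake_rewards = np.linspace(-1.0, 0.0, 20).reshape(4, 5)
fig = plot_reward_heatmap(fake_rewards, title="fake rewards")
```

Because the caller can pass in `ax`, the same function can fill one panel of a larger `plt.subplots(1, 2)` grid, and the caller decides whether to save, show, or close the returned figure.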

experiments/mountain_car_plots.py

src/imitation/util/reward_wrapper.py

shwang · 2020-03-24T06:06:12Z

Thanks for the review. I've unsuccessfully attempted before to figure out how to gracefully use the OOP interface to matplotlib, so it was good to learn a cleaner way to manage these plots.

It's a pain for me to manually check whether the plots are working, so I'm going to write a smoke test of some sort. I'll ask for another round of review after that.
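
A smoke test of this kind could look roughly like the sketch below: build a figure from fake data, render it to an in-memory buffer, and check that a PNG came out. `plot_reward_vs_time` is a hypothetical stand-in for a real plotting function, not the one in this PR, and the fake data is random noise.

```python
import io

import matplotlib

matplotlib.use("Agg")  # headless backend so the test needs no display
import matplotlib.pyplot as plt
import numpy as np


def plot_reward_vs_time(rewards):
    """Hypothetical stand-in plotter: reward per timestep on an explicit Axes."""
    fig, ax = plt.subplots()
    ax.plot(np.arange(len(rewards)), rewards)
    ax.set_xlabel("timestep")
    ax.set_ylabel("reward")
    return fig


def smoke_test_plot():
    """Render a plot of fake rewards to memory and return the PNG bytes."""
    rng = np.random.RandomState(0)
    fig = plot_reward_vs_time(rng.randn(50))
    buf = io.BytesIO()
    fig.savefig(buf, format="png")  # fails loudly if rendering is broken
    plt.close(fig)  # avoid leaking figures across tests
    return buf.getvalue()


png_bytes = smoke_test_plot()
```

The test only checks that plotting runs end to end; it says nothing about whether the plot is visually correct.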

shwang · 2020-03-28T23:18:36Z

I moved some reward-related helper functions and util.reward_wrapper.RewardFn into a new submodule imitation.rewards.common.

To make the functions importable and testable, I moved mountain_car_plots.py from experiments/ to a new submodule imitation.analysis.mountain_car_plots. (Also considered imitation.scripts, but this isn't really a script right now, more like a collection of plotting tools.)

This should be ready for another round of review now.

shwang · 2020-03-29T00:38:31Z

Somehow, this branch set off the Sacred race condition (the one fixed by IDSIA/sacred#473) in 4 of the 5 times I reran the tests.

Maybe this is bad luck on my part: in one of the reruns, train_experts.sh was the breaking script, and in another rerun train_experts.sh was fine but transfer_learn...sh was broken. Or maybe it is because CircleCI somehow changed.

shwang · 2020-03-29T00:56:44Z

I think it would be really helpful for me if Sacred configs were picklable. This would allow us to upgrade to a version of Sacred that doesn't run into the FileStorageObserver race condition when I'm running local experiments, and let me use the same version of Sacred that is used by the Minecraft project.

I asked on IDSIA/sacred#508 whether all they need is to drop in pyrsistent.PMap.

AdamGleave

LGTM. The new OO style for matplotlib is much easier to understand, and +1 for tests.

There are a few minor changes I've suggested, but I'm marking this as approved since there's no need for me to review again.

src/imitation/analysis/mountain_car_plots.py

src/imitation/rewards/common.py

shwang added 6 commits March 18, 2020 19:58

Add mountain car heatmap/reward plot generators

d648fe9

lint

338c1f9

RewardFn: Allow Optional steps

6c674ba

Add pandas and seaborn requirements

e3a6dc9

more lint

a72e2bd

more lint

348586a

shwang requested a review from AdamGleave March 19, 2020 03:33

AdamGleave requested changes Mar 19, 2020

Address comments

1ab2859

shwang added 7 commits March 27, 2020 13:42

RewardFn: Move to rewards.common

7d54551

mc_plots: Move build_norm_reward_fn to common

a5f8c87

Actually add common.py, whoops

f9b316e

Lint, docstrings, don't auto-save figs

c84b60f

Move mountain_car_plots to imitation.analysis

4464e51

Needs to be in package for importing, testing and otherwise.

mc_plots: Add tests and lint

a6e981a

tests/test_{=>mc_}plots.py

72e1731

shwang added 5 commits March 28, 2020 16:23

setup.py: Use find_namespace_packages

78eabf6

Allows implicit namespace packages (PEP 420). https://setuptools.readthedocs.io/en/latest/setuptools.html#find-namespace-packages

test_mc_plots: Fix docstring, rename file

cb9dce4

Revert "setup.py: Use find_namespace_packages"

540659b

This reverts commit 78eabf6.

Add imitation.analysis.__init__

39cd99f

test_mc_plots: Pickle and load fake rollouts

705c6bc

shwang requested a review from AdamGleave March 29, 2020 01:02

AdamGleave approved these changes Mar 31, 2020

shwang and others added 2 commits April 7, 2020 15:06

Update src/imitation/analysis/mountain_car_plots.py

8d24f99

Co-Authored-By: Adam Gleave <adam@gleave.me>

Update src/imitation/rewards/common.py

e20a56f

Co-Authored-By: Adam Gleave <adam@gleave.me>

Address review

19c798e

shwang merged commit d4c2806 into master Apr 7, 2020

shwang deleted the mc_plots branch April 7, 2020 23:20
