RL models clean up #112

djbyrne · 2020-07-12T10:00:01Z

Before submitting

Was this discussed/approved via a Github issue? (no need for typos and docs improvements)
Did you read the contributor guideline, Pull Request section?
Did you make sure to update the docs?
Did you write any new necessary tests?

What does this PR do?

Refactors RL models to use train_batch structure as seen in #107

general clean up of docstrings and methods

PR review

Did you have fun?

👍

…lightning-bolts

…-bolts

pep8speaks · 2020-07-12T10:00:07Z

Hello @djbyrne! Thanks for updating this PR.

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-07-14 15:48:24 UTC

codecov · 2020-07-12T10:11:14Z

Codecov Report

Merging #112 into master will increase coverage by 0.00%.
The diff coverage is 86.48%.

@@           Coverage Diff           @@
##           master     #112   +/-   ##
=======================================
  Coverage   91.91%   91.92%           
=======================================
  Files          77       78    +1     
  Lines        3944     4010   +66     
=======================================
+ Hits         3625     3686   +61     
- Misses        319      324    +5

Flag	Coverage Δ
#unittests	`91.92% <86.48%> (+<0.01%)`	⬆️

Impacted Files	Coverage Δ
pl_bolts/models/rl/double_dqn_model.py	`94.44% <ø> (+20.25%)`	⬆️
pl_bolts/models/rl/n_step_dqn_model.py	`100.00% <ø> (ø)`
pl_bolts/models/rl/noisy_dqn_model.py	`95.83% <ø> (+18.05%)`	⬆️
pl_bolts/models/rl/per_dqn_model.py	`57.14% <9.09%> (-23.81%)`	⬇️
pl_bolts/models/rl/dqn_model.py	`82.88% <75.00%> (+0.35%)`	⬆️
pl_bolts/datamodules/experience_source.py	`97.72% <97.72%> (ø)`
pl_bolts/datamodules/__init__.py	`100.00% <100.00%> (ø)`
pl_bolts/models/rl/common/experience.py	`97.08% <100.00%> (+0.14%)`	⬆️
pl_bolts/models/rl/reinforce_model.py	`97.43% <100.00%> (+0.02%)`	⬆️
...l_bolts/models/rl/vanilla_policy_gradient_model.py	`96.55% <100.00%> (-1.16%)`	⬇️
... and 11 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 024b574...79e7e5c. Read the comment docs.

pl_bolts/models/rl/dqn_model.py

Borda · 2020-07-12T22:13:38Z

pl_bolts/models/rl/per_dqn_model.py

+            self.episode_reward = 0
+            self.episode_steps = 0


shall this be rater at the beginning rather than the end?

Im not sure I understand

that you reset the episode_steps and the other when it is done... so shall it be more logical to reset it before you start rather?

Ah I see. The done is a local variable that is retrieved after taking a step on line 72, so the done check must come after that, so I think it makes more sense to do it at the end

pl_bolts/models/self_supervised/amdim/networks.py

…jbyrne/pytorch-lightning-bolts into enhancement/rl_models_clean_up

mergify · 2020-07-29T23:31:38Z

This pull request is now in conflict... :(

Donal and others added 16 commits June 24, 2020 07:56

Updated RL docs with latest models

9c06583

Merge branch 'master' of https://github.com/PyTorchLightning/pytorch-…

33be076

…lightning-bolts

Merge branch 'master' of https://github.com/PyTorchLightning/pytorch-…

fdc92f9

…lightning-bolts

Merge branch 'master' of https://github.com/PyTorchLightning/pytorch-…

682bbe6

…lightning-bolts

Merge branch 'master' of https://github.com/PyTorchLightning/pytorch-…

17073bc

…lightning-bolts

Merge branch 'master' of https://github.com/PyTorchLightning/pytorch-…

d05db21

…lightning-bolts

Updated RL docs with latest models

8cde396

Merge branch 'master' of https://github.com/djbyrne/pytorch-lightning…

96aaa97

…-bolts

Cleaned up avg_reward calculation

885be16

Refactored DQN to use train_batch structure

00a8547

Merge branch 'master' into enhancement/rl_models_clean_up

0aca98d

Cleaned up VPG metrics

cfd139e

Refactore double dqn to use train_batch structure

2741c5b

Refactored noisy dqn to use train_batch structure

ad54460

Refactored per dqn to use train_batch structure

164c7b4

Updated docstrings

407ff94

mergify bot requested a review from Borda July 12, 2020 10:00

Borda changed the title ~~Enhancement/rl models clean up~~ RL models clean up Jul 12, 2020

Borda added the enhancement New feature or request label Jul 12, 2020

Borda reviewed Jul 12, 2020

View reviewed changes

pl_bolts/models/rl/dqn_model.py Outdated Show resolved Hide resolved

Borda reviewed Jul 12, 2020

View reviewed changes

Borda and others added 2 commits July 13, 2020 00:28

format

6df878f

Apply suggestions from code review

44e0006

djbyrne commented Jul 13, 2020

View reviewed changes

pl_bolts/models/self_supervised/amdim/networks.py Outdated Show resolved Hide resolved

Borda and others added 4 commits July 13, 2020 09:07

typo

4f3d164

Merge branch 'enhancement/rl_models_clean_up' of https://github.com/d…

0333ebb

…jbyrne/pytorch-lightning-bolts into enhancement/rl_models_clean_up

Fixed pep8 errors

2e18e19

Fixed flake8 errors

79e7e5c

Borda changed the base branch from master to master_RL September 8, 2020 21:22

Borda closed this Oct 7, 2020

Borda added this to Done in Reinforcement Learning Nov 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RL models clean up #112

RL models clean up #112

djbyrne commented Jul 12, 2020

pep8speaks commented Jul 12, 2020 •

edited

codecov bot commented Jul 12, 2020 •

edited

Borda Jul 12, 2020

djbyrne Jul 13, 2020

Borda Jul 13, 2020

djbyrne Jul 14, 2020

mergify bot commented Jul 29, 2020

RL models clean up #112

RL models clean up #112

Conversation

djbyrne commented Jul 12, 2020

Before submitting

What does this PR do?

PR review

Did you have fun?

pep8speaks commented Jul 12, 2020 • edited

Comment last updated at 2020-07-14 15:48:24 UTC

codecov bot commented Jul 12, 2020 • edited

Codecov Report

Borda Jul 12, 2020

Choose a reason for hiding this comment

djbyrne Jul 13, 2020

Choose a reason for hiding this comment

Borda Jul 13, 2020

Choose a reason for hiding this comment

djbyrne Jul 14, 2020

Choose a reason for hiding this comment

mergify bot commented Jul 29, 2020

pep8speaks commented Jul 12, 2020 •

edited

codecov bot commented Jul 12, 2020 •

edited