Rename self.evaluate for off-policy algos #1139

zequnyu · 2020-01-19T20:00:19Z

Also removes the duplicate performance recording left over after evaluate_performance() was introduced.

ryanjulian

See my comment about deleting this bool altogether. It would make these classes less likely to suffer a logic error in the future, because storing conditionals is dangerous (it makes your class a state machine).

Can you make the members in these classes _private while you're here?

ryanjulian · 2020-01-19T21:04:35Z

src/garage/np/algos/off_policy_rl_algorithm.py

@@ -65,7 +65,7 @@ def __init__(self,
        self.min_buffer_size = min_buffer_size
        self.rollout_batch_size = rollout_batch_size
        self.reward_scale = reward_scale
-        self.evaluate = False
+        self._first_buffer_size_steps_done = False


This tracks whether the replay buffer holds at least min_buffer_size transitions, right? So why not just test that directly?

self.replay_buffer.n_transitions_stored >= self.min_buffer_size

Perhaps the original author was concerned about having such a long conditional everywhere? If so, you can add a helper property:

@property
def _buffer_prefilled(self):
    return self.replay_buffer.n_transitions_stored >= self.min_buffer_size
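A minimal sketch of the suggestion above, to show why a derived property is safer than a stored bool. `FakeReplayBuffer` and `OffPolicyAlgoSketch` are hypothetical stand-ins written for this illustration, not garage's actual classes; only `n_transitions_stored`, `min_buffer_size`, and `_buffer_prefilled` come from the discussion. The members are written as `_private` here as an assumption, following the request in the review summary.

```python
class FakeReplayBuffer:
    """Hypothetical stand-in for a replay buffer (not garage's real class)."""

    def __init__(self):
        self._transitions = []

    def add_transition(self, transition):
        self._transitions.append(transition)

    def clear(self):
        self._transitions.clear()

    @property
    def n_transitions_stored(self):
        return len(self._transitions)


class OffPolicyAlgoSketch:
    """Illustrative skeleton: the condition is computed, never stored."""

    def __init__(self, replay_buffer, min_buffer_size):
        self._replay_buffer = replay_buffer
        self._min_buffer_size = min_buffer_size

    @property
    def _buffer_prefilled(self):
        # Always reflects the buffer's current size, so it cannot go stale.
        return (self._replay_buffer.n_transitions_stored
                >= self._min_buffer_size)


buffer = FakeReplayBuffer()
algo = OffPolicyAlgoSketch(buffer, min_buffer_size=3)

before = algo._buffer_prefilled       # buffer is empty
for step in range(3):
    buffer.add_transition(step)
after_fill = algo._buffer_prefilled   # buffer now holds 3 transitions
buffer.clear()
after_clear = algo._buffer_prefilled  # buffer emptied again
```

A stored flag set to True after the first fill would still read True after `buffer.clear()`, whereas the property follows the buffer's actual state.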

codecov · 2020-01-20T01:54:55Z

Codecov Report

Merging #1139 into master will increase coverage by 0.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master    #1139      +/-   ##
==========================================
+ Coverage   87.98%   87.99%   +0.01%     
==========================================
  Files         184      184              
  Lines        8780     8771       -9     
  Branches     1108     1108              
==========================================
- Hits         7725     7718       -7     
+ Misses        856      854       -2     
  Partials      199      199

Impacted Files	Coverage Δ
src/garage/np/algos/off_policy_rl_algorithm.py	`96.77% <100%> (+0.05%)`	⬆️
src/garage/tf/algos/ddpg.py	`93% <100%> (-0.15%)`	⬇️
src/garage/torch/algos/ddpg.py	`93.87% <100%> (-0.25%)`	⬇️
src/garage/tf/algos/dqn.py	`97.77% <100%> (-0.08%)`	⬇️
src/garage/misc/tensor_utils.py	`93.25% <0%> (+2.24%)`	⬆️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update e226e24...ccf70c5.

ryanjulian · 2020-02-17T04:04:07Z

@krzentner is this PR still relevant?

zequnyu requested a review from a team as a code owner January 19, 2020 20:00

ryanjulian approved these changes Jan 19, 2020

ryanjulian requested a review from a team January 19, 2020 21:07

ghost requested review from cbfinn and gitanshu January 19, 2020 21:07

zequnyu force-pushed the rename_self_evaluate branch from 66fc298 to 7d6bf26 Compare January 20, 2020 01:54

zequnyu requested review from ahtsan and gitanshu and removed request for cbfinn, gitanshu and a team January 20, 2020 01:56

ryanjulian approved these changes Jan 20, 2020

ryanjulian removed the request for review from gitanshu January 20, 2020 02:07

ahtsan approved these changes Jan 20, 2020

Rename self.evaluate for off-policy algos

ccf70c5

- also does duplicate logging removal

zequnyu force-pushed the rename_self_evaluate branch from 7d6bf26 to ccf70c5 Compare March 6, 2020 20:55

zequnyu added the ready-to-merge label Mar 6, 2020

mergify bot merged commit 8394f0c into master Mar 6, 2020

mergify bot deleted the rename_self_evaluate branch March 6, 2020 23:54

zequnyu mentioned this pull request Mar 7, 2020

Refactor self.evaluate to class property #1125

Closed
