Use eval_max_episode lengths in evaluation #1853
Conversation
This commit fixes a bug where, when obtaining eval trajectories with SAC, max_eval_episode_length isn't actually used by the algorithm or anywhere else.
@@ -45,7 +45,7 @@ class SAC(RLAlgorithm):
        environment that the agent is being trained in. Usually accessable
        by calling env.spec.
    max_episode_length (int): Max path length of the algorithm.
this description doesn't make much sense -- can you make it longer?
@@ -94,7 +94,7 @@ def __init__(
        replay_buffer,
        *,  # Everything after this is numbers.
        max_episode_length,
-       max_eval_path_length=None,
+       max_eval_episode_length=None,
Suggested change:
- max_eval_episode_length=None,
+ max_episode_length_eval=None,
For both of these, please explain what happens if the environment has a max_episode_length which is less than these numbers.
@@ -94,7 +94,7 @@ def __init__(
        replay_buffer,
        *,  # Everything after this is numbers.
        max_episode_length,
Suggested change:
- max_episode_length,
+ max_episode_length_explore,
if i don't set it, does it use the max_episode_length
from the environment?
no, it defaults to 1000. I had done this when we were only doing apples-to-apples comparisons with the paper. A less confusing behavior would be to use max_episode_length_explore
i agree with that as long as max_episode_length_explore
defaults to env.spec.max_episode_length
In general, I think people shouldn't interact with these unless they have very specific needs (e.g. your baseline comparisons); for instance, they wouldn't appear in an example script.
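The defaulting behavior agreed on above can be sketched as follows. This is a hypothetical illustration of the discussed semantics, not garage's actual implementation; the class and the `env_spec` attribute names here are assumptions for the sake of the example:

```python
class SACSketch:
    """Sketch of episode-length defaulting for an off-policy algorithm.

    If the caller doesn't pass max_episode_length, it falls back to the
    environment's own limit (env_spec.max_episode_length), and the eval
    length falls back to the exploration length unless explicitly set.
    """

    def __init__(self, env_spec, *, max_episode_length=None,
                 max_episode_length_eval=None):
        # Default the exploration length to the environment's limit.
        if max_episode_length is None:
            max_episode_length = env_spec.max_episode_length
        # Default the eval length to the exploration length.
        if max_episode_length_eval is None:
            max_episode_length_eval = max_episode_length
        self.max_episode_length = max_episode_length
        self.max_episode_length_eval = max_episode_length_eval
```

With these defaults, a user who sets neither argument gets the environment's own episode limit for both exploration and evaluation, and only baseline-comparison use cases need to override them.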