[RLlib] Minor fixes (torch GPU bugs + some cleanup). #11609
Conversation
@@ -331,7 +332,8 @@ def compute_actions(
        fetched = builder.get(to_fetch)

        # Update our global timestep by the batch size.
-       self.global_timestep += fetched[0].shape[0]
+       self.global_timestep += len(obs_batch) if isinstance(obs_batch, list) \
This may be a tensor (eager tracing) or a list.
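A minimal sketch of the distinction (a hypothetical helper, not RLlib's actual code): under eager tracing the batch arrives as a tensor/ndarray carrying `.shape[0]`, otherwise as a plain Python list where only `len()` applies.

```python
import numpy as np

def batch_size(obs_batch):
    # Hypothetical helper mirroring the fix: a plain list in normal
    # execution, a tensor/ndarray under eager tracing.
    if isinstance(obs_batch, list):
        return len(obs_batch)
    return obs_batch.shape[0]

print(batch_size([{"obs": 1}, {"obs": 2}, {"obs": 3}]))  # list -> 3
print(batch_size(np.zeros((4, 2))))                      # array -> 4
```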
@@ -28,7 +28,7 @@ def test_multi_agent_pendulum(self)
            "env": "multi_agent_pendulum",
            "stop": {
                "timesteps_total": 500000,
-               "episode_reward_mean": -300.0,
+               "episode_reward_mean": -400.0,
Make some tests run a little faster.
@@ -187,39 +187,35 @@ def test_batch_ids(self)
    def test_global_vars_update(self):
        # Allow for Unittest run.
        ray.init(num_cpus=5, ignore_reinit_error=True)
-       for fw in framework_iterator(frameworks=()):
+       for fw in framework_iterator(frameworks=("tf2", "tf")):
This was completely deactivated! frameworks=() meant there were no frameworks to iterate over, so the test body never ran.
        agent.stop()

    def test_no_step_on_init(self):
        register_env("fail", lambda _: FailOnStepEnv())
-       for fw in framework_iterator(frameworks=()):
+       for fw in framework_iterator():
same
@@ -80,7 +80,8 @@ def __init__(self,
            behaviour_policy_logits=behaviour_logits,
            target_policy_logits=target_logits,
            actions=tf.unstack(actions, axis=2),
-           discounts=tf.cast(~dones, tf.float32) * discount,
+           discounts=tf.cast(~tf.cast(dones, tf.bool), tf.float32) *
This would fail if dones are floats (0.0 or 1.0).
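To illustrate (using NumPy as a stand-in for TF, since `~` has analogous semantics): bitwise invert is undefined on float dtypes, so `~dones` throws when `dones` arrives as 0.0/1.0 floats; casting to bool before inverting makes it safe. The discount factor 0.99 below is an arbitrary example value.

```python
import numpy as np

dones = np.array([0.0, 1.0, 0.0], dtype=np.float32)  # float-typed dones

# Bitwise invert is undefined on float arrays (same failure mode as in TF):
try:
    _ = ~dones
    invert_ok = True
except TypeError:
    invert_ok = False
print("invert on floats works:", invert_ok)  # False

# The fix: cast to bool first, invert, then cast back to float.
discounts = (~dones.astype(np.bool_)).astype(np.float32) * 0.99
print(discounts)  # zero discount exactly where done == 1.0
```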
        if sess:
            expected_logp = sess.run(expected_logp)
        elif fw == "torch":
            expected_logp = expected_logp.detach().cpu().numpy()
            adv = adv.detach().cpu().numpy()
failed on GPU
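For context, a minimal illustration of the failure mode (a standalone sketch, assuming PyTorch is installed): `.numpy()` raises on a CUDA tensor, and also on any tensor that requires grad, so `.detach().cpu()` must come first. The chain is a cheap no-op on CPU-only runs, so it is safe to apply unconditionally.

```python
import torch

t = torch.tensor([1.0, 2.0], requires_grad=True)
if torch.cuda.is_available():
    t = t.to("cuda")  # becomes a device tensor when a GPU is present

# t.numpy() would raise here (CUDA tensor / requires_grad), so detach
# and move to host memory first; this also works on CPU-only machines.
arr = t.detach().cpu().numpy()
print(arr.tolist())  # [1.0, 2.0]
```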
rllib/offline/json_reader.py (outdated)
@@ -150,6 +150,7 @@ def _from_json(batch: str) -> SampleBatchType:

    if data_type == "SampleBatch":
        for k, v in data.items():
+           print("Trying to unpack {}: {}".format(k, v))  # TODO
Please remove this debug print prior to merging.
Of course. This was added while trying to catch the failing test_marwil/bc.py problem, which is related to a different compression format on the observations and will go into a different PR.
Minor fixes: torch GPU bugs + some cleanup.

Why are these changes needed?

Related issue number

Checks
- I've run scripts/format.sh to lint the changes in this PR.