[RLlib] - Get and set states in `MultiAgentEpisode` and `SingleAgentEpisode` #45012

simonsays1980 · 2024-04-27T11:39:19Z

Why are these changes needed?

This PR adds `get_state` and `from_state` to `MultiAgentEpisode` and `SingleAgentEpisode`. This is needed for checkpointing `EpisodeReplayBuffer` objects.
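
To illustrate why a state getter and a matching constructor help with checkpointing, here is a minimal sketch of the round trip. `TinyEpisode`, its fields, and `checkpoint_roundtrip` are hypothetical stand-ins made up for this example; they are not RLlib's classes, and the real episodes carry far more data.

```python
import pickle
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class TinyEpisode:
    """Hypothetical stand-in for an episode (NOT RLlib's class)."""

    id_: str
    observations: List[Any] = field(default_factory=list)
    rewards: List[float] = field(default_factory=list)
    is_terminated: bool = False

    def get_state(self) -> Dict[str, Any]:
        # Copy the containers so later steps do not mutate the saved state.
        return {
            "id_": self.id_,
            "observations": list(self.observations),
            "rewards": list(self.rewards),
            "is_terminated": self.is_terminated,
        }

    @staticmethod
    def from_state(state: Dict[str, Any]) -> "TinyEpisode":
        # The state must be complete: every field is read back from it.
        return TinyEpisode(
            id_=state["id_"],
            observations=list(state["observations"]),
            rewards=list(state["rewards"]),
            is_terminated=state["is_terminated"],
        )


def checkpoint_roundtrip(episode: TinyEpisode) -> TinyEpisode:
    """Serializes the episode state with pickle and rebuilds the episode."""
    blob = pickle.dumps(episode.get_state())
    return TinyEpisode.from_state(pickle.loads(blob))
```

A buffer that holds many such episodes could checkpoint itself by collecting the state of each episode, which is presumably the use case this PR targets.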

Related issue number

Checks

- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a
    method in Tune, I've added it in doc/source/tune/api/ under the
    corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

Signed-off-by: Simon Zehnder <simon.zehnder@gmail.com>


sven1977 · 2024-04-29T09:06:24Z

rllib/algorithms/ppo/ppo_learner.py

@@ -44,7 +44,7 @@ def build(self) -> None:
        # Note that the KL coeff is not controlled by a Scheduler, but seeks
        # to stay close to a given kl_target value in our implementation of
        # `self.additional_update_for_module()`.
-        self.curr_kl_coeffs_per_module: Dict[ModuleID, Scheduler] = LambdaDefaultDict(
+        self.curr_kl_coeffs_per_module: Dict[ModuleID, TensorType] = LambdaDefaultDict(


sven1977 · 2024-04-29T09:07:22Z

rllib/env/multi_agent_episode.py

@@ -1704,50 +1704,84 @@ def get_state(self) -> Dict[str, Any]:
        return list(
            {
                "id_": self.id_,
-                "agent_ids": self.agent_ids,
+                "agent_to_module_mapping_fn": self.agent_to_module_mapping_fn,


Actually, let's make the state a dict. States should always be Dict[str, Any]. I think this is a leftover from the early DreamerV3 days :)
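
As a rough illustration of the difference: `list({...}.items())` yields a list of `(key, value)` tuples, which can only be searched by position, while a dict gives direct lookup by key. The helper below is a hypothetical sketch (the name `as_state_dict` and the sample keys are assumptions, not part of RLlib) that normalizes either form to a `Dict[str, Any]`.

```python
from typing import Any, Dict, Iterable, Mapping, Tuple, Union

State = Dict[str, Any]


def as_state_dict(
    state: Union[Mapping[str, Any], Iterable[Tuple[str, Any]]]
) -> State:
    """Returns a plain dict for a mapping or a list of (key, value) pairs."""
    # `dict()` accepts both mappings and iterables of 2-tuples.
    state_dict = dict(state)
    bad_keys = [key for key in state_dict if not isinstance(key, str)]
    if bad_keys:
        raise TypeError(f"State keys must be strings, got: {bad_keys!r}")
    return state_dict


# List-of-tuples form, as produced by `list({...}.items())`.
legacy = list({"id_": "ep-0", "agent_ids": ["a0", "a1"]}.items())

# Dict form: values are addressed by name instead of by position.
state = as_state_dict(legacy)
episode_id = state["id_"]
```

With the dict form, a `from_state` implementation can read `state["id_"]` directly and does not depend on the order in which entries were written.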

sven1977 · 2024-04-29T09:07:40Z

rllib/env/multi_agent_episode.py

            }.items()
        )

    @staticmethod
-    def from_state(state) -> None:
+    def from_state(state) -> "MultiAgentEpisode":


Same here, please add a type hint: `state: Dict[str, Any]`

sven1977 · 2024-04-29T09:07:52Z

rllib/env/multi_agent_episode.py

        """Creates a multi-agent episode from a state dictionary.

        See `MultiAgentEpisode.get_state()` for creating a state for
        a `MultiAgentEpisode` pickable state. For recreating a
        `MultiAgentEpisode` from a state, this state has to be complete,
        i.e. all data must have been stored in the state.
+
+        Args:
+            state: A list of tuples containing all data required to recreate


same here, let's make this a dict.

sven1977 · 2024-04-29T09:08:44Z

rllib/env/single_agent_episode.py

@@ -1643,6 +1643,75 @@ def agent_steps(self) -> int:
        """
        return self.env_steps()

+    def get_state(self) -> list:


same here: dict

sven1977 · 2024-04-29T09:09:06Z

rllib/utils/replay_buffers/tests/test_multi_agent_episode_replay_buffer.py

@@ -342,6 +342,24 @@ def test_sample_with_modules_to_sample(self):
                # Assert that all n-steps are 1.0 as passed into `sample`.
                self.assertTrue(np.all(n_steps - 1.0 < tolerance))

+    # def test_get_state_and_set_state(self):


nit: Remove?

sven1977

Looks good. Just needs the changes from list to dict, then can be merged. :)
Thanks @simonsays1980 !


simonsays1980 added 10 commits September 8, 2023 15:22

- cbfd05f: Initiated MARWIL RL Module and added catalog, learner and tf_learner.
- c488da7: Added MARWIL RL Module and started to write test.
- 9078af8: Merge branch 'master' into marwil-rl-module
- b4e1795: Implemented Torch version of MARWIL.
- 5eeb2e6: Added torch learner.
- d331088: Merge branch 'master' into ma-episode-replay-buffer-sample-dict
- d91508c: Checked out MARWIL folder from master and removed local ones. Added 'get_state' and 'from_state' to 'SingleAgentEpisode' together with test.
- 5b73fd2: Replaced unittest assertions with RLlib's 'check' function.
- 2e3bedb: Added 'get_state' and 'from_state' to 'MultiAgentEpisode' together with a test.
- afb19b0: LINTER.

simonsays1980 self-assigned this Apr 27, 2024

simonsays1980 added the rllib (RLlib related issues) and rllib-newstack labels Apr 27, 2024

- ddc4912: Merge branch 'master' into ma-episode-replay-buffer-sample-dict

sven1977 reviewed Apr 29, 2024

sven1977 approved these changes Apr 29, 2024

sven1977 marked this pull request as ready for review April 29, 2024 09:10

sven1977 requested review from avnishn, ArturNiederfahrenhorst, maxpumperla and kouroshHakha as code owners April 29, 2024 09:10

simonsays1980 added 3 commits April 29, 2024 11:23

- 04581aa: Changed episode states from list to dict.
- 408e2b6: Merge branch 'master' into ma-episode-replay-buffer-sample-dict
- 148eb0b: Merge branch 'master' into ma-episode-replay-buffer-sample-dict

sven1977 merged commit a45bfe3 into ray-project:master Apr 30, 2024
5 checks passed

harborn pushed a commit to harborn/ray that referenced this pull request May 8, 2024

- 287ecc0: [RLlib] - Get and set states in `MultiAgentEpisode` and `SingleAgentEpisode`. (ray-project#45012)

ryanaoleary pushed a commit to ryanaoleary/ray that referenced this pull request Jun 7, 2024

- 64ede15: [RLlib] - Get and set states in `MultiAgentEpisode` and `SingleAgentEpisode`. (ray-project#45012)
