Added support for new PettingZoo API #751

Markus28 · 2022-09-26T18:16:48Z

Maybe someone who really knows PZ should do a sanity check here. I simply changed step to be compatible with both the old and the new step API. Removal of seeding was seemingly already anticipated in the existing code.

codecov-commenter · 2022-09-27T02:17:48Z

Codecov Report

Merging #751 (3236a9b) into master (b0c8d28) will increase coverage by 2.18%.
The diff coverage is 85.71%.

@@            Coverage Diff             @@
##           master     #751      +/-   ##
==========================================
+ Coverage   89.42%   91.60%   +2.18%     
==========================================
  Files          70       70              
  Lines        4925     4934       +9     
==========================================
+ Hits         4404     4520     +116     
+ Misses        521      414     -107

| Flag | Coverage Δ | |
|---|---|---|
| unittests | `91.60% <85.71%> (+2.18%)` | ⬆️ |

| Impacted Files | Coverage Δ | |
|---|---|---|
| tianshou/env/pettingzoo_env.py | `88.13% <85.71%> (+64.13%)` | ⬆️ |
| tianshou/data/batch.py | `98.99% <0.00%> (+0.50%)` | ⬆️ |
| tianshou/policy/base.py | `80.51% <0.00%> (+0.64%)` | ⬆️ |
| tianshou/trainer/base.py | `96.90% <0.00%> (+1.54%)` | ⬆️ |
| tianshou/policy/modelfree/dqn.py | `94.38% <0.00%> (+3.37%)` | ⬆️ |
| tianshou/trainer/utils.py | `97.05% <0.00%> (+5.88%)` | ⬆️ |
| tianshou/policy/random.py | `100.00% <0.00%> (+41.66%)` | ⬆️ |
| tianshou/policy/multiagent/mapolicy.py | `83.90% <0.00%> (+68.96%)` | ⬆️ |

WillDudley

Thanks for the PR. I'd contest your choice to change the syntax, but I'm open to being convinced.

WillDudley · 2022-09-27T13:56:13Z

tianshou/env/pettingzoo_env.py

@@ -57,7 +57,8 @@ def __init__(self, env: BaseWrapper):

    def reset(self, *args: Any, **kwargs: Any) -> Union[dict, Tuple[dict, dict]]:
        self.env.reset(*args, **kwargs)
-        observation, _, _, info = self.env.last(self)
+        last_return = self.env.last(self)
+        observation, info = last_return[0], last_return[-1]


Why have you done this rather than `observation, _, _, _, info = self.env.last(self)`? That would keep the style of Tianshou, Gym and PZ.

See my comment below

Markus28 · 2022-09-27T15:16:05Z

Yes, I completely agree with your remarks. The code was more readable before, due to the explicit labeling of the return values.

I chose to do it this way to keep backward compatibility with older PZ versions (which use the old step API) at the expense of readability.
However, I'd also be open to imposing pettingzoo>=1.21.0 and using your proposed syntax. Unfortunately, I would expect this to break some user code.

We have a similar dilemma for Gym, but I don't believe that we should impose gym>=0.26.0 since this would probably break a lot of user code and (afaik) envpool hasn't caught up with the API changes.
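To make the trade-off concrete, here is a minimal sketch of how a 4-element (old step API) and a 5-element (new step API) return value can both be handled. The helper `unpack_last` and the sample tuples are hypothetical and only for illustration; they are not part of Tianshou or PettingZoo, and mapping the old `done` flag to `terminated` is an assumption.

```python
from typing import Any, Tuple


def unpack_last(last_return: Tuple[Any, ...]) -> Tuple[Any, float, bool, bool, dict]:
    """Normalize a 4- or 5-element return value to the 5-element layout.

    Hypothetical helper: assumes the old layout is (obs, reward, done, info)
    and the new one is (obs, reward, terminated, truncated, info).
    """
    observation, reward, *flags, info = last_return
    if len(flags) == 1:
        # Old step API: a single `done` flag, treated here as termination.
        terminated, truncated = flags[0], False
    elif len(flags) == 2:
        # New step API: separate termination and truncation flags.
        terminated, truncated = flags
    else:
        raise ValueError(f"unexpected number of return values: {len(last_return)}")
    return observation, reward, terminated, truncated, info


# Made-up sample values standing in for what an environment might return.
old_style = ("obs", 1.0, True, {"api": "old"})
new_style = ("obs", 1.0, False, True, {"api": "new"})

# Positional indexing, as in the diff, picks out obs and info in both layouts.
assert (old_style[0], old_style[-1]) == ("obs", {"api": "old"})
assert (new_style[0], new_style[-1]) == ("obs", {"api": "new"})

print(unpack_last(old_style))
print(unpack_last(new_style))
```

Explicit five-way unpacking is easier to read, but it raises a `ValueError` on a 4-element tuple, which is why the index-based form stays compatible with older environments.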

WillDudley · 2022-09-27T18:51:53Z

@Markus28

Very good points. Indeed, some users may wish to use a new Tianshou feature with an old PZ env. However, I believe we do have wrappers to convert old PZ envs to new PZ envs pretty easily.

But the question is: will the rest of Tianshou's update preserve backward compatibility? Are you saying it will?

If it will, fab, but perhaps there should be warnings making users aware that:
a) the env they are using is old and may be incompatible with future updates, features & algos
b) it's easy to update the env by adding one extra line

Markus28 · 2022-09-27T20:08:21Z

@WillDudley my changes to tianshou should be backward compatible with respect to Gym and PettingZoo in the sense that you should be able to use old-style environments without any issues. However, it is not entirely compatible with respect to existing user code since my changes may (slightly) break existing code that directly interacts with the replay buffer. I would expect very few users to actually be affected, given the abstraction offered by collectors and the BasePolicy.

So for all intents and purposes, I think one can call the changes backward compatible.

I agree that deprecation warnings may be reasonable. What do you think @Trinkle23897 @pseudo-rnd-thoughts?

WillDudley · 2022-09-30T21:04:41Z

I would personally vote for a deprecation warning encouraging users to upgrade their envs with a simple wrapper (for maintenance reasons), as well as code comments explaining that the unconventional code logic is due to backward compatibility with PettingZoo<1.21.

@Markus28
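A rough sketch of what such a warning could look like, using only the standard-library `warnings` module. The function name `warn_if_old_step_api` and the message text are hypothetical, and detecting the old API by tuple length is an assumption rather than what the PR necessarily does.

```python
import warnings
from typing import Any, Tuple


def warn_if_old_step_api(last_return: Tuple[Any, ...]) -> bool:
    """Emit a DeprecationWarning if the return value has the old 4-element layout.

    Hypothetical helper: returns True when the old layout is detected.
    """
    uses_old_api = len(last_return) == 4
    if uses_old_api:
        warnings.warn(
            "The environment appears to use the old step API "
            "(PettingZoo<1.21); consider upgrading it.",
            DeprecationWarning,
            stacklevel=2,
        )
    return uses_old_api


# Record warnings so the behaviour can be inspected without relying on filters.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    warn_if_old_step_api(("obs", 0.0, False, {}))         # old layout: warns
    warn_if_old_step_api(("obs", 0.0, False, False, {}))  # new layout: silent

print(len(caught))
```

Note that Python hides `DeprecationWarning` by default outside of `__main__` and test runners, so a warning category that is shown by default (for example `UserWarning`) may be preferable if end users are meant to see it.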

WillDudley · 2022-10-01T19:49:25Z

LGTM!

Added support for new PettingZoo API

f330a00

Trinkle23897 previously approved these changes Sep 26, 2022

Merge branch 'master' into support_pz

7e224f5

WillDudley suggested changes Sep 27, 2022

Added deprecation warnings

0d08454

Markus28 dismissed Trinkle23897’s stale review via 0d08454 October 1, 2022 19:33

Added comments

0c124f8

Trinkle23897 approved these changes Oct 2, 2022

Merge branch 'master' into support_pz

3236a9b

Trinkle23897 merged commit 128feb6 into thu-ml:master Oct 2, 2022

BFAnas pushed a commit to BFAnas/tianshou that referenced this pull request May 5, 2024

Added support for new PettingZoo API (thu-ml#751)

f49a143
