Python Bug: lambda function refers only one environment #1155

maguro27 · 2024-05-28T13:55:06Z

I have marked all applicable categories:
- exception-raising bug
- RL algorithm bug
- documentation request (i.e. "X is missing from the documentation.")
- new feature request
- design request (i.e. "X should be changed to Y.")
I have visited the source website
I have searched through the issue tracker for duplicates

I have mentioned version numbers, operating system and environment, where applicable:

import tianshou, gymnasium as gym, torch, numpy, sys
print(tianshou.__version__, gym.__version__, torch.__version__, numpy.__version__, sys.version, sys.platform)
0.5.1 0.29.1 2.3.0a0+40ec155e58.nv24.03 1.24.4 3.10.12 (main, Nov 20 2023, 15:14:05) [GCC 11.4.0] linux

The tutorial repeatedly uses lambda functions to create callable environment functions.
However, I ran into what looks like a Python bug (Python 3.10.12) when using user-defined environments, as shown below:

train_envs: list[gym.Env] = making_my_env() # e.g., len(train_envs) == 4
ts_train_envs = DummyVectorEnv([lambda: env for env in train_envs])
tmp_envs = [lambda: env for env in train_envs]

for env in tmp_envs:
    print(env().reset())
for env in ts_train_envs._env_fns:
    print(env().reset())

Then, I get the return values of the same environment four times instead of one result per environment.
Hence, I fixed the above code as follows:

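from typing import Callable

# Binds `env` as a function argument, so each returned callable keeps its own environment.
def callable_env(env: gym.Env) -> Callable[[], gym.Env]: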
    def _callable_env() -> gym.Env:
        return env

    return _callable_env

train_envs: list[gym.Env] = making_my_env() # e.g., len(train_envs) == 4
ts_train_envs = DummyVectorEnv([callable_env(env) for env in train_envs])
tmp_envs = [callable_env(env) for env in train_envs]

for env in tmp_envs:
    print(env().reset())
for env in ts_train_envs._env_fns:
    print(env().reset())

This works properly.
In conclusion, I suggest that the tutorial should not use the lambda function.

MischaPanch · 2024-05-29T16:26:22Z

You are using a very old version of tianshou. Could you please try either 1.0.0 or the version on master?

maguro27 · 2024-05-31T12:38:53Z

@MischaPanch
I updated the Python and tianshou versions.

1.0.0 0.28.1 2.3.0+cu121 1.24.4 3.11.9 (main, Apr 6 2024, 17:59:24) [GCC 9.4.0] linux

However, this lambda function issue remains.

dantp-ai · 2024-06-03T18:53:47Z

Hi @maguro27,

It seems that env is not looked up until the lambda function is called, but by the end of the loop, env is bound to the last element in the list, hence you get the last environment four times. You can read more about it and closures with lambdas here.

This should now work as expected since the default value for env is evaluated when the lambda function is defined:

ts_train_envs = DummyVectorEnv([lambda env=env: env for env in train_envs])
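
The effect can be reproduced without Tianshou. Below is a minimal sketch that uses plain strings as stand-ins for environments; the names `envs`, `late`, and `bound` are illustrative only and do not come from the tutorial:

```python
# Stand-ins for environments; any objects would behave the same way.
envs = ["env0", "env1", "env2", "env3"]

# Late binding: each lambda looks up `env` only when it is called,
# after the comprehension has finished, so all of them see the last element.
late = [lambda: env for env in envs]

# Early binding: the default value is evaluated when each lambda is defined.
bound = [lambda env=env: env for env in envs]

late_results = [fn() for fn in late]
bound_results = [fn() for fn in bound]
print(late_results)   # ['env3', 'env3', 'env3', 'env3']
print(bound_results)  # ['env0', 'env1', 'env2', 'env3']
```

The default-argument form and a factory function such as `callable_env` both capture the value at definition time, so either should give one distinct environment per callable.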

Which Tianshou tutorial are you looking at?

maguro27 · 2024-06-07T13:51:59Z

@dantp-ai

Thank you for your comments.
I see now that I misunderstood the behavior of the lambda function.

The Tianshou tutorials only use the default gym environments, which is why I misunderstood the behavior.
Therefore, I think the maintainers might want to add guidance on using custom environments (e.g., register the environment with the gymnasium register function and then use it, or use "lambda env=env: env").
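
Another way to avoid the closure pitfall with custom environments may be to pass factories that build a fresh environment, rather than wrapping pre-built instances. Below is a rough sketch using `functools.partial`. `MyEnv` and its `seed` parameter are hypothetical stand-ins for a custom environment class, and whether the resulting list suits a particular vectorized environment is an assumption not verified here:

```python
from functools import partial


class MyEnv:
    """Hypothetical stand-in for a custom gymnasium environment."""

    def __init__(self, seed: int) -> None:
        self.seed = seed


# partial binds `seed` immediately, so each factory builds its own environment.
env_fns = [partial(MyEnv, seed=s) for s in range(4)]

seeds = [fn().seed for fn in env_fns]
print(seeds)  # [0, 1, 2, 3]
```

Because each call to a factory constructs a new object, no two workers would end up sharing one environment instance.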
