Same seed different results #34

GPaolo · 2019-11-14T12:59:47Z

Hi,
Even if I set the seed, I get different results across runs.

I am attaching a small script to reproduce the issue:

import gym
import pybullet
import pybulletgym

env = gym.make('AntMuJoCoEnv-v0')

env.seed(7)

obs = env.reset()
act = env.action_space.sample()
print(act)

obs = env.reset()
act = env.action_space.sample()
print(act)

The results I obtain are:

[ 0.36626342  0.64521587 -0.06823247  0.3184726   0.49362287  0.06656755
  0.8346416   0.71423185]
[-0.88181156 -0.4371754   0.45050308  0.7611305   0.27231112 -0.43260667
 -0.3026008  -0.3178519 ]

And they change on every run.

I think this is a bug, because the result should always be the same given the same seed.
Or am I doing something wrong?


benelot · 2019-11-15T14:40:27Z

Hello! Thanks for mentioning this! Can you check if you get the same issue on the pybullet envs as well? Since the reset of the state is handled directly by pybullet through its loading and saving mechanism, I do not have any influence on the deterministic execution of different runs.

GPaolo · 2019-11-15T15:53:19Z

Just tested. To get the same results across runs, the seed also needs to be set for the action space and the observation space:

env.seed(7)
env.action_space.seed(7)
env.observation_space.seed(7)
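
A likely explanation is that each space keeps its own random generator, separate from the one the environment seeds, so env.seed() alone never reaches action_space.sample(). The toy sketch below illustrates that idea using only the standard library. ToySpace, ToyEnv and first_action are hypothetical stand-ins for illustration, not the Gym or pybulletgym classes.

```python
import random


class ToySpace:
    """Stand-in for a Gym space: owns a private random generator."""

    def __init__(self):
        self._rng = random.Random()  # unseeded, differs on every run

    def seed(self, seed):
        self._rng.seed(seed)

    def sample(self):
        return self._rng.uniform(-1.0, 1.0)


class ToyEnv:
    """Stand-in for an environment whose seed() does not touch its spaces."""

    def __init__(self):
        self._rng = random.Random()
        self.action_space = ToySpace()

    def seed(self, seed):
        # Only the environment's own generator is seeded here.
        self._rng.seed(seed)


def first_action(seed_space):
    """Simulate one fresh run and return the first sampled action."""
    env = ToyEnv()
    env.seed(7)
    if seed_space:
        env.action_space.seed(7)
    return env.action_space.sample()


# Seeding only the environment leaves the space's generator unseeded,
# so two fresh "runs" will almost certainly disagree.
print(first_action(seed_space=False), first_action(seed_space=False))

# Seeding the space as well makes the first sample repeatable.
print(first_action(seed_space=True), first_action(seed_space=True))
```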

To get the same results across resets, the seeds need to be set again before every reset. This script:

import gym
import pybullet
import pybulletgym

env = gym.make('AntPyBulletEnv-v0')

env.seed(7)
env.action_space.seed(7)
env.observation_space.seed(7)

obs = env.reset()
act = env.action_space.sample()
print(act)

env.seed(7)
env.action_space.seed(7)
env.observation_space.seed(7)

obs = env.reset()
act = env.action_space.sample()
print(act)

returns:

[-0.44954607  0.83736265 -0.20760961  0.75181586 -0.01520521  0.25760308
  0.06112269 -0.45786014]
[-0.44954607  0.83736265 -0.20760961  0.75181586 -0.01520521  0.25760308
  0.06112269 -0.45786014]

This happens with every environment I tested.

benelot · 2019-11-15T16:11:27Z

If you reset the seed in my envs every time, does that help too? If so, then we will fix this so that the seed is stored across resets.

GPaolo · 2019-11-21T14:11:55Z

Yes, I tried to set the seed before every reset with different environments from the repo and it seems to work consistently.

I also think that the .seed() method should set not only the env seed, but also the action_space and observation_space seeds. At least for consistency, given that in other Gym environments the only function I had to call to set the seed was .seed().

benelot · 2019-11-21T21:17:59Z

Ok, so we make that functionality consistent with mujoco and make it store the seed across resets.

