-
Notifications
You must be signed in to change notification settings - Fork 187
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fully deterministic runs #43
Comments
Hi, are you asking for fully deterministic runs? I haven't paid much attention to this but I think the agent is already fully deterministic, so you'd probably just have to set the environment seed (make sure if you use more than 1 environment instance, that the environments have different seeds so they produce different data). |
Okay I will try that. Thank you for the quick response! What exactly is an "environment instance"? I couldn't find a clear definition in the paper. |
Also, how many seeds were the non-Minecraft experiments run for? |
+1 on the question above. Maybe it's not that apparent in the paper, could you also provide some clarification on what the confidence intervals denote in the non-minecraft experiments (DMLab, DMC Proprio, Crafter, etc)? Is it std-error across multiple seeds, or std-error across a window of timesteps with a single seed, or something else? |
It's mean/std across seeds and at least 3 seeds per task, often more. |
Update: I seeded dmc_control here. And still got non-deterministic runs. Are there other non-environment sources of randomness not seeded? |
I don't think those two methods are run ever. Could you check e.g. by adding |
Seeding this it removes randomness from the first 1000 steps, but runs are non-deterministic afterwards. |
Awesome repo. quick question,
I ran the DMC WalkerWalk experiment 3 different times with the same seeds and got 3 different learning curves. How can I get reproducible experiments?Awesome repo. quick question,
I ran the DMC WalkerWalk experiment 3 different times with the same seeds and got 3 different learning curves. How can I get reproducible experiments?
The text was updated successfully, but these errors were encountered: