atari-py is based on old version of ALE #2

sygi · 2016-06-11T09:11:42Z

The code of ALE, which is included in atari-py package is based on an old version of ALE.
One difference I observed is in the file ale_interface/src/games/supported/Breakout.cpp, which was changed in ALE nearly a year ago, but I am sure there are more discrepancies.
I feel hesitant to change this one, as then the two repos'll diverge and things will get only worse.
Could you think of the way to keep the two repositories synchronized? (e.g. by including ALE as a git submodule and possibly updating it every half a year or so)

ppwwyyxx · 2016-07-05T18:08:54Z

Agreed. It is not wise to keep an independent branch and not updating with the upstream. This will discard a lot of available options in atari. For example, ALE recently introduced the option "repeat_action_probability" that is used by some latest papers.
Prior to this year, DeepMind papers treat live lost as the end of episode to make it easier to learn, but latest papers didn't use this hack.

Same thing for doom-py, it's not tracking upstream either.

gdb · 2016-07-05T18:12:31Z

I agree there's a real and somewhat unfortunate issue with snapshotting upstream.

However, there's a deeper issue here, which is that sometimes upstream changes will causes changes in semantics. This could break comparability across an environment version.

I'm not sure what the right balance is here. Perhaps we should publish different libraries (such as "atari_py_1" or something) corresponding to frozen versions of upstream. @joschu what do you think?

joschu · 2016-07-05T21:10:10Z

I think there's no reason to update ALE, as it already works.
Doom is being actively developed, so that's more debatable. Maybe doom-py should be put in the gym codebase.

shelhamer · 2016-10-05T22:22:23Z

I have two arguments for updating ALE (at least once, but presumably every now and again):

Fixes, such as highlighted for Breakout. These may alter semantics, and so they could require a version difference and would have to be incorporated with care.
New functionality, like loading/store system state (include pseudorandomness) that is useful for experiments that need to restore past states. This in particular makes no difference in the environment semantics. I'm raising this issue b.c. I have experiments that I'd like to port from direct ALE to the gym that require the decodeState(), restoreSystemState() methods.

For mechanism, I think a submodule is reasonable. This is what I've been doing in my own projects and it hasn't been any trouble. I'd be happy to help with this switch-over. Furthermore, the submodule preserves the versioning and authorship information from upstream, and so gives full attribution to the work.

@joschu @gdb what are your latest thoughts on updating the ALE?

gdb · 2016-10-09T00:23:32Z

@shelhamer I like it. We can always keep around the current version of atari-py for comparison with the existing algorithms. (Worst case, we'll end up creating multiple atari-py packages, so it's easy to have multiple version installed at once.)

If you're up for doing the work, I'd be very happy to support you. (Happy to give you commit bits!)

Closes #2

gdb · 2016-10-09T00:47:17Z

(Started doing some work in the above commit, but it's definitely not working properly.)

fgvbrt · 2017-01-07T16:16:23Z

Hi, I also was comparing current gym ALE with last ALE when I was trying to implement planning in atari games.
Specifically I found some strange behavior when calling cloneState/saveState and loadSate/restoreState for gym's ALE. For example if you create two exact same examples of ale with repeat_action probability==0. Then in one example call cloneState/saveState then loadSate/restoreState then act and for other just act with the same action and after it compare RAM and image states, they will be different. If you do same operations for latest ALE you won't see differences.
More extreme case is when you run whole episode for both environments with same actions but for one environment you call cloneState and restoreState after each step. I did this for skiing environment with action==noop and the one where I call cloneState and restoreState always finishes episode faster.
So my conclusion was that with gym's ALE you can not do planning now.

I also fount that for latest ALE in order to obtain screen RGB you should provide buffer with size wh3, not 4 like in gym's ALE. And I noticed that pixel values are slightly different for latest ALE compared to gym's ALE, when creating two environments with the same parameters and doing same actions, RAM states in that case are the same.
Hope it helps.

shelhamer · 2017-01-07T17:04:15Z

Right, sorry for the hold-up on this. As the parts of ALE pulled into atari-py made it via another project—and discarded versioning—the fix isn't free of hassle. I'll take another stab at this soon all the same since I have experiments that make use of restoreSystemState() that currently have to make direct use of ALE.

sygi · 2017-04-06T01:40:27Z

Thanks for working on that :)
Note that it does not introduce sync-able submodules, but rather updates atari-py to the current (for now) version -- if you're reading this comment from the future, the ALE and atari-py versions may still be divergent (again).

shelhamer · 2017-04-14T18:03:20Z

See #15 to track the status of atari-py with respect to recent ALE development and the inclusion of full snapshot + restore.

gdb added a commit that referenced this issue Oct 9, 2016

Switch to an ALE submodule

9b62fa0

Closes #2

pavitrakumar78 mentioned this issue Jan 8, 2017

Model performance issue Grzego/async-rl#1

Closed

shelhamer mentioned this issue Apr 4, 2017

Spring Cleaning the Build #10

Merged

joschu closed this as completed in #10 Apr 6, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

atari-py is based on old version of ALE #2

atari-py is based on old version of ALE #2

sygi commented Jun 11, 2016 •

edited

Loading

ppwwyyxx commented Jul 5, 2016 •

edited

Loading

gdb commented Jul 5, 2016

joschu commented Jul 5, 2016

shelhamer commented Oct 5, 2016 •

edited

Loading

gdb commented Oct 9, 2016

gdb commented Oct 9, 2016

fgvbrt commented Jan 7, 2017 •

edited

Loading

shelhamer commented Jan 7, 2017

sygi commented Apr 6, 2017 •

edited

Loading

shelhamer commented Apr 14, 2017

atari-py is based on old version of ALE #2

atari-py is based on old version of ALE #2

Comments

sygi commented Jun 11, 2016 • edited Loading

ppwwyyxx commented Jul 5, 2016 • edited Loading

gdb commented Jul 5, 2016

joschu commented Jul 5, 2016

shelhamer commented Oct 5, 2016 • edited Loading

gdb commented Oct 9, 2016

gdb commented Oct 9, 2016

fgvbrt commented Jan 7, 2017 • edited Loading

shelhamer commented Jan 7, 2017

sygi commented Apr 6, 2017 • edited Loading

shelhamer commented Apr 14, 2017

sygi commented Jun 11, 2016 •

edited

Loading

ppwwyyxx commented Jul 5, 2016 •

edited

Loading

shelhamer commented Oct 5, 2016 •

edited

Loading

fgvbrt commented Jan 7, 2017 •

edited

Loading

sygi commented Apr 6, 2017 •

edited

Loading