break the dependency on the gym-http-api server -- call the gym directly #6

stites · 2017-05-18T16:22:43Z

one option might be to call the gym directly with call-python-via-msgpack:
https://github.com/nh2/call-python-via-msgpack

this would speed up performance considerably and would make me feel more comfortable about uploading to hackage.

stites · 2017-08-28T16:01:12Z

This can be condensed down to a simple Main.hs example file off the top of my head (quasiquotes or strings both would be fine):

main :: IO ()
main = do
  startpython                          -- maybe we need some optional kick-off process
  callpython    [| import gym |]       -- can make effectful imports
  callpython    [| counter = [0,1] |]  -- can initialize statically
  callpython    [| counter |]          -- callpython can be stupid
  c <- returnpy [| counter |]          -- some kind of way to "get stuff out"
  print c                              -- prints '[0,1]', stringly-typed is fine
  let a = show 1                       -- can take a haskell variable...
  callpython    [| counter[1] += a |]  -- ...and do something with it
  c <- returnpy [| counter |]          -- and changes can still be extracted
  print c                              -- prints '(0,2)', our final output

I'm showing quasiquotes because dealing with template haskell won't be so bad since we can section this off into a reinforce-environments-gym package -- also, i've heard that there is a library that can do python interop with QQs. Just using strings (or something smarter) and dodging template haskell compile issues would be nice, too : P

stites · 2017-10-01T00:09:03Z

small update on this -- turn out that the gym itself isn't compatible with cython, so I am guessing we'll be stuck with this dependency for longer than expected. I've split out the gym code into the reinforce-environments-gym in the meantime. I opened up #20, which has to be done anyhow, and I think it might be more prudent to wait on this submodule.

KiaraGrouwstra · 2018-06-05T07:09:43Z

fwiw I looked into openai/retro for the contest -- I managed to make that go over the gym-http-api wire, but Python JSON serialization of emulator observations obliterated performance. Looking into how openai/retro was implemented, it looks getting that into Haskell seems a matter of porting over one python file or so.

stites · 2018-06-05T15:26:50Z

Awesome! Yeah, every now and then I start porting over a environment from the toy problems and classic envs. Ideally reinforce natively implements its own emulators and drops the gym-http-api dependency entirely (which seems like a really flawed way to manage language bindings).

KiaraGrouwstra · 2018-06-05T19:33:17Z

Yeah, that'd be great.
What alternative would you have suggested over an HTTP API? Cuz porting directly would imply effort for each env/language combination. :/

stites · 2018-06-06T01:23:52Z

Porting directly used to be the plan-of-action. I was also thinking that haskell could just bypass the gym and call https://github.com/mgbellemare/Arcade-Learning-Environment directly.

stites · 2018-06-06T01:24:21Z

Basically any C++ gym alternatives are fair game, so long as they extern to C.

KiaraGrouwstra · 2018-06-06T17:16:00Z

Honestly, with Retro I think they did a great job in terms of making the Python little but a thin wrapper over the C++. But yeah, C++ envs consumed by FFI does sound like an interesting compromise.

That said, it's probably mostly that using other languages than Python is considered unusual in ML now.

So... either Haskell isn't justified here (matching your debugging issues), or it's better and we're gonna have to convince more people of that.

But yeah, considering the likes of TF/PyTorch all prioritize Python...

stites added the help wanted label May 18, 2017

stites changed the title ~~benchmark gym-http-api with call-python-via-msgpack~~ break the dependency on the gym-http-api server -- call the gym directly Aug 1, 2017

stites added this to the v0.1.0 - Hackage-ready milestone Aug 1, 2017

stites added good-first-issue blocker labels Aug 1, 2017

stites mentioned this issue Mar 1, 2019

Convert Q-learning (and/or Sarsa) into an example agent in reinforce-zoo #10

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

break the dependency on the gym-http-api server -- call the gym directly #6

break the dependency on the gym-http-api server -- call the gym directly #6

stites commented May 18, 2017 •

edited

Loading

stites commented Aug 28, 2017 •

edited

Loading

stites commented Oct 1, 2017 •

edited

Loading

KiaraGrouwstra commented Jun 5, 2018

stites commented Jun 5, 2018 •

edited

Loading

KiaraGrouwstra commented Jun 5, 2018

stites commented Jun 6, 2018

stites commented Jun 6, 2018

KiaraGrouwstra commented Jun 6, 2018

break the dependency on the gym-http-api server -- call the gym directly #6

break the dependency on the gym-http-api server -- call the gym directly #6

Comments

stites commented May 18, 2017 • edited Loading

stites commented Aug 28, 2017 • edited Loading

stites commented Oct 1, 2017 • edited Loading

KiaraGrouwstra commented Jun 5, 2018

stites commented Jun 5, 2018 • edited Loading

KiaraGrouwstra commented Jun 5, 2018

stites commented Jun 6, 2018

stites commented Jun 6, 2018

KiaraGrouwstra commented Jun 6, 2018

stites commented May 18, 2017 •

edited

Loading

stites commented Aug 28, 2017 •

edited

Loading

stites commented Oct 1, 2017 •

edited

Loading

stites commented Jun 5, 2018 •

edited

Loading