[rllib] Fix LSTM regression on truncated sequences and add regression test #2898

ericl · 2018-09-17T20:37:06Z

What do these changes do?

#2700 regressed LSTM performance by erroneously zeroing out the initial state. This breaks training with truncated sequence lengths (e.g., the tuned pong example).

Incidentally, I also fixed an issue where Pong-ram doesn't pick up the right preprocessor by default.

Testing

Before this fix, the pong a3c example is completely broken. I checked that learning does happen after this fix.

Also added a stateless cartpole env test thanks to @richard4912 , and verified that that test fails before this fix.

AmplabJenkins · 2018-09-17T21:46:48Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/8268/
Test FAILed.

AmplabJenkins · 2018-09-18T02:14:33Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/8271/
Test FAILed.

AmplabJenkins · 2018-09-18T05:07:49Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/8272/
Test PASSed.

richardliaw · 2018-09-18T20:48:02Z

python/ray/rllib/examples/cartpole_lstm.py

@@ -0,0 +1,179 @@
+"""Stateless variant of the CartPole gym environment.


Stateless..? Might be the wrong word to use here..

Partially observed?

AmplabJenkins · 2018-09-18T22:09:31Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/8282/
Test FAILed.

ericl added 4 commits September 17, 2018 13:29

fix

28cfb66

add test

8208e04

yapf

fd391dd

yapf

3ccd219

ericl assigned richardliaw Sep 17, 2018

fix space

dd1b0b1

Oops that should be lstm: True

8de0f53

richardliaw approved these changes Sep 18, 2018

View reviewed changes

Update cartpole_lstm.py

e04b660

ericl merged commit 3a3782c into ray-project:master Sep 18, 2018

ericl added this to Done in RLlib Sep 19, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[rllib] Fix LSTM regression on truncated sequences and add regression test #2898

[rllib] Fix LSTM regression on truncated sequences and add regression test #2898

ericl commented Sep 17, 2018

AmplabJenkins commented Sep 17, 2018

AmplabJenkins commented Sep 18, 2018

AmplabJenkins commented Sep 18, 2018

richardliaw Sep 18, 2018 •

edited

ericl Sep 18, 2018

AmplabJenkins commented Sep 18, 2018

		@@ -0,0 +1,179 @@
		"""Stateless variant of the CartPole gym environment.

[rllib] Fix LSTM regression on truncated sequences and add regression test #2898

[rllib] Fix LSTM regression on truncated sequences and add regression test #2898

Conversation

ericl commented Sep 17, 2018

What do these changes do?

Testing

AmplabJenkins commented Sep 17, 2018

AmplabJenkins commented Sep 18, 2018

AmplabJenkins commented Sep 18, 2018

richardliaw Sep 18, 2018 • edited

Choose a reason for hiding this comment

ericl Sep 18, 2018

Choose a reason for hiding this comment

AmplabJenkins commented Sep 18, 2018

richardliaw Sep 18, 2018 •

edited