[RLlib] IMPALA (any tf algo using entropy coeff) slowdown observed #15936

sven1977 · 2021-05-20T07:16:38Z

Since ray 1.2, IMPALA (and any other tf algorithm that uses entropy_coeff) has slowed down due to a bug.

The bug is caused by a tf-(static graph)-op being added to the graph each time we call on_global_var_update (defined inside the EntropyCoeffSchedule (tf) mixin class).

To reproduce:

rllib train -f rllib/tuned_examples/impala/pong-impala-fast.yaml

Runs much slower than in ray<=1.1.

What is the problem?

Ray version and other system information (Python version, TensorFlow version, OS):

Reproduction (REQUIRED)

Please provide a short code snippet (less than 50 lines if possible) that can be copy-pasted to reproduce the issue. The snippet should have no external library dependencies (i.e., use fake or mock data / environments):

If the code snippet cannot be run by itself, the issue will be closed with "needs-repro-script".

I have verified my script runs in a clean environment and reproduces the issue.
I have verified the issue also occurs with the latest wheels.

The text was updated successfully, but these errors were encountered:

sven1977 added bug Something that is supposed to be working; but isn't release-blocker P0 Issue that blocks the release P0 Issue that must be fixed in short order rllib RLlib related issues labels May 20, 2021

sven1977 self-assigned this May 20, 2021

sven1977 mentioned this issue May 20, 2021

[RLlib] Entropy coeff schedule bug fix and git bisect script. #15937

Merged

6 tasks

sven1977 closed this as completed in #15937 May 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] IMPALA (any tf algo using entropy coeff) slowdown observed #15936

[RLlib] IMPALA (any tf algo using entropy coeff) slowdown observed #15936

sven1977 commented May 20, 2021 •

edited

[RLlib] IMPALA (any tf algo using entropy coeff) slowdown observed #15936

[RLlib] IMPALA (any tf algo using entropy coeff) slowdown observed #15936

Comments

sven1977 commented May 20, 2021 • edited

What is the problem?

Reproduction (REQUIRED)

sven1977 commented May 20, 2021 •

edited