[RLlib] Tf-eager policy bug fix: Duplicate model call in compute_gradients. #12682

sven1977 · 2020-12-08T21:17:39Z

This PR fixes a bug in the EagerTFPolicy:

self._compute_gradients performed an unnecessary model call before calling the loss function, so the model was called twice per gradient computation. This PR simply removes that extra call.
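To illustrate the kind of duplication described above, here is a minimal, dependency-free sketch. It is not RLlib's code: `CountingModel`, `loss_fn`, `compute_loss_before_fix`, and `compute_loss_after_fix` are hypothetical names, and the sketch assumes (as the PR title suggests) that the loss function runs its own forward pass, which makes an earlier model call redundant.

```python
class CountingModel:
    """Hypothetical stand-in for a policy model; counts forward passes."""

    def __init__(self):
        self.calls = 0

    def __call__(self, batch):
        self.calls += 1
        # Toy "forward pass": double every observation.
        return [2.0 * x for x in batch["obs"]]


def loss_fn(model, batch):
    # Assumption: the loss function performs its own forward pass.
    out = model(batch)
    errors = [(o - t) ** 2 for o, t in zip(out, batch["targets"])]
    return sum(errors) / len(errors)


def compute_loss_before_fix(model, batch):
    # Redundant forward pass: the result is discarded, and loss_fn
    # calls the model again anyway.
    model(batch)
    return loss_fn(model, batch)


def compute_loss_after_fix(model, batch):
    # Only the loss function calls the model.
    return loss_fn(model, batch)


if __name__ == "__main__":
    batch = {"obs": [1.0, 2.0], "targets": [2.0, 5.0]}
    before, after = CountingModel(), CountingModel()
    loss_before = compute_loss_before_fix(before, batch)
    loss_after = compute_loss_after_fix(after, batch)
    print(before.calls, after.calls, loss_before == loss_after)
```

In this toy setup both variants return the same loss, but the "before" variant pays for two forward passes instead of one. With a stateless model that is only wasted compute; whether the extra call had other side effects in the real policy is not stated in this PR.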

- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/

Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

WIP.

988f17c

sven1977 requested a review from ericl December 8, 2020 21:17

sven1977 assigned ericl Dec 8, 2020

ericl approved these changes Dec 8, 2020

ericl added the @author-action-required label ("The PR author is responsible for the next step. Remove tag to send back to the reviewer.") Dec 8, 2020

sven1977 merged commit 28108c9 into ray-project:master Dec 9, 2020

sven1977 deleted the tf_eager_bug_fix_duplicate_model_call branch June 2, 2023 20:13