Fix the CSV file reward lagging way behind the actual rewards #2120

ervteng · 2019-06-11T01:37:26Z

We aren't clearing the List called self.cumulative_returns_since_policy_update when we update the policy. This is used to compute the mean rewards to write to CSV, and it just gets longer and longer through training.

This PR clears it when we update the policy.

Before the CSV file's mean rewards would lag much behind the rest of the code since this buffer was never cleared.

Clear cumulative_returns_since_policy_update

af07a7d

Before the CSV file's mean rewards would lag much behind the rest of the code since this buffer was never cleared.

ervteng changed the base branch from master to develop June 11, 2019 01:37

ervteng requested a review from xiaomaogy June 11, 2019 01:38

Merge branch 'develop' into develop-fix-csvwriting

95cca19

xiaomaogy approved these changes Jun 11, 2019

View reviewed changes

xiaomaogy merged commit c5226f6 into develop Jun 11, 2019

xiaomaogy deleted the develop-fix-csvwriting branch June 11, 2019 17:56

sankalp04 pushed a commit that referenced this pull request Jun 21, 2019

Clear cumulative_returns_since_policy_update (#2120)

28a8423

Before the CSV file's mean rewards would lag much behind the rest of the code since this buffer was never cleared.

github-actions bot locked as resolved and limited conversation to collaborators May 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix the CSV file reward lagging way behind the actual rewards #2120

Fix the CSV file reward lagging way behind the actual rewards #2120

Uh oh!

ervteng commented Jun 11, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix the CSV file reward lagging way behind the actual rewards #2120

Fix the CSV file reward lagging way behind the actual rewards #2120

Uh oh!

Conversation

ervteng commented Jun 11, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants