Skip to content

Conversation

@ervteng
Copy link
Contributor

@ervteng ervteng commented Nov 18, 2020

Proposed change(s)

In the previous action clipping PR, actions were being clipped too late (in evaluate) and the BC module didn't receive the clipped actions from the Policy. This PR moves action clipping to sample_actions, which also fixes BC module + continuous actions.

We also compare learning rate to a small epsilon that will allow the BC module to stop training (taking time) when learning rate reaches a low value.

Types of change(s)

  • Bug fix
  • New feature
  • Code refactor
  • Breaking change
  • Documentation update
  • Other (please describe)

Checklist

  • Added tests that prove my fix is effective or that my feature works
  • Updated the changelog (if applicable)
  • Updated the documentation (if applicable)
  • Updated the migration guide (if applicable)

Other comments

@ervteng ervteng merged commit 3a9a200 into master Nov 18, 2020
@delete-merged-branch delete-merged-branch bot deleted the develop-fix-bc-clip branch November 18, 2020 19:35
ervteng pushed a commit that referenced this pull request Nov 18, 2020
@ervteng ervteng mentioned this pull request Nov 18, 2020
10 tasks
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Nov 18, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants