Skip to content
This repository has been archived by the owner on Jun 13, 2024. It is now read-only.

feature request: batched get_optimistic_exploration_action #3

Open
samuelstanton opened this issue Dec 20, 2019 · 1 comment
Open

Comments

@samuelstanton
Copy link

Would it be straighforward to implement a batched version of get_optimistic_exploration_action?

@quanvuong
Copy link
Collaborator

Hi Samuel,

It is doable (a few hours effort), but not straightforward (< 20 minutes).

This is because:

  1. the function get_optimistic_exploration_action requires computing one backward pass and per-sample gradient (and not sum of gradients over a batch of input).
  2. AFAIK, pytorch doesn't support computing per-sample gradient.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants