Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement the corrected Exp3 and Exp4 algorithms #17

Closed
Naereen opened this issue Dec 19, 2016 · 2 comments
Closed

Implement the corrected Exp3 and Exp4 algorithms #17

Naereen opened this issue Dec 19, 2016 · 2 comments
Assignees
Labels
new algo I have to implement a new algorithm! Yay!

Comments

@Naereen
Copy link
Member

Naereen commented Dec 19, 2016

  • bd3f825 we should divide by the proba p_t of selecting actions, not by the trusts
@Naereen Naereen added the new algo I have to implement a new algorithm! Yay! label Dec 20, 2016
@Naereen Naereen self-assigned this Dec 20, 2016
@Naereen
Copy link
Member Author

Naereen commented Jan 25, 2017

  • Done for Aggr (Exp4)
  • TODO for Softmax (Exp3)

@Naereen
Copy link
Member Author

Naereen commented Jan 26, 2017

I don't know, and don't care, about how to do that for Exp3.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new algo I have to implement a new algorithm! Yay!
Development

No branches or pull requests

1 participant