Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement EXP3.R to tackle switching bandits ? #100

Closed
Naereen opened this issue Dec 12, 2017 · 1 comment
Closed

Implement EXP3.R to tackle switching bandits ? #100

Naereen opened this issue Dec 12, 2017 · 1 comment
Assignees
Labels
new algo I have to implement a new algorithm! Yay! non-stationary For non-stationary bandits simulations single-player For single-player bandits simulations

Comments

@Naereen
Copy link
Member

Naereen commented Dec 12, 2017

@Naereen Naereen added the new algo I have to implement a new algorithm! Yay! label Dec 12, 2017
@Naereen Naereen self-assigned this Dec 12, 2017
@Naereen Naereen changed the title Implement EXP3.R ? Implement EXP3.R to tackle switching bandits ? Apr 25, 2018
@Naereen Naereen added non-stationary For non-stationary bandits simulations single-player For single-player bandits simulations labels Oct 7, 2018
@Naereen
Copy link
Member Author

Naereen commented Nov 21, 2018

Two variants of Exp3R have been implemented: Exp3.R uses naive Exp3 as a base, and Exp3.R uses Exp3++ as a base.
They both work similarly. Not so great, but they work.

  • Mean reward:
    figure_1

  • Mean regret:
    figure_1

Naereen added a commit that referenced this issue Nov 21, 2018
@Naereen Naereen closed this as completed Nov 22, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new algo I have to implement a new algorithm! Yay! non-stationary For non-stationary bandits simulations single-player For single-player bandits simulations
Development

No branches or pull requests

1 participant