Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reproduce result in paper #1

Open
franktseng0718 opened this issue Dec 18, 2022 · 0 comments
Open

Reproduce result in paper #1

franktseng0718 opened this issue Dec 18, 2022 · 0 comments

Comments

@franktseng0718
Copy link

I'd like to reproduce the result in your paper, but after I run the code this repo based on these settings(seed=1, reward_type=op, episodes=3000), the result in 1458 seems similar of the paper, but the result in 3427 seems inferior than the paper a lot. It seems even inferior than HB method. Below is my result.
image
image
image
image
image
image
The click value fluctuates a lot even in train in 0.25, 0.125 setting. I wonder is it normal? And how can I get the result in paper?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant