You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'd like to reproduce the result in your paper, but after I run the code this repo based on these settings(seed=1, reward_type=op, episodes=3000), the result in 1458 seems similar of the paper, but the result in 3427 seems inferior than the paper a lot. It seems even inferior than HB method. Below is my result.
The click value fluctuates a lot even in train in 0.25, 0.125 setting. I wonder is it normal? And how can I get the result in paper?
The text was updated successfully, but these errors were encountered:
I'd like to reproduce the result in your paper, but after I run the code this repo based on these settings(seed=1, reward_type=op, episodes=3000), the result in 1458 seems similar of the paper, but the result in 3427 seems inferior than the paper a lot. It seems even inferior than HB method. Below is my result.
![image](https://user-images.githubusercontent.com/52061290/208304260-f911b390-57ec-4053-b57d-1a4247652395.png)
![image](https://user-images.githubusercontent.com/52061290/208304357-e9c6b4f9-8676-4301-bdfe-eb8c21907287.png)
![image](https://user-images.githubusercontent.com/52061290/208304398-57e1cf4d-910c-4339-8d1a-87b5a35e91ac.png)
![image](https://user-images.githubusercontent.com/52061290/208304528-f76c1f91-be5d-4c57-adc0-b18a6f9d3369.png)
![image](https://user-images.githubusercontent.com/52061290/208304545-5677b7c7-a40b-42df-b45c-e1f00f08ea6b.png)
![image](https://user-images.githubusercontent.com/52061290/208299688-847f4299-eaaf-4815-b8ba-baf8496146b1.png)
The click value fluctuates a lot even in train in 0.25, 0.125 setting. I wonder is it normal? And how can I get the result in paper?
The text was updated successfully, but these errors were encountered: