Skip to content
This repository has been archived by the owner on Nov 5, 2022. It is now read-only.

Asking DQN-MCTS baseline code #2

Open
WenyuHan-LiNa opened this issue May 27, 2021 · 8 comments
Open

Asking DQN-MCTS baseline code #2

WenyuHan-LiNa opened this issue May 27, 2021 · 8 comments

Comments

@WenyuHan-LiNa
Copy link

Hello authors, I am very interested in your work. I am working on a DRL related work. Now, I am planning to add a DQN with MCTS to my project as you did. Would you please share the code or some implementation details about this MCTS baseline in your paper? Thank you ahead!

@jhamrick
Copy link
Contributor

Hi @WenyuHan-LiNa, thanks for your interest in our work!

Unfortunately I am not able to share the code for our MCTS implementation, but if you just want to do a standard DQN plus MCTS, that should be fairly straightforward to set up if you have (1) a standard DQN implementation and (2) a standard MCTS implementation, both of which you should be able to find multiple examples of elsewhere online. The main thing you will need to do is to modify the MCTS code to call your neural network at each node to estimate the Q-values, and then to use the action returned by the search rather than the one corresponding to the maximal Q-value. We tried to provide a lot of details in the appendices of both https://arxiv.org/pdf/1904.03177.pdf (see Appendix E) and https://arxiv.org/pdf/1912.02807.pdf (see Appendix A and in particular Algorithm A.1).

If you have any specific questions I am happy to try to clarify!

@WenyuHan-LiNa
Copy link
Author

WenyuHan-LiNa commented May 27, 2021 via email

@WenyuHan-LiNa
Copy link
Author

WenyuHan-LiNa commented May 29, 2021 via email

@jhamrick
Copy link
Contributor

jhamrick commented May 29, 2021 via email

@WenyuHan-LiNa
Copy link
Author

WenyuHan-LiNa commented May 29, 2021 via email

@jhamrick
Copy link
Contributor

jhamrick commented May 29, 2021 via email

@WenyuHan-LiNa
Copy link
Author

WenyuHan-LiNa commented May 29, 2021 via email

@WenyuHan-LiNa
Copy link
Author

WenyuHan-LiNa commented Jun 15, 2021 via email

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants