Will you provide QR-DQN code? #144

GoingMyWay · 2020-07-25T15:53:32Z

Will you provide QR-DQN code?

psc-g · 2020-07-27T13:32:05Z

planning on it, stay tuned! :)

GoingMyWay · 2020-07-28T07:51:03Z

Great. Can I ask how you tune and find the optimal hyperparameters? By grid search?

psc-g · 2020-07-28T12:20:05Z

For the configs we've been releasing with Dopamine, we use the published settings. However, for DQN and C51 we used some of the settings from Rainbow when it was published (although we do provide configs with the settings used when each of those agents was published).

GoingMyWay · 2020-08-01T06:26:56Z

Thanks.

RylanSchaeffer · 2020-08-04T04:05:12Z

@GoingMyWay how'd you decide on Dopamine for distributional RL? What are your thoughts on DeepMind's acme?

GoingMyWay · 2020-08-04T04:35:16Z

Hi. Dopamine already implements C51, IQN, and Rainbow, and it is stable now, so you can easily implement other Q-based distributional RL agents with it. As for ACME, I have not used it; it is very new, so it may not be stable yet and may have some undiscovered bugs.

There are also many mature frameworks similar to ACME. For example, ray.io also provides RL APIs for fast RL algorithm implementation.

psc-g · 2020-08-04T13:10:41Z

Coming back to the original question on this thread, we now have a JAX implementation of QR-DQN: https://github.com/google/dopamine/blob/master/dopamine/jax/agents/quantile/quantile_agent.py
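
For readers new to QR-DQN, here is a minimal, dependency-free sketch of the quantile Huber loss that the algorithm minimizes. It is not taken from the linked Dopamine file: the function names (`quantile_midpoints`, `huber`, `quantile_huber_loss`) are hypothetical, and the reduction used here (sum over predicted quantiles, mean over target samples) is an assumption, since implementations differ on that convention.

```python
def quantile_midpoints(num_quantiles):
    """Quantile midpoints tau_i = (2i + 1) / (2N) for i = 0..N-1."""
    return [(2 * i + 1) / (2 * num_quantiles) for i in range(num_quantiles)]


def huber(u, kappa=1.0):
    """Huber loss: quadratic for |u| <= kappa, linear beyond it."""
    abs_u = abs(u)
    if abs_u <= kappa:
        return 0.5 * u * u
    return kappa * (abs_u - 0.5 * kappa)


def quantile_huber_loss(predicted, targets, kappa=1.0):
    """Quantile Huber loss for one state-action pair (assumes kappa > 0).

    predicted: the N quantile estimates produced by the network.
    targets:   the target quantile samples (e.g. r + gamma * next quantiles).
    """
    taus = quantile_midpoints(len(predicted))
    total = 0.0
    for tau, theta in zip(taus, predicted):
        row = 0.0
        for target in targets:
            u = target - theta  # TD error between a target and a prediction
            # Asymmetric weight: |tau - 1{u < 0}|
            weight = abs(tau - (1.0 if u < 0 else 0.0))
            row += weight * huber(u, kappa) / kappa
        total += row / len(targets)  # mean over target samples
    return total  # summed over predicted quantiles


if __name__ == "__main__":
    print(quantile_midpoints(2))
    print(quantile_huber_loss([0.0, 0.0], [1.0]))
```

The asymmetric weight is what makes each output converge to a different quantile: under-estimates are penalized by `tau` and over-estimates by `1 - tau`.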

GoingMyWay · 2020-08-04T14:27:47Z

Great. @RylanSchaeffer, you can try Dopamine.

GoingMyWay closed this as completed Aug 1, 2020
