Does the DQN fall under the paradigm of decentralized training and decentralized execution? #78

Open
lml519 opened this issue Apr 26, 2020 · 1 comment

Comments


lml519 commented Apr 26, 2020

Does the DQN fall under the paradigm of decentralized training and decentralized execution (DTDE)? I think it is an algorithm that combines parallel computing with DTDE, but I'm not sure whether my understanding is right.

Collaborator

merrymercy commented Apr 28, 2020

I believe our DQN is in the paradigm of centralized training and decentralized execution.
During training, we collect all trajectories and train a single shared model, so the training is centralized.
During inference, we feed in different observations and agent embeddings, so the execution is decentralized.
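
A minimal sketch of that split, for illustration only. It is not this repository's code: `SharedQNetwork`, its methods, the one-hot agent embedding, and the linear Q-function are all hypothetical stand-ins, and it leaves out the replay buffer and target network a real DQN would use. The point is only that `train_step` updates one set of weights from transitions pooled across all agents, while `act` needs nothing beyond a single agent's own observation and id.

```python
import numpy as np


class SharedQNetwork:
    """One linear Q-function shared by all agents (hypothetical, for illustration)."""

    def __init__(self, obs_dim, n_agents, n_actions, seed=0):
        rng = np.random.default_rng(seed)
        self.n_agents = n_agents
        # Input = observation concatenated with a one-hot agent embedding (assumption).
        self.W = rng.normal(scale=0.1, size=(obs_dim + n_agents, n_actions))

    def features(self, obs, agent_id):
        one_hot = np.zeros(self.n_agents)
        one_hot[agent_id] = 1.0
        return np.concatenate([obs, one_hot])

    def q_values(self, obs, agent_id):
        return self.features(obs, agent_id) @ self.W

    def act(self, obs, agent_id):
        # Decentralized execution: only this agent's observation and id are used.
        return int(np.argmax(self.q_values(obs, agent_id)))

    def train_step(self, batch, gamma=0.99, lr=0.01):
        # Centralized training: the batch pools transitions from every agent,
        # and all of them update the same weight matrix.
        for agent_id, obs, action, reward, next_obs, done in batch:
            target = reward
            if not done:
                target += gamma * np.max(self.q_values(next_obs, agent_id))
            x = self.features(obs, agent_id)
            td_error = target - x @ self.W[:, action]
            self.W[:, action] += lr * td_error * x


# Usage with made-up data: 3 agents, 4-dimensional observations, 2 actions.
net = SharedQNetwork(obs_dim=4, n_agents=3, n_actions=2)
rng = np.random.default_rng(1)
batch = [
    (i % 3, rng.normal(size=4), int(rng.integers(2)), 1.0, rng.normal(size=4), False)
    for i in range(30)
]
net.train_step(batch)  # one shared model learns from every agent's trajectories
actions = [net.act(rng.normal(size=4), agent_id) for agent_id in range(3)]
```

Under these assumptions, each agent could hold its own copy of the trained weights at execution time and choose actions without communicating with the others.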
