Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

i have som question #181

Open
pyc33351 opened this issue Aug 9, 2020 · 0 comments
Open

i have som question #181

pyc33351 opened this issue Aug 9, 2020 · 0 comments

Comments

@pyc33351
Copy link

pyc33351 commented Aug 9, 2020

in run_scripts/train_baseline.py
Hi, origin paper use A3C to train the agent, but I found in the above file that each agent will be assigned a PPO policy network, so which network will be trained?A3C or PPO?i first time use rllib and ray,i didn’t understand why To set up like this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant