i have som question #181

pyc33351 · 2020-08-09T10:28:06Z

in run_scripts/train_baseline.py
Hi, origin paper use A3C to train the agent, but I found in the above file that each agent will be assigned a PPO policy network, so which network will be trained？A3C or PPO?i first time use rllib and ray,i didn’t understand why To set up like this

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

i have som question #181

i have som question #181

pyc33351 commented Aug 9, 2020

i have som question #181

i have som question #181

Comments

pyc33351 commented Aug 9, 2020