You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jan 27, 2023. It is now read-only.
execute 1-step + stores transition to replay buffer + train agent by sampled transitions
NStepParallelAgent
for A2C-like algorithms
execute N-step in parallel environments + train the policy in an online manner
These 2 divisions are practical but lack flexibility.
E.g., we cannot extend OneStep algorithms to batched-parallel style without rewriting the whole process.
So we should re-define agent hierarchies using some important properties, like
Online/Offline(or use replay buffer or not)
MultiStep/OneStep
Not Parallel/Batch Parallel/ Async Parallel
The text was updated successfully, but these errors were encountered:
Now we have two types of agents
These 2 divisions are practical but lack flexibility.
E.g., we cannot extend
OneStep
algorithms to batched-parallel style without rewriting the whole process.So we should re-define agent hierarchies using some important properties, like
The text was updated successfully, but these errors were encountered: