bc4rl

Bisimulation Critic for Reinforcement Learning

Usage:

python train.py <algo> <policy> <env> -d <device>

Example:

python train.py bsac BSACMlpPolicy lunarlander -d cuda:0

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
bc4rl		bc4rl
hyperparams		hyperparams
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
eval.py		eval.py
train.py		train.py

Provide feedback