Results of GAIL/BC on Mujoco

Here's the extensive experimental results of applying GAIL/BC on Mujoco environments, including Hopper-v1, Walker2d-v1, HalfCheetah-v1, Humanoid-v1, HumanoidStandup-v1. Every imitator is evaluated with seed to be 0.

Results

Training through iterations

Hoppers-v1

HalfCheetah-v1

Walker2d-v1

Humanoid-v1

HumanoidStandup-v1

For details (e.g., adversarial loss, discriminator accuracy, etc.) about GAIL training, please see here

Determinstic Policy (Set std=0)

	Un-normalized	Normalized
Hopper-v1
HalfCheetah-v1
Walker2d-v1
Humanoid-v1
HumanoidStandup-v1

Stochatic Policy

	Un-normalized	Normalized
Hopper-v1
HalfCheetah-v1
Walker2d-v1
Humanoid-v1
HumanoidStandup-v1

details about GAIL imitator

For all environments, the imitator is trained with 1, 5, 10, 50 trajectories, where each trajectory contains at most 1024 transitions, and seed 0, 1, 2, 3, respectively.

details about the BC imitators

All BC imitators are trained with seed 0.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gail-result.md

gail-result.md

Results of GAIL/BC on Mujoco

Results

Training through iterations

Determinstic Policy (Set std=0)

Stochatic Policy

details about GAIL imitator

details about the BC imitators

Files

gail-result.md

Latest commit

History

gail-result.md

File metadata and controls

Results of GAIL/BC on Mujoco

Results

Training through iterations

Determinstic Policy (Set std=0)

Stochatic Policy

details about GAIL imitator

details about the BC imitators