Implement supervised behavioral cloning training loop #8

cswinter · 2021-11-23T05:00:22Z

No description provided.

Adds a new `supervised.py` script to enn-ppo which trains a model from samples recorded by another policy. Also makes various improvements to the sample recorder: - add `--eval-capture-samples`/`--eval-capture-logits` options to record samples/logits during eval to a file - add `--eval-on-step-0` arg to enable/disable running eval on the first step - add `--codecraft-only-opponent` to run an eval with only a loaded eval policy against itself (this is slightly hacky, I'm planning to remove all the CodeCraft-specific options later) - include action and observation spaces when recording samples - fix `RaggedBufferBool` getting deserialized to `None` - misc fixes to the `SampleRecorder` and `Trace` Resolves #5, #6, and #8.

Adds a new `supervised.py` script to enn-ppo which trains a model from samples recorded by another policy. Also makes various improvements to the sample recorder: - add `--eval-capture-samples`/`--eval-capture-logits` options to record samples/logits during eval to a file - add `--eval-on-step-0` arg to enable/disable running eval on the first step - add `--codecraft-only-opponent` to run an eval with only a loaded eval policy against itself (this is slightly hacky, I'm planning to remove all the CodeCraft-specific options later) - include action and observation spaces when recording samples - fix `RaggedBufferBool` getting deserialized to `None` - misc fixes to the `SampleRecorder` and `Trace` Resolves entity-neural-network/incubator#5, entity-neural-network/incubator#6, and entity-neural-network/incubator#8.

cswinter mentioned this issue Feb 21, 2022

Behavioral cloning #175

Merged

cswinter linked a pull request Feb 21, 2022 that will close this issue

Behavioral cloning #175

Merged

cswinter closed this as completed in #175 Feb 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement supervised behavioral cloning training loop #8

Implement supervised behavioral cloning training loop #8

cswinter commented Nov 23, 2021

Implement supervised behavioral cloning training loop #8

Implement supervised behavioral cloning training loop #8

Comments

cswinter commented Nov 23, 2021