better support for custom environments #20

zach-nervana · 2017-11-05T02:44:07Z

At the moment, the following steps are required to use an environment not included with coach:

implement an environment which matches coach custom environment type
import your environment inside of environments/__init__.py
a new entry should be added to the EnvTypes enum mapping the environment name to the wrapper's class name, also in environments/__init__.py
a new configuration class should be implemented for defining the environment's parameters and added to configurations.py
a new preset should be added to presets.py

these steps are from here: http://coach.nervanasys.com/contributing/add_env/index.html

Many of these steps require modifying coach internal files. In order to track these changes in a version control system, coach itself must be forked and modified. This is inconvenient for multiple reasons:

users can not simply run pip install coach --upgrade to get upgrades.
users must continually rebase their changes on top of the latest coach changes and resolve any merge conflicts that occur.
users may have environments defined in existing projects and git repositories. with the current set up, they will need to continue committing to their existing repositories and also make sure their coach fork matches their environment repository, and make separate commits to the environment wrapper if necessary.

Additionally, this pattern of modifying coaches internals in order to add environments or run experiments is not very new user-friendly.

Here is an example from rllab documentation:

from rllab.algos.trpo import TRPO
from rllab.baselines.linear_feature_baseline import LinearFeatureBaseline
from examples.point_env import PointEnv
from rllab.envs.normalized_env import normalize
from rllab.policies.gaussian_mlp_policy import GaussianMLPPolicy

env = normalize(PointEnv())
policy = GaussianMLPPolicy(
    env_spec=env.spec,
)
baseline = LinearFeatureBaseline(env_spec=env.spec)
algo = TRPO(
    env=env,
    policy=policy,
    baseline=baseline,
)
algo.train()

The user is required to create a new environment wrapper following their environment pattern similar to our step 1. After that however, the user only needs to write a simple python script with a few imports and some configuration.

Looking through coach.py, it looks like it shouldn't be terribly difficult to provide a similar interface, though I am not entirely sure. What do you guys think?

The text was updated successfully, but these errors were encountered:

itaicaspi-intel · 2017-11-07T18:25:54Z

Thanks Zach, I think that this is a good suggestion.
Maybe something along the lines of Gym's register environment?
To allow using Coach within other scripts, it would probably also be good to have a better python interface to Coach, such as the one you cited from rllab.

jtoy · 2017-12-20T03:40:02Z

agree, would love this. I wanted to try using this with https://github.com/jtoy/sensenet as its openai compatible, but there were too many files to modify.

zach-nervana · 2018-01-10T21:36:32Z

I added a new method for specifying which environment to use in the case that the environment follows the interface provided by openai gym. In a preset, you can now specify the environment via: env.level = 'package.module:ClassName'. This means that new environments and be integrated more easily. The only place where coach needs modified to work with new environments now is presets.py. This change was completed in #41.

I will leave this issue open because it would still be nice to configure coach without modifying presets.py. Also, this fix is only helpful in the case that your environment is a gym environment.

jtoy · 2018-01-22T15:42:40Z

@zach-nervana Great, I'll test it out and let you know.

saddy001 · 2018-07-08T16:26:58Z

@zach-nervana could you update the docs (https://nervanasystems.github.io/coach/contributing/add_env/index.html) to reflect your change?

Thanks,

itaicaspi-intel · 2018-08-19T08:43:23Z

thanks @saddy001, it is now updated in https://nervanasystems.github.io/coach/contributing/add_env/index.html as you suggested.
the environment integration process was simplified as well, and we also have a step by step tutorial notebook to accompany the documentation (https://github.com/NervanaSystems/coach/blob/master/tutorials/2.%20Adding%20an%20Environment.ipynb)

itaicaspi-intel added the enhancement label Nov 10, 2017

itaicaspi-intel closed this as completed Aug 19, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

better support for custom environments #20

better support for custom environments #20

zach-nervana commented Nov 5, 2017

itaicaspi-intel commented Nov 7, 2017

jtoy commented Dec 20, 2017 •

edited

zach-nervana commented Jan 10, 2018 •

edited

jtoy commented Jan 22, 2018

saddy001 commented Jul 8, 2018

itaicaspi-intel commented Aug 19, 2018

better support for custom environments #20

better support for custom environments #20

Comments

zach-nervana commented Nov 5, 2017

itaicaspi-intel commented Nov 7, 2017

jtoy commented Dec 20, 2017 • edited

zach-nervana commented Jan 10, 2018 • edited

jtoy commented Jan 22, 2018

saddy001 commented Jul 8, 2018

itaicaspi-intel commented Aug 19, 2018

jtoy commented Dec 20, 2017 •

edited

zach-nervana commented Jan 10, 2018 •

edited