Feature/algo eval #1074

maxhuettenrauch · 2024-03-12T14:18:35Z

Changes

Dependencies

New extra "eval"

API Extension

Experiment and ExperimentConfig now have a name, which can be overridden when Experiment.run() is called.
When building an Experiment from an ExperimentConfig, the user has the option to add info about seeds to the name.
New method in ExperimentConfig called build_default_seeded_experiments
SamplingConfig has an explicit training seed, test_seed is inferred.
New evaluation package for repeating the same experiment with multiple seeds and aggregating the results (important extension!). Currently in alpha state.
Loggers can now restore logged data into Python using the new restore_logged_data method.
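To illustrate what "repeating the same experiment with multiple seeds and aggregating the results" amounts to, here is a minimal standard-library sketch. It does not use the new evaluation package or rliable; `run_with_seeds`, `interquartile_mean`, and `toy_experiment` are hypothetical names introduced only for this example, and the toy scores are synthetic. The interquartile mean is included because it is one of the aggregate metrics rliable reports.

```python
import random
import statistics
from collections.abc import Callable, Sequence


def run_with_seeds(run_once: Callable[[int], float], seeds: Sequence[int]) -> dict[int, float]:
    """Run the same experiment once per seed and collect one final score per run."""
    return {seed: run_once(seed) for seed in seeds}


def interquartile_mean(scores: Sequence[float]) -> float:
    """Mean of the middle 50% of the scores (25% trimmed from each end)."""
    ordered = sorted(scores)
    cut = int(0.25 * len(ordered))
    return statistics.fmean(ordered[cut : len(ordered) - cut])


def toy_experiment(seed: int) -> float:
    """Hypothetical stand-in for a training run: a noisy score determined by the seed."""
    rng = random.Random(seed)
    return 100.0 + rng.gauss(0.0, 5.0)


scores = run_with_seeds(toy_experiment, seeds=range(8))
values = list(scores.values())
summary = {
    "mean": statistics.fmean(values),
    "std": statistics.stdev(values),
    "iqm": interquartile_mean(values),
}
print(summary)
```

Because each run depends only on its seed, calling `run_with_seeds` again with the same seeds reproduces the same scores, which is the property that explicit train and test seeds are meant to provide.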

Breaking Changes

AtariEnvFactory (in examples) now receives explicit train and test seeds
EnvFactoryRegistered now requires an explicit test_seed
BaseLogger.prepare_dict_for_logging is now abstract

maxhuettenrauch · 2024-03-13T09:38:55Z

@MischaPanch @bordeauxred please have a look

MischaPanch

Preliminary review; we can have a closer look together later.

One thing to do already is to separate saving plots from plotting, and/or give the user the possibility to configure how plots should be saved.

Files commented on: examples/mujoco/mujoco_ppo_hl_multi.py, examples/mujoco/tools.py, tianshou/highlevel/experiment.py, tianshou/utils/logger/wandb.py, examples/atari/atari_wrapper.py, examples/mujoco/mujoco_env.py, tianshou/utils/logger/pandas_logger.py, tianshou/utils/logger/tensorboard.py

maxhuettenrauch · 2024-04-17T17:39:19Z

I had to use contextlib.suppress in order to make the docs build, but at least tests are passing now. I also linked our fork of rliable in the project dependencies as long as the original does not update the dependency on arch.
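For readers unfamiliar with the pattern, the following is a minimal sketch of suppressing an ImportError for an optional dependency. It is not the code from this PR: the module name is hypothetical and deliberately not installed, and `evaluation_available` and `require_evaluation` are illustrative helpers.

```python
import contextlib
import importlib
from types import ModuleType
from typing import Optional

# Hypothetical module name, chosen so that it is certainly not installed.
OPTIONAL_MODULE = "hypothetical_optional_eval_dependency"

optional_dep: Optional[ModuleType] = None
with contextlib.suppress(ImportError):
    # If the import fails, the block is left silently and optional_dep stays None.
    optional_dep = importlib.import_module(OPTIONAL_MODULE)


def evaluation_available() -> bool:
    """Report whether the optional dependency could be imported."""
    return optional_dep is not None


def require_evaluation() -> ModuleType:
    """Fail with an explicit message at use time instead of at import time."""
    if optional_dep is None:
        raise ImportError(
            f"{OPTIONAL_MODULE} is missing; install the optional extra to use evaluation."
        )
    return optional_dep
```

The trade-off is that a missing package no longer fails at import time, so the error only surfaces when the functionality is actually used.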

MischaPanch · 2024-04-17T20:09:59Z

> I also linked our fork of rliable in the project dependencies as long as the original does not update the dependency on arch.

Thanks, that works! Seems like rliable is no longer maintained; my PR bumping the arch version got no attention. Let's see how long we can go without touching it again. If it creates further problems, it might be worth moving the functionality over to tianshou or somewhere else.

MischaPanch · 2024-04-17T20:13:19Z

@maxhuettenrauch as to the contextlib suppression: I don't think that's the best solution. Instead you could change the command in the CI, which is currently "poetry install --with dev", to "poetry install --with dev -E evaluation" (or something like that).

Note that you can run GitHub Actions locally with act; maybe it makes sense to have a poe command for it.

maxhuettenrauch · 2024-04-18T08:38:43Z

Thanks for the recommendation. Unfortunately I haven't managed to get act to work (I'm getting ::error::Not Found, but have no idea what is actually not found).

MischaPanch · 2024-04-18T17:06:12Z

Seems like this is ready, right? I'm going to run a few tests tomorrow and then merge it.

MischaPanch · 2024-04-20T21:44:40Z

I couldn't make parallel execution work with joblib. It also might not make too much sense on a single machine, so we can consider using ray for parallelization instead. For now the limitations are documented, so I'd merge this.

In my last commits I improved the plots a bit (axis labels were cut off), added more docs, fixed some installation problems, and made minor enhancements to the interfaces.

Maximilian Huettenrauch added 8 commits March 6, 2024 17:09

added explicit env seeding for train and test envs

95cbfe6

logger updates

32cd3b4

- introduced logger manager - loggers can reload logged data from disk

logger updates

734119e

extend hl experiment builder

5762d2c

add mujoco example with multiple runs and performance plots

6c1bd85

Merge branch 'thuml_master' into feature/algo-eval

f730782

format, type check and small fixes

d9a612a

small fix

a7898b1

Merge branch 'thuml_master' into feature/algo-eval

5259d5f

# Conflicts: # examples/mujoco/mujoco_env.py

MischaPanch reviewed Mar 18, 2024

Maximilian Huettenrauch added 6 commits March 25, 2024 10:32

Merge branch 'thuml_master' into feature/algo-eval

516c956

updates

d9a2017

move doc string

2e3f0b5

added matplotlib dependency

85204b1

added pandas dependency

5a3f229

fix pandas dependency

dffe8cd

MischaPanch reviewed Mar 26, 2024

tianshou/utils/logger/wandb.py (outdated, resolved)

MischaPanch mentioned this pull request Mar 26, 2024

Draft: for explicit seed mechanism of train seed, specific test seeds #1031 (closed)

bordeauxred reviewed Mar 26, 2024

examples/atari/atari_wrapper.py (resolved)

bordeauxred reviewed Mar 26, 2024

examples/mujoco/mujoco_env.py (resolved)

bordeauxred reviewed Mar 26, 2024

tianshou/utils/logger/pandas_logger.py (outdated, resolved)

Maximilian Huettenrauch added 7 commits March 27, 2024 11:38

replace assert with exception in wandb logger

e95fa26

removed name shortener

18d8ffa

restructured and moved RLiableExperimentResult

6d9b697

removed attributes from pandas logger

9055eb5

fixed logger test

ce5fa0d

pleased the mypy gods

9c645ff

added primitive joblib launcher

ec2c5c1

bordeauxred reviewed Mar 27, 2024

tianshou/utils/logger/tensorboard.py (outdated, resolved)

Maximilian Huettenrauch added 3 commits April 17, 2024 18:04

fixed rliable dependency and some docs

49f5b12

updated lock file

6146ad2

suppressed ImportError on optional dependencies

7ebcf93

Maximilian Huettenrauch added 2 commits April 18, 2024 10:34

added eval to pytest.yml and removed contextlib suppress

0c8b4df

lint

3b1ec50

Maximilian Huettenrauch added 3 commits April 18, 2024 10:39

Merge branch 'aai-master' into feature/algo-eval

c27b577

install rliable with https

32c8eb1

updated lint_and_docs.yml

19f3fdf

MischaPanch marked this pull request as ready for review April 18, 2024 17:04

Renamed and commented restore_logged_data in TensorboardLogger [skip-ci]

0592b6a

MischaPanch force-pushed the feature/algo-eval branch from d38d171 to 0592b6a Compare April 20, 2024 13:30

Michael Panchenko added 7 commits April 20, 2024 19:44

Removed old and deprecated BasicLogger

6183f70

Logging: improved typing using recursive type definition

10d1d34

Env: added argparse deps to eval extra

96e42dc

Experiment: use absolute paths

34d1fec

Rliable eval: added docstring, improved figure layout, option to display figures

9fafe7a

Launcher: don't modify user input, set loky as default backend

b42ad64

Multi-experiment script: run sequentially by default, added docstring

31f40c9

MischaPanch approved these changes Apr 20, 2024

MischaPanch enabled auto-merge (squash) April 20, 2024 21:57

MischaPanch disabled auto-merge April 20, 2024 22:30

Merge branch 'master' into feature/algo-eval

edda9af

MischaPanch enabled auto-merge (squash) April 20, 2024 23:22

MischaPanch merged commit ade85ab into thu-ml:master Apr 20, 2024
4 checks passed
