
Enhanced rliable eval #1183

Merged: 16 commits merged into master on Aug 6, 2024
Conversation

MischaPanch (Collaborator) commented Aug 1, 2024:

Rliable eval: multiple extensions

  1. Support for evaluating training runs (previously only test runs were supported)
  2. Improved handling of figures and axes
  3. Allow passing max_env_step
  4. Use the minimum length of all experiments (bugfix: previously it would crash if experiments had different lengths)
  5. Extended the data-loading logic so that it can handle data from low-level API experiments

Also some fixes and extensions in the loggers. A sketch of the underlying rliable workflow is given below.
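For orientation, the kind of rliable workflow this evaluation wrapper automates is sketched below. This is a minimal sketch, not the PR's code: the loader, file names, and directory layout are invented placeholders, and only the rliable library calls (get_interval_estimates, aggregate_iqm) are real.

    import numpy as np
    from rliable import library as rly
    from rliable import metrics

    def load_score_curves(run_dirs: list[str]) -> np.ndarray:
        """Hypothetical loader: stack per-run return curves into (runs, tasks, points)."""
        curves = [np.load(f"{d}/episode_returns.npy") for d in run_dirs]  # each (points,)
        # Idea behind item 4 above: truncate every run to the shortest curve instead
        # of crashing when experiments were logged for different numbers of steps.
        min_len = min(len(c) for c in curves)
        return np.stack([c[:min_len] for c in curves])[:, None, :]  # add a task axis

    scores = {"my_algo": load_score_curves(["log/run0", "log/run1", "log/run2"])}

    # IQM at every evaluation point, with stratified-bootstrap confidence intervals.
    iqm = lambda x: np.array([metrics.aggregate_iqm(x[..., t]) for t in range(x.shape[-1])])
    point_estimates, interval_estimates = rly.get_interval_estimates(scores, iqm, reps=2000)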

Michael Panchenko added 4 commits August 1, 2024 18:06
1. Support for evaluating training runs
2. Improved handling of figures and axes
3. Allow passing max_env_step
4. Use min len of all experiments (bugfix, previously it would crash if experiments had different lengths)
codecov-commenter commented Aug 4, 2024:

⚠️ Please install the Codecov GitHub app to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

Attention: Patch coverage is 19.44444% with 116 lines in your changes missing coverage. Please review.

Project coverage is 84.75%. Comparing base (3c523c8) to head (d622bef).

Files                                           Patch %   Lines
tianshou/evaluation/rliable_evaluation_hl.py      0.00%   88 Missing ⚠️
tianshou/highlevel/logger.py                     34.78%   15 Missing ⚠️
tianshou/utils/logger/wandb.py                   45.45%    6 Missing ⚠️
tianshou/evaluation/launcher.py                   0.00%    4 Missing ⚠️
tianshou/data/stats.py                           50.00%    1 Missing ⚠️
tianshou/utils/logger/base.py                    92.30%    1 Missing ⚠️
tianshou/utils/logger/tensorboard.py             66.66%    1 Missing ⚠️

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #1183      +/-   ##
==========================================
- Coverage   85.51%   84.75%   -0.77%     
==========================================
  Files         104      104              
  Lines        8874     8976     +102     
==========================================
+ Hits         7589     7608      +19     
- Misses       1285     1368      +83     
Flag        Coverage Δ
unittests   84.75% <19.44%> (-0.77%) ⬇️

Flags with carried forward coverage won't be shown.


maxhuettenrauch (Collaborator) left a review comment:
Changes LGTM; I've just added some comments in case we want to expand. Otherwise I'll approve.

tianshou/highlevel/logger.py (outdated, review thread resolved)
@@ -11,6 +12,8 @@
    with contextlib.suppress(ImportError):
        import wandb

    log = logging.getLogger(__name__)


    class WandbLogger(BaseLogger):
maxhuettenrauch (Collaborator) commented:

Will need extra arguments for run grouping

MischaPanch (Collaborator, Author) replied:

See above
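
As a rough illustration of the "extra arguments for run grouping" point above: wandb itself groups runs via the group and job_type arguments of wandb.init, so the logger would mainly need to forward something like the following. The concrete values and the idea of exposing these on WandbLogger are assumptions, not what this PR implements.

    import wandb

    # All seeds of one experiment share a group so the wandb UI can aggregate them;
    # job_type further separates training from evaluation runs within the group.
    run = wandb.init(
        project="tianshou",
        group="sac-halfcheetah",  # hypothetical experiment-level group name
        job_type="train",
        name="seed-0",
    )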


On this snippet:

    test_data_found = True
    train_data_found = True
    if not test_episode_returns or env_step_at_test is None:
maxhuettenrauch (Collaborator) commented:

We probably need another mechanism here as well. In the unfortunate event of all but the last run having recorded data, env_step_at_test may be None and the RuntimeError below is raised.

MischaPanch (Collaborator, Author) replied:

I feel like we should rather improve the logging and stats-saving mechanism to ensure this never happens instead of doing more gymnastics here. Would that be possible? We can do that in a separate issue/PR.
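
For illustration only, the more defensive variant discussed in this thread could look roughly like the sketch below; the per-run record structure and attribute names are hypothetical, and the PR itself keeps the RuntimeError.

    import logging

    log = logging.getLogger(__name__)

    valid_runs = []
    for run in loaded_runs:  # `loaded_runs`: assumed list of per-run loaded records
        # Skip runs whose test data was never written instead of aborting the
        # whole evaluation with a RuntimeError.
        if not run.test_episode_returns or run.env_step_at_test is None:
            log.warning("Skipping run without recorded test data: %s", run.exp_dir)
            continue
        valid_runs.append(run)
    if not valid_runs:
        raise RuntimeError("No run contains recorded test data; nothing to evaluate.")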

tianshou/evaluation/rliable_evaluation_hl.py (review thread resolved)
MischaPanch force-pushed the feature/enhanced-rliable-eval branch from 11b83ab to 21a7a3c on August 6, 2024 at 14:29
MischaPanch (Collaborator, Author) commented:

@maxhuettenrauch Thanks for the logger extension and review! Merging this now

MischaPanch merged commit 0c84ef6 into master on Aug 6, 2024
MischaPanch deleted the feature/enhanced-rliable-eval branch on August 6, 2024 at 14:31