[RLlib] Metrics do-over 01: Introducing and testing new MetricsLogger and Stats APIs. #44442
Conversation
LGTM. Great and very appreciated PR. Some nits here and there and a couple of questions. What is missing imo is to show the basic structure of the `ResultDict` for `EnvRunner`, `Learner`, etc., and of the overall `ResultDict` containing all of them. This is important when putting custom metrics into order - specifically, if a user wants to log them to a specific chart in TensorBoard/WandB.
    else:
        for batch_or_episode in sampled_data:
            if max_agent_steps:
                agent_or_env_steps += (
Do we need this, if we return metrics? And when don't we want to return metrics?
Yeah, unfortunately, we do in this case, b/c we have to determine right here (before even logging the actual steps to the Algorithm's metrics) when to stop the while loop.
We might rearrange this entire utility, but for now, it works just fine.
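The point made above (the step counter must be maintained locally, before any metrics logging happens, because it alone decides when sampling stops) can be sketched as a toy loop. The function and method names below are illustrative stand-ins, not RLlib's actual code:

```python
def sample_until(env_runner, max_agent_steps=None, max_env_steps=None):
    """Toy sketch: keep sampling until a step budget is exhausted.

    The local counter is needed *before* any metrics are logged to the
    Algorithm's MetricsLogger, because it alone terminates the while loop.
    """
    agent_or_env_steps = 0
    all_data = []
    target = max_agent_steps if max_agent_steps else max_env_steps
    while agent_or_env_steps < target:
        sampled_data = env_runner.sample()
        for batch_or_episode in sampled_data:
            if max_agent_steps:
                agent_or_env_steps += batch_or_episode.agent_steps()
            else:
                agent_or_env_steps += batch_or_episode.env_steps()
        all_data.extend(sampled_data)
    return all_data
```

Rearranging the utility (as suggested above) would mean moving this counting into the logger itself; for now the counter stays local.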
RLlib Metrics, custom user metrics, and `ResultsDict` do-over for the new API stack:

This PR introduces a new unified API (`MetricsLogger`) for both RLlib's core codebase and its users to log (custom) metrics from within any(!) component. Thus, whether you are inside `Algorithm` (callback, overriding `training_step`, custom eval function, etc.), `EnvRunner` (callback), or `Learner` (custom loss), you can now use the exact same API in all these components for logging your custom metrics.

The new `MetricsLogger`, which is held by all the above components under the `self.metrics` property, exposes a simple API to log stats values.

There are two situations during which all values logged thus far will be "reduced" or "merged":

1. At the end of a component's cycle. For example, if you call `EnvRunner.sample()`, at the end of this call, the `EnvRunner` will call the `reduce()` method on its `MetricsLogger` object and return the results. Note that this does not necessarily mean that all historic data is reduced at this time. If, for example, you have a stat under the "abc" key with `window=1000` and the `EnvRunner` only logged 50 new values during the `sample()` call, the previously logged 950 values will still remain in the cache under that key.

2. After n parallel components (e.g. n EnvRunners) have returned their reduced results, the controlling component (e.g. the `Algorithm` object controlling the n remote EnvRunners) will have to merge the n received result dicts. This can be achieved with the `MetricsLogger` of the controlling component.
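The log → reduce → merge cycle described above can be illustrated with a small self-contained mock. This is a sketch of the pattern only; `ToyMetricsLogger` and its method names are illustrative stand-ins, not RLlib's actual `MetricsLogger` API:

```python
from collections import defaultdict, deque

class ToyMetricsLogger:
    """Illustrative mock of the log/reduce/merge pattern (not RLlib's API)."""

    def __init__(self, window=1000):
        # Per-key cache holding only the most recent `window` logged values.
        self.stats = defaultdict(lambda: deque(maxlen=window))

    def log_value(self, key, value):
        # Any component (Algorithm, EnvRunner, Learner) logs through this.
        self.stats[key].append(value)

    def reduce(self):
        # Reduce (here: mean over the window) WITHOUT clearing the cache,
        # so previously logged values stay until they fall out of the window.
        return {k: sum(v) / len(v) for k, v in self.stats.items() if v}

    def merge(self, *result_dicts):
        # The controlling component merges the n result dicts returned by
        # n parallel workers by logging each reduced value into its own cache.
        for rd in result_dicts:
            for key, value in rd.items():
                self.log_value(key, value)
```

For example, each worker-side logger would `log_value("abc", ...)` during sampling and `reduce()` at the end of its cycle; the controlling logger then `merge()`s the n returned dicts and `reduce()`s again for the final result.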
Why are these changes needed?
Related issue number
Checks
- I've signed off every commit (`git commit -s`) in this PR.
- I've run `scripts/format.sh` to lint the changes in this PR.
- If I've added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file.