Use `global_step` as the x-axis for wandb #558

vwxyzjn · 2022-03-05T19:14:55Z

I have marked all applicable categories:
- exception-raising fix
- algorithm implementation fix
- documentation modification
- new feature
I have reformatted the code using make format (required)
I have checked the code using make commit-checks (required)
If applicable, I have mentioned the relevant/related issue(s)
If applicable, I have listed every items in this Pull Request below

Tianshou already supports W&B logging via #426. The current logging solution uses two custom x-axises train/env_step and test/env_step. Such usage might be less desirable because

train/env_step and test/env_step share virtually the same values, so we should use the same key such as global_step; with global_step as the x-axis we can still see the train/reward and test_reward as the y-axis,
it's hard to compare tianshou's experiments with those from SB3 and CleanRL, which have adopted global_step as the common x-axis (see Support experiment tracking with W&B DLR-RM/rl-baselines3-zoo#213).

To help address this issue, this PR uses global_step as the x-axis for wandb logging. Additionally, this PR allows the users to override the default wandb project via environment variables like:

WANDB_PROJECT=myproject python3 atari_dqn.py --task "BreakoutNoFrameskip-v4" --test-num 100 --logger wandb

Alternatives considered

An alternative plan is to remove the WandbLogger altogether and instead use wandb's tensorboard integration like

wandb.init(..., sync_tensorboard=True)

While this is possible, WandbLogger currently does more such as resume training, so removing it is a bit more complicated.

Trinkle23897 · 2022-03-05T22:07:39Z

WandbLogger currently does more such as resume training

TensorboardLogger also has this function. So I think that's fine to create something like

class WandbLogger(TensorboardLogger):
  def __init__(self, *args, **kwargs):
    wandb.init(..., sync_tensorboard=True)
    super().__init__(*args, **kwargs)

vwxyzjn · 2022-03-06T01:35:44Z

WandbLogger currently does more such as resume training

TensorboardLogger also has this function. So I think that's fine to create something like
class WandbLogger(TensorboardLogger):
  def __init__(self, *args, **kwargs):
    wandb.init(..., sync_tensorboard=True)
    super().__init__(*args, **kwargs)

There is an issue with this approach. The SummaryWriter needs to be initialized after the wandb.init(..., sync_tensorboard=True), which requires the refactoring from the TensorboardLogger. Maybe we should revert the changes back?

Trinkle23897 · 2022-03-06T15:00:02Z

How about this:

# logger/wandb_init.py
import wandb
wandb.init(..., sync_tensorboard=True)

# logger/wandb.py
# from tianshou.utils.logger import wandb_init
from tianshou.utils.logger.tensorboard import TensorboardLogger

class WandbLogger(TensorboardLogger):
  pass

# utils/__init__.py
do not import wandb_init here

and in main.py:

...
from tianshou.utils.logger import wandb_init, WandbLogger
...
if __name__ == "__main__":
  ...

vwxyzjn · 2022-03-06T15:02:40Z

This is the way CleanRL does it:

https://github.com/vwxyzjn/cleanrl/blob/0b3f8eae7d07b90a0ee129ffe290bd82e5b57a14/cleanrl/ppo.py#L136-L152

Trinkle23897 · 2022-03-06T15:04:22Z

Yeah, I mean we can replace this functionality with a simple import.

vwxyzjn · 2022-03-06T15:15:23Z

Yeah, I mean we can replace this functionality with a simple import.

XD didn't complete my message. I was writing but it can't be easily applied here because the logger has other utilities like save and resume data.

How about something like

    if args.logger == "wandb":
        logger = WandbLogger(
            save_interval=1,
            name=log_name,
            run_id=args.resume_id,
            config=args,
        )
    writer = SummaryWriter(log_path)
    writer.add_text("args", str(args))
    if args.logger == "wandb":
        logger.load(writer)

and in the logger.load we basically load the TensoboardLogger

vwxyzjn · 2022-03-06T18:08:00Z

Per conversation with @Trinkle23897, the latest code adopts the following style.

    if args.logger == "wandb":
        logger = WandbLogger(
            save_interval=1,
            name=f"{args.task}__{log_name}__{args.seed}__{int(time.time())}",
            run_id=args.resume_id,
            config=args,
        )
    writer = SummaryWriter(log_path)
    writer.add_text("args", str(args))
    if args.logger == "tensorboard":
        logger = TensorboardLogger(writer)
    if args.logger == "wandb":
        logger.load(writer)

https://wandb.ai/costa-huang/tianshou/runs/uktkei7h?workspace=user-costa-huang tracks this run.

codecov-commenter · 2022-03-06T21:31:30Z

Codecov Report

Merging #558 (ac68423) into master (2377f2f) will decrease coverage by 0.03%.
The diff coverage is 81.81%.

@@            Coverage Diff             @@
##           master     #558      +/-   ##
==========================================
- Coverage   93.88%   93.85%   -0.04%     
==========================================
  Files          64       64              
  Lines        4368     4376       +8     
==========================================
+ Hits         4101     4107       +6     
- Misses        267      269       +2

Flag	Coverage Δ
unittests	`93.85% <81.81%> (-0.04%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
tianshou/utils/logger/wandb.py	`51.92% <81.81%> (+4.19%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2377f2f...ac68423. Read the comment docs.

vwxyzjn

Everything looks good except I think we should add algo_name to the args variable. Also, do you have a tracked run?

examples/atari/atari_ppo.py

vwxyzjn · 2022-03-06T22:10:59Z

It works now.

vwxyzjn · 2022-03-06T22:11:23Z

All good on my end

* Use `global_step` as the x-axis for wandb * Use Tensorboard SummaryWritter as core with `wandb.init(..., sync_tensorboard=True)` * Update all atari examples with wandb Co-authored-by: Jiayi Weng <trinkle23897@gmail.com>

vwxyzjn added 3 commits March 5, 2022 13:57

Use global_step as the x-axis for wandb

33a87a8

Quick fix

c571a42

set a default wandb project

a895ea0

vwxyzjn added 2 commits March 5, 2022 20:17

Address comments

1c5db32

Test Changes

9b0ff2e

vwxyzjn added 6 commits March 6, 2022 11:36

New changes

8574edc

Quick change

57d5aa4

Add changes

feafe3b

Quick change

a30b58c

Added documentation

f9c7658

Fix CI

720c8a1

Trinkle23897 added 2 commits March 6, 2022 16:11

fix ci

3b698e7

Merge remote-tracking branch 'origin/master' into new-wandb

9e530df

add wandb in examples/atari

0721382

Trinkle23897 previously approved these changes Mar 6, 2022

View reviewed changes

vwxyzjn commented Mar 6, 2022

View reviewed changes

examples/atari/atari_ppo.py Outdated Show resolved Hide resolved

args.algo_name

ac68423

Trinkle23897 dismissed their stale review via ac68423 March 6, 2022 22:06

Trinkle23897 approved these changes Mar 6, 2022

View reviewed changes

Trinkle23897 merged commit df3d7f5 into thu-ml:master Mar 6, 2022

vwxyzjn deleted the new-wandb branch March 6, 2022 23:56

Trinkle23897 mentioned this pull request Mar 29, 2022

add write_flush in tflogger, fix argument passing in wandblogger #581

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use `global_step` as the x-axis for wandb #558

Use `global_step` as the x-axis for wandb #558

vwxyzjn commented Mar 5, 2022 •

edited

Trinkle23897 commented Mar 5, 2022 •

edited

vwxyzjn commented Mar 6, 2022 •

edited

Trinkle23897 commented Mar 6, 2022 •

edited

vwxyzjn commented Mar 6, 2022 •

edited

Trinkle23897 commented Mar 6, 2022

vwxyzjn commented Mar 6, 2022 •

edited by Trinkle23897

vwxyzjn commented Mar 6, 2022

codecov-commenter commented Mar 6, 2022 •

edited

vwxyzjn left a comment

vwxyzjn commented Mar 6, 2022

vwxyzjn commented Mar 6, 2022

Use global_step as the x-axis for wandb #558

Use global_step as the x-axis for wandb #558

Conversation

vwxyzjn commented Mar 5, 2022 • edited

Alternatives considered

Trinkle23897 commented Mar 5, 2022 • edited

vwxyzjn commented Mar 6, 2022 • edited

Trinkle23897 commented Mar 6, 2022 • edited

vwxyzjn commented Mar 6, 2022 • edited

Trinkle23897 commented Mar 6, 2022

vwxyzjn commented Mar 6, 2022 • edited by Trinkle23897

vwxyzjn commented Mar 6, 2022

codecov-commenter commented Mar 6, 2022 • edited

Codecov Report

vwxyzjn left a comment

Choose a reason for hiding this comment

vwxyzjn commented Mar 6, 2022

vwxyzjn commented Mar 6, 2022

Use `global_step` as the x-axis for wandb #558

Use `global_step` as the x-axis for wandb #558

vwxyzjn commented Mar 5, 2022 •

edited

Trinkle23897 commented Mar 5, 2022 •

edited

vwxyzjn commented Mar 6, 2022 •

edited

Trinkle23897 commented Mar 6, 2022 •

edited

vwxyzjn commented Mar 6, 2022 •

edited

vwxyzjn commented Mar 6, 2022 •

edited by Trinkle23897

codecov-commenter commented Mar 6, 2022 •

edited