WandB batch metrics logging error #1290

ivan-chai · 2021-09-10T14:32:11Z

🐛 Bug Report

In wandb all batch metrics are logged as single value per epoch.

Expected behavior

Batch metrics must be logged once per step.

Catalyst version: 21.7

Additional context

The problem is here:

https://github.com/catalyst-team/catalyst/blob/master/catalyst/loggers/wandb.py#L115

Step must be equal to global_sample_step, not global_epoch_step.

github-actions · 2021-09-10T14:32:58Z

Hi! Thank you for your contribution! Please re-check all issue template checklists - unfilled issues would be closed automatically. And do not forget to join our slack for collaboration.

Scitator · 2021-09-10T14:40:28Z

maybe, @AyushExel could help 👀

AyushExel · 2021-09-11T10:18:56Z

@Scitator I'm out on a vacation. I'll have a look at this on monday. But it sounds like we might not have a good solution for this as W&B only has one global step.

AyushExel · 2021-09-14T13:57:30Z

@ivan-chai I remember working on this. the reason that the batch metrics are also logged on epoch level is that W&B supports 1 global step and you cannot edit or log at previous steps, so I decided to log everything on epoch level. There are other methods that will end up dropping data if any previous step is encountered.

ivan-chai · 2021-09-14T15:27:48Z

Thank you for your response! May be it is better to always log sample_step?

Scitator · 2021-09-14T19:03:30Z

yeah, I think, we could just use global_sample_step as a wandb step for all .log

AyushExel · 2021-09-15T11:47:57Z

@Scitator @ivan-chai Is the global_sample_step strictly increaseing? If it is, then yes we can use that

Scitator · 2021-09-20T06:04:26Z

Yes ;)
let's use it

Scitator · 2021-09-20T07:33:47Z

@AyushExel could you please drop a PR with such a small hotfix?
we are going to release 21.09 version soon ;)

AyushExel · 2021-09-20T09:21:43Z

@Scitator sounds good. I'll do it within 2 days

AyushExel · 2021-09-21T14:02:45Z

@Scitator I'm working on it now. I noticed that the logger API now supports artifacts as well. I'll add that too. I just need a bit of clarification on the usage.

    def log_artifact(
        self,
        tag: str,
        artifact: object = None,
        path_to_artifact: str = None,

here, artifact: object is an identifier for the artifact right? Or it can be an in-memory object?
I saw this example usage here

Scitator · 2021-09-21T17:56:41Z

yup, looks like so

artifact: object = None,  - in-memory object
path_to_artifact: str = None, - on-disk object

here is another example -

catalyst/catalyst/loggers/neptune.py

Line 281 in e1d78b7

if artifact is not None and path_to_artifact is not None:

Scitator · 2021-09-21T17:58:03Z

btw, @ditwoo why do we use the profiler in such a way? 😂

AyushExel · 2021-09-22T07:08:32Z

@Scitator in the 2nd example that you linked, it'll raise exception if both artifact and oath_to_artifact is not None. But in the profiler example, both of them are not None so it'll raise an exception. Is this an intended use case?
More specifically, how'd you preferred handling the cases when both of the arguments are set?

Scitator · 2021-09-23T04:34:14Z

@AyushExel yes, and I think it's correct behavior.
The profiler case looks a bit strange in such a case and I think, it's a little bug 😅
@ditwoo could you please help with the profiler initial logic?

AyushExel · 2021-09-24T09:16:34Z

Okay, thanks for the clarification. I'll update the logger with the intended artifacts use case.

AyushExel · 2021-09-29T14:11:05Z

@Scitator I have fixed this and added artifacts support. Can you provide an example training script using catalyst trainer which log artifacts? I want to check all the use cases before submitting the PR. Sorry, I couldn't find something relevant in the quickstart section of docs.

Scitator · 2021-09-29T15:08:31Z

@AyushExel could you please use https://github.com/neptune-ai/examples/blob/main/integrations-and-supported-tools/catalyst/scripts/Neptune_Catalyst_more_options.py ?

Scitator · 2021-09-29T15:09:01Z

btw, we are going to release another version tomorrow, so, the fix would truly welcome ;)

AyushExel · 2021-09-29T15:09:28Z

Just running tests. PR coming soon

ivan-chai added bug Something isn't working help wanted Extra attention is needed labels Sep 10, 2021

ivan-chai assigned bagxi, ditwoo and Scitator Sep 10, 2021

AyushExel mentioned this issue Sep 29, 2021

W&B: add artifacts support and fix logging steps #1309

Merged

10 tasks

Scitator closed this as completed Nov 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WandB batch metrics logging error #1290

WandB batch metrics logging error #1290

ivan-chai commented Sep 10, 2021

github-actions bot commented Sep 10, 2021

Scitator commented Sep 10, 2021

AyushExel commented Sep 11, 2021

AyushExel commented Sep 14, 2021

ivan-chai commented Sep 14, 2021

Scitator commented Sep 14, 2021

AyushExel commented Sep 15, 2021

Scitator commented Sep 20, 2021

Scitator commented Sep 20, 2021

AyushExel commented Sep 20, 2021

AyushExel commented Sep 21, 2021

Scitator commented Sep 21, 2021

Scitator commented Sep 21, 2021

AyushExel commented Sep 22, 2021

Scitator commented Sep 23, 2021

AyushExel commented Sep 24, 2021

AyushExel commented Sep 29, 2021

Scitator commented Sep 29, 2021

Scitator commented Sep 29, 2021

AyushExel commented Sep 29, 2021

WandB batch metrics logging error #1290

WandB batch metrics logging error #1290

Comments

ivan-chai commented Sep 10, 2021

🐛 Bug Report

Expected behavior

Additional context

github-actions bot commented Sep 10, 2021

Scitator commented Sep 10, 2021

AyushExel commented Sep 11, 2021

AyushExel commented Sep 14, 2021

ivan-chai commented Sep 14, 2021

Scitator commented Sep 14, 2021

AyushExel commented Sep 15, 2021

Scitator commented Sep 20, 2021

Scitator commented Sep 20, 2021

AyushExel commented Sep 20, 2021

AyushExel commented Sep 21, 2021

Scitator commented Sep 21, 2021

Scitator commented Sep 21, 2021

AyushExel commented Sep 22, 2021

Scitator commented Sep 23, 2021

AyushExel commented Sep 24, 2021

AyushExel commented Sep 29, 2021

Scitator commented Sep 29, 2021

Scitator commented Sep 29, 2021

AyushExel commented Sep 29, 2021