
Initial pass at wandb Ludwig integration #514

Merged
merged 20 commits into ludwig-ai:master on Feb 2, 2020

Conversation

vanpelt
Contributor

vanpelt commented Sep 6, 2019

This is an initial pass at a basic integration of wandb with Ludwig. It currently mirrors tensorboard events, stores hyperparameters, syncs artifacts, and stores evaluation results.
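
Roughly, the integration boils down to the following kinds of wandb calls (a simplified sketch with placeholder values, not the exact code in this PR):

import wandb

# Start a run and mirror TensorBoard events written during training.
wandb.init(sync_tensorboard=True)

# Store hyperparameters / the model definition.
wandb.config.update({"learning_rate": 0.001, "epochs": 10})

# Sync artifacts such as saved model files.
wandb.save("results/model/*")

# Store evaluation results.
wandb.log({"test_accuracy": 0.92})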

I'm using the Python "black" code formatter, which seems to be clashing with some of the existing code. Is there a preferred formatter to use?

@CLAassistant

CLAassistant commented Sep 6, 2019

CLA assistant check
All committers have signed the CLA.

@vanpelt
Contributor Author

vanpelt commented Sep 6, 2019

Just tried to sign the CLA and it's blank...

@w4nderlust
Collaborator

Just tried to sign the CLA and it's blank...

@briankhsieh can you please look into the CLA issue?

@w4nderlust
Collaborator

This is an initial pass at a basic integration of wandb with Ludwig. It currently mirrors tensorboard events, stores hyperparameters, syncs artifacts, and stores evaluation results.

I'm using the Python "black" code formatter, which seems to be clashing with some of the existing code. Is there a preferred formatter to use?

Thanks for this contribution, will look into it shortly.
As for the formatter, we usually use PyCharm's, with the line length set to 79.

@bhenhsi

bhenhsi commented Sep 6, 2019

It looks like cla-assistant.io is having intermittent issues. It worked for me this morning. Can you try again later?

@vanpelt
Contributor Author

vanpelt commented Sep 8, 2019

@briankhsieh got it this time, just took a little while to load.

@vanpelt
Contributor Author

vanpelt commented Sep 10, 2019

@w4nderlust I'm using VSCode, if you have any ideas on settings I'm all ears. I'll do some googling as well. Other than formatting, any thoughts on the pull?

@w4nderlust
Collaborator

@vanpelt sorry for the delay on this, was out for conferences. Will look into it by end of day Friday.

Comment on lines 213 to 214
save_csv(csv_filename.format(
output_field, output_type), values)
Collaborator

Please format this on 4 lines:

save_csv(
    csv_filename.format(output_field, output_type),
    values
)

del config["output_features"]
wandb.config.update(config)

def train_init(self, experiment_directory, experiment_name, model_name, resume, output_directory):
Collaborator

This looks longer than 80 columns, please split accordingly
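
For example, something like this stays under 80 columns (just a formatting sketch of the same signature):

def train_init(
        self,
        experiment_directory,
        experiment_name,
        model_name,
        resume,
        output_directory
):
    ...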

@w4nderlust
Collaborator

I'm really sorry for the long delay @vanpelt, I had some personal issues that kept me away from working on Ludwig; now I'm back. The PR looks good to me as a starting point, I just added a couple of formatting suggestions. Regarding VSCode, I don't use it, but you can likely set the linter to 80 columns somehow (not sure this is still valid, but https://stackoverflow.com/questions/29968499/vertical-rulers-in-visual-studio-code ). Do you prefer that I merge it now, or do you want to keep working on it and add the missing features?

@vanpelt
Contributor Author

vanpelt commented Oct 8, 2019

Hey @w4nderlust, no worries. Let me take a pass at getting formatting set up properly and address your comments. I'll have something shortly.

@vanpelt
Contributor Author

vanpelt commented Oct 8, 2019

OK @w4nderlust, I addressed your feedback. Let me know if I missed anything; I'm still seeing DeepSource issues. I'd love to get my linter/formatter set up correctly in VSCode but I'm not sure how best to do it.

@w4nderlust
Collaborator

I looked at the DeepSource complaints. They are mostly related to global and import.
Would doing the import more than once, in multiple threads, be a problem?
If not, maybe you could do the same thing I'm doing with the horovod import here: https://github.com/uber/ludwig/blob/master/ludwig/models/model.py#L91-L94 (assigning the imported library to an attribute on self, so that in the rest of the class you just call self.wandb).
It would look like:

try:
    import wandb
    wandb_obj = Wandb()
    setattr(wandb_obj, 'wandb', wandb)
    return wandb_obj
except ImportError:
    # wandb is not installed, skip the integration
    return None

What do you think?
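
For reference, here is a minimal sketch of that lazy-import pattern (the factory function and method names are hypothetical, not the PR's actual code):

class Wandb:
    def __init__(self, wandb_module):
        # Keep a reference so the rest of the class just calls self.wandb.
        self.wandb = wandb_module

    def train_init(self, experiment_name):
        self.wandb.init(project=experiment_name)


def create_wandb_callback():
    # Return the integration only when wandb is importable; otherwise skip it.
    try:
        import wandb
    except ImportError:
        return None
    return Wandb(wandb)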

@vanpelt
Contributor Author

vanpelt commented Oct 22, 2019

Sorry for the delay on this, I'll hopefully get to it this week.

@vanpelt
Contributor Author

vanpelt commented Nov 1, 2019

Ok @w4nderlust finally got to this. Let me know if you need any other changes!

@vanpelt
Contributor Author

vanpelt commented Nov 11, 2019

@w4nderlust Ping

@vanpelt
Contributor Author

vanpelt commented Nov 18, 2019

Hey @w4nderlust any updates here?

@w4nderlust
Collaborator

Checked it out, LGTM. I will test it later this week and, if everything works as expected, will merge. Please give me some instructions for testing, like an example command or something like that ;)

@vanpelt
Contributor Author

vanpelt commented Nov 19, 2019

Awesome! To test it, create a wandb account here: https://app.wandb.ai, then run wandb login on your machine. Then just run a Ludwig job with the --wandb flag.

@vanpelt
Contributor Author

vanpelt commented Dec 9, 2019

@w4nderlust any updates here?

@vanpelt
Contributor Author

vanpelt commented Dec 17, 2019

@w4nderlust I'm sure the holidays are hectic. Would love to get this in, do you have an ETA?

@w4nderlust
Collaborator

Hey @w4nderlust I'm having a colleague try the TF2 integration and add a test. Hopefully we'll have something next week.

Thank you, sounds good! Consider that the TF2 integration is not there yet; the test would be there to avoid problems when we do it.

dsblank added a commit to dsblank/ludwig that referenced this pull request Jan 17, 2020
I missed an injection point added in ludwig-ai#514
@borisdayma
Contributor

The comments should now be addressed.
I just need to do a few manual tests and add an integration test for wandb.

Quick questions:

  • We currently rely on syncing tensorboard data but it seems to be missing validation/test logs. Is that correct? If so we will just use the train_epoch_end method to log them to W&B
  • It seems our predict_end is never called. I guess it is not used during training and only for inference on trained models. If so then we will remove the call to predict_end and you don't need to add it to the docs @dsblank
  • DeepSource does not like a few things and also wants to convert back the methods to static since we don't use self. Can we ignore it?

@w4nderlust
Collaborator

The comments should now be addressed.
I just need to do a few manual tests and add an integration test for wandb.

Thank you!

Quick questions:

  • We currently rely on syncing tensorboard data but it seems to be missing validation/test logs. Is that correct? If so we will just use the train_epoch_end method to log them to W&B

Yes. Currently the TensorBoard output contains only the batch-level measures (which cover only the training set) and none of the epoch-level measures we display in the CLI. There is an open PR for that, #571, which will be merged soon.

  • It seems our predict_end is never called. I guess it is not used during training and only for inference on trained models. If so then we will remove the call to predict_end and you don't need to add it to the docs @dsblank

That's fine. If other integrations need it we will add it, but if you don't need it we can do without it for now.

  • DeepSource does not like a few things and also wants to convert back the methods to static since we don't use self. Can we ignore it?

It's fine to ignore them, in particular if they are warnings. I would say try to address as many as you can, but only do it if it makes sense. The static-method complaint, for instance, can be ignored.

@borisdayma
Contributor

Except for formatting issues everything should be working.

Here is an example run: https://app.wandb.ai/borisd13/ludwig_mnist/runs/qzu4b5qq?workspace=user-borisd13

Whenever the other metrics are added to TensorBoard, they should automatically be synced with W&B.

Let me know if I need to change anything!

resume, output_directory):
import wandb
logger.info("wandb.train_init() called...")
wandb.init(project=os.getenv("WANDB_PROJECT", experiment_name),
Collaborator

I would add name=model_name to the init parameters.

Contributor

Good idea. I was not aware initially of what model_name was used for, but now it makes sense to use it as the W&B run name.
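
With name=model_name added, the init call would look roughly like this (a sketch based on the snippet above, with any other existing arguments left unchanged):

wandb.init(
    project=os.getenv("WANDB_PROJECT", experiment_name),
    name=model_name
)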

@w4nderlust
Collaborator

@borisdayma I tested it and everything looks fine. I added just one little comment about the model name, as at the moment one gets names that are auto-generated by wandb, I believe, like "dandy-donkey-2". Once that is done, I'll merge.

@borisdayma
Contributor

I added it and the runs are now named per model_name, see example:
https://app.wandb.ai/borisd13/ludwig_mnist/runs/vsf46v66?workspace=user-borisd13

Once PR #571 is merged, the visualization will be more useful as validation metrics should also be logged.

@w4nderlust
Collaborator

@borisdayma thank you, I will merge that one first, test again, and then merge this. I don't expect any problems, but I just want to be sure.

w4nderlust merged commit e8af86c into ludwig-ai:master Feb 2, 2020
@w4nderlust
Collaborator

Thanks a lot for your work and your patience. Sorry it took me longer than I expected to get to this, but in the end we merged it :)

@borisdayma
Contributor

Thanks, I'll be playing with it!

w4nderlust changed the title from "Initial pass at ludwig integration" to "Initial pass atwandb ludwig integration" Apr 2, 2020
w4nderlust changed the title from "Initial pass atwandb ludwig integration" to "Initial pass at wandb Ludwig integration" Apr 7, 2020
@CharlyWargnier

Thanks for that great integration guys! Is it now fully working?

@w4nderlust
Collaborator

Yes!
