Adds flytekitplugin.wandb #2405

thomasjpfan · 2024-05-10T00:05:32Z

Tracking issue

Why are the changes needed?

This PR adds a wandb plugin to better integrate with Weights and Biases.

What changes were proposed in this pull request?

This PR proposes a simple wandb_init task decorator that gives the user full control on how the project, entity are named in Weights and Biases. If the id is not given, then the HOSTNAME is used for wandb's run id. Currently, HOSTNAME is set to {.executionName}-{.nodeID}-{.taskRetryAttempt}, which is unique to the node.

How was this patch tested?

This PR adds unit tests and docs to help users enable the feature.

For local testing, I've built an image with this plugin installed at ghcr.io/thomasjpfan/wandb:0.0.4 and ran:

Workflow contents

from flytekit import task, Secret, workflow

from flytekitplugins.wandb import wandb_init

WANDB_PROJECT = "flytekit-wandb-plugin"
WANDB_ENTITY = "username"
SECRET_KEY = "wandb-api-key"
SECRET_GROUP = "wandb-api-group"
wandb_secret = Secret(key=WANDB_SECRET_KEY, group=WANDB_SECRET_GROUP)


@task(
    container_image="ghcr.io/thomasjpfan/wandb:0.0.4",
    secret_requests=[wandb_secret],
)
@wandb_init(
    project=WANDB_PROJECT,
    entity=WANDB_ENTITY,
    secret=wandb_secret,
)
def train() -> float:
    from xgboost import XGBClassifier
    from wandb.integration.xgboost import WandbCallback
    from sklearn.datasets import load_iris
    from sklearn.model_selection import train_test_split

    import wandb

    X, y = load_iris(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
    bst = XGBClassifier(
        n_estimators=100,
        objective="binary:logistic",
        callbacks=[WandbCallback(log_model=True)],
    )
    bst.fit(X_train, y_train)

    test_score = bst.score(X_test, y_test)

    # Log custom metrics
    wandb.run.log({"test_score": test_score})
    return test_score


@workflow
def wf():
    train()

Check all the applicable boxes

I updated the documentation accordingly.
All new and existing tests passed.
All commits are signed-off.

Docs link

The associated flytesnacks docs are in: flyteorg/flytesnacks#1673

With multiple runs, this is what shows up in the Weights and Bias UI, the runs are ID with Flyte specific run_ids:

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

pingsutw · 2024-05-13T07:07:59Z

plugins/flytekit-wandb/README.md

+@wandb_init(
+    project=WANDB_PROJECT,
+    entity=WANDB_ENTITY,
+    secret_key=WANDB_SECRET_KEY,


Is it necessary to specify the secret key two times? (@task and @wandb_init)

I agree, you can simply use the one specified in the task. as it will be part of the context

I updated the PR to pass the secret object around:

wandb_secret = Secret(key=WANDB_SECRET_KEY, group=WANDB_SECRET_GROUP) @task( container_image=image, secret_requests=[wandb_secret], ) @wandb_init( project=WANDB_PROJECT, entity=WANDB_ENTITY, secret=wandb_secret, )

We can do something more implicit by pulling the secret request out of the task, but the wandb_init decorator still needs to know the name of the Secret associated with W&B. I prefer the more explicit approach of passing the Secret object around.

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

codecov · 2024-05-13T22:34:17Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 75.83%. Comparing base (edab1e3) to head (5445bc3).

❗ Current head 5445bc3 differs from pull request most recent head 7516fa3

Please upload reports for the commit 7516fa3 to get more accurate results.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #2405      +/-   ##
==========================================
- Coverage   75.85%   75.83%   -0.02%     
==========================================
  Files         181      181              
  Lines       18395    18393       -2     
  Branches     3601     3600       -1     
==========================================
- Hits        13953    13949       -4     
- Misses       3840     3841       +1     
- Partials      602      603       +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

kumare3 · 2024-05-14T04:52:11Z

This is incredible!

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

thomasjpfan · 2024-05-15T21:22:30Z

plugins/flytekit-wandb/README.md

+plugins:
+  logs:
+    dynamic-log-links:
+      - wandb-execution-id:
+          displayName: Weights & Biases
+          templateUris: '{{ .taskConfig.host }}/{{ .taskConfig.entity }}/{{ .taskConfig.project }}/runs/{{ .executionName }}-{{ .nodeId }}-{{ .taskRetryAttempt }}'
+      - wandb-custom-id:
+          displayName: Weights & Biases
+          templateUris: '{{ .taskConfig.host }}/{{ .taskConfig.entity }}/{{ .taskConfig.project }}/runs/{{ .taskConfig.id }}'
+```


Giving Weights and Biases two keys here to support:

Our auto-generated execution id -> Default unique id for Weights and Biases

User provided id -> useful for continue training.

wild-endeavor · 2024-05-15T22:09:29Z

plugins/flytekit-wandb/flytekitplugins/wandb/tracking.py

+from flytekit.core.context_manager import FlyteContextManager
+from flytekit.core.utils import ClassDecorator
+
+wandb = lazy_module("wandb")


why lazy here out of curiosity? wandb is a dependency of this plugin, so if the env can access this file shouldn't it also be able to access wandb?

I saw it as a pattern in the mlflow plugin:

flytekit/plugins/flytekit-mlflow/flytekitplugins/mlflow/tracking.py

Lines 9 to 12 in 76fb7c3

go = lazy_module("plotly.graph_objects")

plotly_subplots = lazy_module("plotly.subplots")

pd = lazy_module("pandas")

mlflow = lazy_module("mlflow")

Personally, I would go with a normal import.

For direct dependencies (which is the case, since wandb is listed in setup.py) we should go with a regular import.

wild-endeavor · 2024-05-15T22:11:31Z

plugins/flytekit-wandb/flytekitplugins/wandb/tracking.py

+
+    def __init__(
+        self,
+        task_function: Optional[Callable] = None,


in the future, do you want to also add P, R here?

Yea, f we want this decorator to type correctly with @task, we'll likely end up needing ParamSpec and TypeVar here.

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com> Signed-off-by: Jan Fiedler <jan@union.ai>

thomasjpfan added 3 commits May 8, 2024 19:42

Adds flytekitplugin.wandb

00ffdab

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

Adds code comment

09e7f08

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

Use HOSTNAME as run id

9f88a9b

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

thomasjpfan requested review from wild-endeavor, kumare3, eapolinario, pingsutw, cosmicBboy and samhita-alla as code owners May 10, 2024 00:05

thomasjpfan mentioned this pull request May 10, 2024

Adds wandb example flyteorg/flytesnacks#1673

Merged

thomasjpfan changed the title ~~Wandb feature v2~~ Adds flytekitplugin.wandb May 10, 2024

pingsutw reviewed May 13, 2024

View reviewed changes

Finish wandb before output

5445bc3

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

thomasjpfan added 2 commits May 14, 2024 10:26

Pass secret object to task and wandb_init

b0d70c4

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

Update docs

925f2a3

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

thomasjpfan commented May 15, 2024

View reviewed changes

wild-endeavor previously approved these changes May 15, 2024

View reviewed changes

Use direct import

56584c9

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

thomasjpfan dismissed wild-endeavor’s stale review via 56584c9 May 16, 2024 16:05

Merge remote-tracking branch 'upstream/master' into wandb_feature_v2

7516fa3

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

eapolinario approved these changes May 16, 2024

View reviewed changes

thomasjpfan merged commit 70332db into flyteorg:master May 16, 2024
45 of 46 checks passed

austin362667 pushed a commit to austin362667/flytekit that referenced this pull request May 21, 2024

Adds flytekitplugin.wandb (flyteorg#2405)

d3bfa35

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

austin362667 pushed a commit to austin362667/flytekit that referenced this pull request Jun 15, 2024

Adds flytekitplugin.wandb (flyteorg#2405)

a2743a5

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

austin362667 pushed a commit to austin362667/flytekit that referenced this pull request Jun 15, 2024

Adds flytekitplugin.wandb (flyteorg#2405)

6ffa5b5

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com>

thomasjpfan mentioned this pull request Jun 18, 2024

Adds support for wandb backlinks through FLYTE_EXECUTION_URL #2497

Merged

thomasjpfan mentioned this pull request Jul 3, 2024

Adds comet-ml plugin #2550

Merged

fiedlerNr9 pushed a commit that referenced this pull request Jul 25, 2024

Adds flytekitplugin.wandb (#2405)

94a2f72

Signed-off-by: Thomas J. Fan <thomasjpfan@gmail.com> Signed-off-by: Jan Fiedler <jan@union.ai>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds flytekitplugin.wandb #2405

Adds flytekitplugin.wandb #2405

thomasjpfan commented May 10, 2024 •

edited

Loading

pingsutw May 13, 2024

kumare3 May 14, 2024

thomasjpfan May 14, 2024

codecov bot commented May 13, 2024 •

edited

Loading

kumare3 commented May 14, 2024

thomasjpfan May 15, 2024

wild-endeavor May 15, 2024

thomasjpfan May 15, 2024

eapolinario May 15, 2024

wild-endeavor May 15, 2024

thomasjpfan May 15, 2024

	go = lazy_module("plotly.graph_objects")
	plotly_subplots = lazy_module("plotly.subplots")
	pd = lazy_module("pandas")
	mlflow = lazy_module("mlflow")

Adds flytekitplugin.wandb #2405

Adds flytekitplugin.wandb #2405

Conversation

thomasjpfan commented May 10, 2024 • edited Loading

Tracking issue

Why are the changes needed?

What changes were proposed in this pull request?

How was this patch tested?

Check all the applicable boxes

Docs link

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov bot commented May 13, 2024 • edited Loading

Codecov Report

kumare3 commented May 14, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

thomasjpfan commented May 10, 2024 •

edited

Loading

codecov bot commented May 13, 2024 •

edited

Loading