description |
---|
Logging and visualizing experiments with neptune.ai |
The Neptune Experiment Tracker is an Experiment Tracker flavor provided with the Neptune-ZenML integration that uses neptune.ai to log and visualize information from your pipeline steps (e.g. models, parameters, metrics).
Neptune is a popular tool that you would normally use in the iterative ML experimentation phase to track and visualize experiment results or as a model registry for your production-ready models. Neptune can also track and visualize the results produced by your automated pipeline runs, as you make the transition towards a more production-oriented workflow.
You should use the Neptune Experiment Tracker:
- if you have already been using neptune.ai to track experiment results for your project and would like to continue doing so as you are incorporating MLOps workflows and best practices in your project through ZenML.
- if you are looking for a more visually interactive way of navigating the results produced from your ZenML pipeline runs (e.g. models, metrics, datasets)
- if you would like to connect ZenML to neptune.ai to share the artifacts and metrics logged by your pipelines with your team, organization, or external stakeholders
You should consider one of the other Experiment Tracker flavors if you have never worked with neptune.ai before and would rather use another experiment tracking tool that you are more familiar with.
The Neptune Experiment Tracker flavor is provided by the Neptune-ZenML integration. You need to install it on your local machine to be able to register the Neptune Experiment Tracker and add it to your stack:
zenml integration install neptune -y
The Neptune Experiment Tracker needs to be configured with the credentials required to connect to Neptune using an API token.
You need to configure the following credentials for authentication to Neptune:
api_token
: API key token of your Neptune account. You can create a free Neptune account here. If left blank, Neptune will attempt to retrieve the token from your environment variables.project
: The name of the project where you're sending the new run, in the form "workspace-name/project-name". If the project is not specified, Neptune will attempt to retrieve it from your environment variables.
{% tabs %} {% tab title="Basic Authentication" %} This option configures the credentials for neptune.ai directly as stack component attributes.
{% hint style="warning" %} This is not recommended for production settings as the credentials won't be stored securely and will be clearly visible in the stack configuration. {% endhint %}
# Register the Neptune experiment tracker
zenml experiment-tracker register neptune_experiment_tracker --flavor=neptune \
--project=<project_name> --api_token=<token>
# Register and set a stack with the new experiment tracker
zenml stack register custom_stack -e neptune_experiment_tracker ... --set
{% endtab %}
{% tab title="ZenML Secret (Recommended)" %} This method requires you to configure a ZenML secret to store the Neptune tracking service credentials securely.
You can create the secret using the zenml secret create
command:
zenml secret create neptune_secret \
--project=<PROJECT>
--api_token=<API_TOKEN>
Once the secret is created, you can use it to configure the neptune
Experiment Tracker:
# Reference the project and api-token in our experiment tracker component
zenml experiment-tracker register neptune_secret \
--flavor=neptune \
--project={{neptune_secret.project}} \
--api_token={{neptune_secret.api_token}}
...
{% hint style="info" %} Read more about ZenML Secrets in the ZenML documentation. {% endhint %} {% endtab %} {% endtabs %}
For more, up-to-date information on the Neptune Experiment Tracker implementation and its configuration, you can have a look at the SDK docs .
To log information from a ZenML pipeline step using the Neptune Experiment Tracker component in the active stack, you need to enable an experiment tracker using the @step
decorator. Then fetch the Neptune run object and use logging capabilities as you would normally do. For example:
import numpy as np
import tensorflow as tf
from neptune_tensorflow_keras import NeptuneCallback
from zenml.integrations.neptune.experiment_trackers.run_state import (
get_neptune_run,
)
from zenml import step
@step(experiment_tracker="<NEPTUNE_TRACKER_STACK_COMPONENT_NAME>")
def tf_trainer(
x_train: np.ndarray,
y_train: np.ndarray,
x_val: np.ndarray,
y_val: np.ndarray,
epochs: int = 5,
lr: float = 0.001
) -> tf.keras.Model:
...
neptune_run = get_neptune_run()
model.fit(
x_train,
y_train,
epochs=epochs,
validation_data=(x_val, y_val),
callbacks=[
NeptuneCallback(run=neptune_run),
],
)
metric = ...
neptune_run["<METRIC_NAME>"] = metric
{% hint style="info" %} Instead of hardcoding an experiment tracker name, you can also use the Client to dynamically use the experiment tracker of your active stack:
from zenml.client import Client
experiment_tracker = Client().active_stack.experiment_tracker
@step(experiment_tracker=experiment_tracker.name)
def tf_trainer(...):
...
{% endhint %}
You can pass a set of tags to the Neptune run by using the NeptuneExperimentTrackerSettings
class, like in the example below:
import numpy as np
import tensorflow as tf
from zenml import step
from zenml.integrations.neptune.experiment_trackers.run_state import (
get_neptune_run,
)
from zenml.integrations.neptune.flavors import NeptuneExperimentTrackerSettings
neptune_settings = NeptuneExperimentTrackerSettings(tags={"keras", "mnist"})
@step(
experiment_tracker="<NEPTUNE_TRACKER_STACK_COMPONENT_NAME>",
settings={
"experiment_tracker.neptune": neptune_settings
}
)
def my_step(
x_test: np.ndarray,
y_test: np.ndarray,
model: tf.keras.Model,
) -> float:
"""Log metadata to Neptune run"""
neptune_run = get_neptune_run()
...
Neptune comes with a web-based UI that you can use to find further details about your tracked experiments. Each pipeline run will be logged as a separate experiment run in Neptune, which you can inspect in the Neptune UI.
You can find the URL of the Neptune run linked to a specific ZenML run printed on the console whenever a Neptune run is initialized.
Check Neptune's docs for further information on how to use this integration and Neptune in general.