Scenario 3: Multiple data scientists working on multiple ML models <br>
MLflow setup: <br>

Tracking server: yes, remote server (EC2). <br>
Backend store: PostgreSQL database. <br>
Artifact store: S3 bucket. <br>

The experiments can be explored by opening the remote tracking server's UI in a browser.

The example uses AWS to host the remote server, so you need an AWS account to run it. Follow the steps in the file mlflow_on_aws.md to create a new AWS account and launch the tracking server.

In [1]:
import mlflow
import os

# Fill in with your AWS profile. More info:
# https://docs.aws.amazon.com/sdk-for-java/latest/developer-guide/setup.html#setup-credentials
# os.environ["AWS_PROFILE"] = "personal_gmail"

TRACKING_SERVER_HOST = "ec2-34-227-25-167.compute-1.amazonaws.com" # fill in with the public DNS of the EC2 instance
mlflow.set_tracking_uri(f"http://{TRACKING_SERVER_HOST}:5000")

In [2]:
mlflow.get_tracking_uri()

'http://ec2-34-227-25-167.compute-1.amazonaws.com:5000'
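A common slip when filling in `TRACKING_SERVER_HOST` is pasting the host with a scheme, a port, or a trailing slash already attached. The sketch below is one possible way to normalize such input into the URI form shown above. `build_tracking_uri` is a hypothetical helper, not part of MLflow, and the default port 5000 is taken from this notebook's setup.

```python
from urllib.parse import urlsplit


def build_tracking_uri(host: str, port: int = 5000, scheme: str = "http") -> str:
    """Hypothetical helper: turn a bare or pasted host into '<scheme>://<host>:<port>'."""
    host = host.strip()
    # urlsplit only fills in .hostname when a '//' separator is present,
    # so add one if the caller passed a bare DNS name.
    parts = urlsplit(host if "://" in host else f"//{host}")
    if not parts.hostname:
        raise ValueError(f"could not find a host name in {host!r}")
    # Prefer a scheme or port that was pasted along with the host, else use the defaults.
    return f"{parts.scheme or scheme}://{parts.hostname}:{parts.port or port}"


print(build_tracking_uri("ec2-34-227-25-167.compute-1.amazonaws.com"))
```

The result can be passed directly to `mlflow.set_tracking_uri`.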

In [3]:
mlflow.search_experiments()

[<Experiment: artifact_location='s3://mlflow-artifact-tracking-bucket/1', creation_time=1750181853566, experiment_id='1', last_update_time=1750181853566, lifecycle_stage='active', name='my-experiment-1', tags={}>,
 <Experiment: artifact_location='s3://mlflow-artifact-tracking-bucket/0', creation_time=1750181149745, experiment_id='0', last_update_time=1750181149745, lifecycle_stage='active', name='Default', tags={}>]
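The raw `Experiment` repr above is hard to scan, and `creation_time` is given in milliseconds since the Unix epoch. As a minimal sketch, the listing can be reduced to a few readable fields. `summarize_experiments` is a hypothetical helper, and the `SimpleNamespace` objects are stand-ins that copy the values printed above so the snippet runs without a tracking server; with a live server you would pass `mlflow.search_experiments()` instead.

```python
from datetime import datetime, timezone
from types import SimpleNamespace


def summarize_experiments(experiments):
    """Hypothetical helper: return one dict per experiment with a readable timestamp."""
    rows = []
    for exp in experiments:
        # creation_time is milliseconds since the Unix epoch; convert to a UTC datetime.
        created = datetime.fromtimestamp(exp.creation_time / 1000, tz=timezone.utc)
        rows.append(
            {
                "experiment_id": exp.experiment_id,
                "name": exp.name,
                "artifact_location": exp.artifact_location,
                "created_utc": created,
            }
        )
    return rows


# Stand-ins mirroring the output above (assumption: real Experiment objects
# expose the same attribute names, as their repr suggests).
experiments = [
    SimpleNamespace(
        experiment_id="1",
        name="my-experiment-1",
        artifact_location="s3://mlflow-artifact-tracking-bucket/1",
        creation_time=1750181853566,
    ),
    SimpleNamespace(
        experiment_id="0",
        name="Default",
        artifact_location="s3://mlflow-artifact-tracking-bucket/0",
        creation_time=1750181149745,
    ),
]

rows = summarize_experiments(experiments)
for row in rows:
    print(row["experiment_id"], row["name"], row["artifact_location"], row["created_utc"].isoformat())
```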

In [4]:
from sklearn.linear_model import LogisticRegression
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score

mlflow.set_experiment("my-experiment-1")

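with mlflow.start_run():
    X, y = load_iris(return_X_y=True)

    # Log the hyperparameters with the run.
    params = {"C": 0.1, "random_state": 42}
    mlflow.log_params(params)

    # Fit the model and log its accuracy on the training data (no held-out split here).
    lr = LogisticRegression(**params).fit(X, y)
    y_pred = lr.predict(X)
    mlflow.log_metric("accuracy", accuracy_score(y, y_pred))

    # Save the fitted model under the run's artifact location (the S3 bucket).
    mlflow.sklearn.log_model(lr, artifact_path="models")
    print(f"default artifacts URI: '{mlflow.get_artifact_uri()}'")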



default artifacts URI: 's3://mlflow-artifact-tracking-bucket/1/08199fccf74546b29a7f16f196f44de9/artifacts'
🏃 View run stately-sheep-925 at: http://ec2-34-227-25-167.compute-1.amazonaws.com:5000/#/experiments/1/runs/08199fccf74546b29a7f16f196f44de9
🧪 View experiment at: http://ec2-34-227-25-167.compute-1.amazonaws.com:5000/#/experiments/1
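In the output above, the artifact URI appears to follow the pattern `s3://<bucket>/<experiment_id>/<run_id>/artifacts`: the `1` matches the experiment ID and the hex string matches the run ID in the run URL. The sketch below splits a URI on that assumption, which can help when locating a run's files directly in the bucket. `split_artifact_uri` is a hypothetical helper, and the layout is inferred from this notebook's output only; other server configurations may differ.

```python
from urllib.parse import urlsplit


def split_artifact_uri(uri: str) -> dict:
    """Hypothetical helper: split 's3://<bucket>/.../<experiment_id>/<run_id>/artifacts'."""
    parts = urlsplit(uri)
    segments = [s for s in parts.path.split("/") if s]
    # Assumed layout: the last three path segments are experiment ID, run ID, 'artifacts'.
    if parts.scheme != "s3" or len(segments) < 3 or segments[-1] != "artifacts":
        raise ValueError(f"unexpected artifact URI layout: {uri!r}")
    experiment_id, run_id = segments[-3], segments[-2]
    return {"bucket": parts.netloc, "experiment_id": experiment_id, "run_id": run_id}


info = split_artifact_uri(
    "s3://mlflow-artifact-tracking-bucket/1/08199fccf74546b29a7f16f196f44de9/artifacts"
)
print(info)
```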


In [None]:
mlflow.search_experiments()