
[FR] Automatically split metrics, tags, etc into smaller chunks to avoid request limit error #6049

Closed
1 of 20 tasks
nzw0301 opened this issue Jun 12, 2022 · 2 comments · Fixed by #6052
Labels
area/tracking (Tracking service, tracking client APIs, autologging), enhancement (New feature or request)

Comments

@nzw0301
Contributor

nzw0301 commented Jun 12, 2022

Willingness to contribute

Yes. I would be willing to contribute this feature with guidance from the MLflow community.

Proposal Summary

As the title says, MLflow could avoid request-limit errors by automatically splitting data that contains too many elements into smaller chunks when we call mlflow.log_metrics, mlflow.log_params, and mlflow.set_tags (and possibly other APIs, too).

Motivation

What is the use case for this feature?

Consider the mlflow.log_metrics case. We would like to save many metrics at once as follows:

import mlflow
import mlflow.utils.validation  # MAX_METRICS_PER_BATCH lives in this module


metrics = {f"param_{i}": i for i in range(mlflow.utils.validation.MAX_METRICS_PER_BATCH + 1)}

# Log a batch of metrics (one more entry than the per-batch limit allows)
with mlflow.start_run():
    mlflow.log_metrics(metrics)

However, MLflow raises the following error:

File /opt/homebrew/Caskroom/miniconda/base/envs/optuna/lib/python3.9/site-packages/mlflow/utils/validation.py:296, in _validate_batch_limit(entity_name, limit, length)
    290 if length > limit:
    291     error_msg = (
    292         "A batch logging request can contain at most {limit} {name}. "
    293         "Got {count} {name}. Please split up {name} across multiple requests and try "
    294         "again."
    295     ).format(name=entity_name, count=length, limit=limit)
--> 296     raise MlflowException(error_msg, error_code=INVALID_PARAMETER_VALUE)

MlflowException: A batch logging request can contain at most 1000 metrics. Got 1001 metrics. Please split up metrics across multiple requests and try again.

Why is this use case valuable to support for MLflow users in general?

To avoid this, users currently have to split the metrics into smaller batches themselves, for example:

from itertools import islice

import mlflow
import mlflow.utils.validation  # MAX_METRICS_PER_BATCH lives in this module


metrics = {f"param_{i}": i for i in range(mlflow.utils.validation.MAX_METRICS_PER_BATCH + 1)}


# Log a batch of metrics, splitting it into chunks of at most MAX_METRICS_PER_BATCH entries
with mlflow.start_run():
    if len(metrics) > mlflow.utils.validation.MAX_METRICS_PER_BATCH:
        it = iter(metrics)
        for _ in range(0, len(metrics), mlflow.utils.validation.MAX_METRICS_PER_BATCH):
            sub_metrics = {k: metrics[k] for k in islice(it, mlflow.utils.validation.MAX_METRICS_PER_BATCH)}
            mlflow.log_metrics(sub_metrics)
    else:
        mlflow.log_metrics(metrics)

I suppose this splitting could be handled inside mlflow.log_metrics itself. That way, users would not need to care about the number of elements in metrics. A rough sketch of what this could look like is shown below.
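For illustration, here is a minimal sketch of how such automatic chunking could be built on top of the public API. The helper names (_chunk_dict, log_metrics_in_chunks) are hypothetical and chosen only for this example; the actual change may look different.

from itertools import islice

import mlflow
import mlflow.utils.validation


def _chunk_dict(d, chunk_size):
    # Yield successive sub-dicts with at most `chunk_size` items each (hypothetical helper).
    it = iter(d)
    for _ in range(0, len(d), chunk_size):
        yield {k: d[k] for k in islice(it, chunk_size)}


def log_metrics_in_chunks(metrics, step=None):
    # Illustration only: split an oversized batch so that each request stays within
    # MAX_METRICS_PER_BATCH, which is roughly what mlflow.log_metrics could do internally.
    for chunk in _chunk_dict(metrics, mlflow.utils.validation.MAX_METRICS_PER_BATCH):
        mlflow.log_metrics(chunk, step=step)


# Example usage: 2500 metrics are logged in three requests instead of failing.
with mlflow.start_run():
    log_metrics_in_chunks({f"metric_{i}": float(i) for i in range(2500)})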

Why is this use case valuable to support for your project(s) or organization?

I'm from the Optuna community, a black-box optimisation framework library. Optuna provides an MLflow callback that saves optimisation results through the MLflow API. If this feature request is handled on the MLflow side, we would not need the changes added in optuna/optuna#3651.

Why is it currently difficult to achieve this use case?

N/A

Details

No response

What component(s) does this bug affect?

  • area/artifacts: Artifact stores and artifact logging
  • area/build: Build and test infrastructure for MLflow
  • area/docs: MLflow documentation pages
  • area/examples: Example code
  • area/model-registry: Model Registry service, APIs, and the fluent client calls for Model Registry
  • area/models: MLmodel format, model serialization/deserialization, flavors
  • area/projects: MLproject format, project running backends
  • area/scoring: MLflow Model server, model deployment tools, Spark UDFs
  • area/server-infra: MLflow Tracking server backend
  • area/tracking: Tracking Service, tracking client APIs, autologging

What interface(s) does this bug affect?

  • area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
  • area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
  • area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
  • area/windows: Windows support

What language(s) does this bug affect?

  • language/r: R APIs and clients
  • language/java: Java APIs and clients
  • language/new: Proposals for new client languages

What integration(s) does this bug affect?

  • integrations/azure: Azure and Azure ML integrations
  • integrations/sagemaker: SageMaker integrations
  • integrations/databricks: Databricks integrations
@nzw0301 nzw0301 added the enhancement (New feature or request) label on Jun 12, 2022
@github-actions github-actions bot added the area/tracking (Tracking service, tracking client APIs, autologging) label on Jun 12, 2022
@dbczumar
Collaborator

@nzw0301 This is an excellent idea, and we would be excited to review a PR that implements this capability. Thank you in advance for your contribution!

@nzw0301
Contributor Author

nzw0301 commented Jun 13, 2022

@dbczumar Thank you for your comments! I've sent a PR to resolve this issue as linked above.

@BenWilson2 BenWilson2 added this to the MLflow Roadmap milestone Jun 16, 2022