title	titleSuffix	description	services	author	ms.author	ms.service	ms.subservice	ms.reviewer	ms.topic	ms.date	ms.custom
Inference data collection from models in production	Azure Machine Learning	Collect inference data from models deployed on Azure Machine Learning to monitor their performance in production.	machine-learning	msakande	mopeakande	machine-learning	mlops	alehughes	conceptual	04/15/2024	devplatv2, event-tier1-build-2023, build-2023

Data collection from models in production


In this article, you learn about data collection from models that are deployed to Azure Machine Learning online endpoints.

Azure Machine Learning Data collector provides real-time logging of input and output data from models that are deployed to managed online endpoints or Kubernetes online endpoints. Azure Machine Learning stores the logged inference data in Azure Blob Storage. You can then use this data for model monitoring, debugging, or auditing, which gives you observability into the performance of your deployed models.

Data collector provides:

Logging of inference data to a central location (Azure Blob Storage)
Support for managed online endpoints and Kubernetes online endpoints
Definition at the deployment level, so you can change the configuration for each deployment independently
Support for both payload and custom logging

Logging modes

Data collector provides two logging modes: payload logging and custom logging. Payload logging allows you to collect the HTTP request and response payload data from your deployed models. With custom logging, Azure Machine Learning provides you with a Python SDK for logging pandas DataFrames directly from your scoring script. Using the custom logging Python SDK, you can log model input and output data, in addition to data before, during, and after any data transformations (or preprocessing).
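To make the custom logging flow concrete, here's a minimal sketch of a scoring script that logs model inputs and outputs from inside `run()`. The `SketchCollector` class is a hypothetical in-memory stand-in written for this illustration; it isn't the Azure Machine Learning custom logging SDK. The collection names, the correlation ID, and the placeholder model are also assumptions. To stay dependency-free, the sketch passes rows as lists of dictionaries, whereas the SDK described above logs pandas DataFrames. The `init()` and `run()` functions follow the usual layout of an Azure Machine Learning scoring script.

```python
import uuid


class SketchCollector:
    """Hypothetical in-memory stand-in for a custom logging collector."""

    def __init__(self, name):
        self.name = name
        self.records = []  # a real collector would send data to Blob Storage

    def collect(self, rows, correlation_id=None):
        # Reuse the caller's ID so the inputs and outputs of one request
        # can be joined later; otherwise generate a new one.
        correlation_id = correlation_id or str(uuid.uuid4())
        for row in rows:
            self.records.append({"correlation_id": correlation_id, **row})
        return correlation_id


inputs_collector = None
outputs_collector = None


def init():
    # Runs once when the deployment starts: one collector per data stream.
    global inputs_collector, outputs_collector
    inputs_collector = SketchCollector("model_inputs")
    outputs_collector = SketchCollector("model_outputs")


def run(rows):
    # Log the inputs, score, then log the outputs under the same ID.
    correlation_id = inputs_collector.collect(rows)
    predictions = [{"prediction": row["x"] * 2} for row in rows]  # placeholder model
    outputs_collector.collect(predictions, correlation_id)
    return predictions


init()
result = run([{"x": 1}, {"x": 3}])
```

Because logging happens in your own code, you can call a collector at any point in `run()`, for example before and after a preprocessing step, which payload logging can't capture.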

Data collector configuration

Data collector is configured at the deployment level, and you specify the configuration at deployment time. You can configure the Azure Blob Storage destination that receives the collected data. You can also configure the sampling rate (from 0% to 100%) of the data to collect.
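To show what a percentage sampling rate means in practice, the following sketch decides per request whether to log it. It only illustrates random sampling in general and doesn't describe how Data collector implements sampling internally. The function name `should_collect` and its parameters are hypothetical.

```python
import random


def should_collect(sampling_rate_percent, rng):
    """Return True if this request should be logged at the given rate (0 to 100)."""
    if not 0 <= sampling_rate_percent <= 100:
        raise ValueError("sampling rate must be between 0 and 100")
    # rng.random() is uniform in [0, 1), so a rate of 0 never logs
    # and a rate of 100 always logs.
    return rng.random() * 100 < sampling_rate_percent


rng = random.Random(42)  # fixed seed so the sketch is repeatable
logged = sum(should_collect(25, rng) for _ in range(10_000))
fraction_logged = logged / 10_000  # close to 0.25 for a 25% rate
```

A lower sampling rate reduces the volume of data written to storage, at the cost of a less complete record of production traffic.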

Limitations

Data collector has the following limitations:

Data collector supports logging only for online (real-time) Azure Machine Learning endpoints (managed or Kubernetes).
The Data collector Python SDK only supports logging tabular data via pandas DataFrames.
