Azure Data Assets

kedro-azureml adds support for two new datasets that can be used in the Kedro catalog, the AzureMLFileDataSet and the AzureMLPandasDataSet which translate to File/Folder dataset and Tabular dataset respectively in Azure Machine Learning. Both fully support the Azure versioning mechanism and can be used in the same way as any other dataset in Kedro.

Apart from these, kedro-azureml also adds the AzureMLPipelineDataSet which is used to pass data between pipeline nodes when the pipeline is run on Azure ML and the pipeline_data_passing feature is enabled. By default, data is then saved and loaded using the PickleDataSet as underlying dataset. Any other underlying dataset can be used instead by adding a AzureMLPipelineDataSet to the catalog.

All of these can be found under the kedro_azureml.datasets module.

For details on usage, see the API Reference below

API Reference

kedro_azureml.datasets.AzureMLPandasDataSet

kedro_azureml.datasets.AzureMLFileDataSet

kedro_azureml.datasets.AzureMLPipelineDataSet

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

04_data_assets.rst

04_data_assets.rst

Azure Data Assets

API Reference

Files

04_data_assets.rst

Latest commit

History

04_data_assets.rst

File metadata and controls

Azure Data Assets

API Reference