Skip to content

Latest commit

 

History

History
38 lines (26 loc) · 1.61 KB

04_data_assets.rst

File metadata and controls

38 lines (26 loc) · 1.61 KB

Azure Data Assets

kedro-azureml adds support for two new datasets that can be used in the Kedro catalog, the AzureMLFileDataSet and the AzureMLPandasDataSet which translate to File/Folder dataset and Tabular dataset respectively in Azure Machine Learning. Both fully support the Azure versioning mechanism and can be used in the same way as any other dataset in Kedro.

Apart from these, kedro-azureml also adds the AzureMLPipelineDataSet which is used to pass data between pipeline nodes when the pipeline is run on Azure ML and the pipeline_data_passing feature is enabled. By default, data is then saved and loaded using the PickleDataSet as underlying dataset. Any other underlying dataset can be used instead by adding a AzureMLPipelineDataSet to the catalog.

All of these can be found under the kedro_azureml.datasets module.

For details on usage, see the API Reference below

API Reference

kedro_azureml.datasets.AzureMLPandasDataSet


kedro_azureml.datasets.AzureMLFileDataSet


kedro_azureml.datasets.AzureMLPipelineDataSet