kedro-azureml
adds support for two new datasets that can be used in the Kedro catalog, the AzureMLFileDataSet
and the AzureMLPandasDataSet
which translate to File/Folder dataset and Tabular dataset respectively in Azure Machine Learning. Both fully support the Azure versioning mechanism and can be used in the same way as any other dataset in Kedro.
Apart from these, kedro-azureml
also adds the AzureMLPipelineDataSet
which is used to pass data between pipeline nodes when the pipeline is run on Azure ML and the pipeline_data_passing feature is enabled. By default, data is then saved and loaded using the PickleDataSet
as underlying dataset. Any other underlying dataset can be used instead by adding a AzureMLPipelineDataSet
to the catalog.
All of these can be found under the kedro_azureml.datasets module.
For details on usage, see the API Reference
below
kedro_azureml.datasets.AzureMLPandasDataSet
kedro_azureml.datasets.AzureMLFileDataSet
kedro_azureml.datasets.AzureMLPipelineDataSet