# Azure Machine Learning Data Explorer

## Before you start

You'll need the latest version of the **azure-ai-ml** package to run the code in this notebook. Run the cells below to install the required packages and verify that **azure-ai-ml** is installed.

> **Note**:
> If the **azure-ai-ml** package is not installed, run `pip install azure-ai-ml` to install it.

In [None]:
pip install azure-ai-ml azure-identity pandas mlflow==2.19.0 azureml-mlflow

In [1]:
pip show azure-ai-ml

Name: azure-ai-ml
Version: 1.26.0
Summary: Microsoft Azure Machine Learning Client Library for Python
Home-page: https://github.com/Azure/azure-sdk-for-python
Author: Microsoft Corporation
Author-email: azuresdkengsysadmins@microsoft.com
License: MIT License
Location: /anaconda/envs/jupyter_env/lib/python3.10/site-packages
Requires: azure-common, azure-core, azure-mgmt-core, azure-monitor-opentelemetry, azure-storage-blob, azure-storage-file-datalake, azure-storage-file-share, colorama, isodate, jsonschema, marshmallow, msrest, pydash, pyjwt, pyyaml, strictyaml, tqdm, typing-extensions
Required-by: 
Note: you may need to restart the kernel to use updated packages.


## Connect to your workspace

With the required SDK packages installed, you're ready to connect to your workspace.

To connect to a workspace, you need three identifiers: a subscription ID, a resource group name, and a workspace name. Because you're working on a compute instance managed by Azure Machine Learning, you can use the default values, which `MLClient.from_config` reads from a `config.json` file.
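
If you are not on a compute instance, the identifiers can be supplied explicitly instead. The following is a minimal sketch, assuming a `config.json` file with the keys `subscription_id`, `resource_group`, and `workspace_name`. The helper names `load_workspace_config` and `connect` are hypothetical and not part of the SDK.

```python
import json
from pathlib import Path

# The three identifiers a workspace connection needs.
REQUIRED_KEYS = ("subscription_id", "resource_group", "workspace_name")


def load_workspace_config(path="config.json"):
    """Read the workspace identifiers from a config file (hypothetical helper)."""
    config = json.loads(Path(path).read_text())
    missing = [key for key in REQUIRED_KEYS if key not in config]
    if missing:
        raise KeyError(f"config file is missing: {', '.join(missing)}")
    # Keep only the identifiers and ignore any other keys in the file.
    return {key: config[key] for key in REQUIRED_KEYS}


def connect(credential, config):
    """Build an MLClient from explicit identifiers (hypothetical helper)."""
    # Imported lazily so the loader above works without azure-ai-ml installed.
    from azure.ai.ml import MLClient

    return MLClient(
        credential=credential,
        subscription_id=config["subscription_id"],
        resource_group_name=config["resource_group"],
        workspace_name=config["workspace_name"],
    )
```

Calling `connect(credential, load_workspace_config())` should give a client similar to the one `MLClient.from_config` returns, provided the file contains valid values.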

In [2]:
from azure.identity import DefaultAzureCredential, InteractiveBrowserCredential
from azure.ai.ml import MLClient

try:
    credential = DefaultAzureCredential()
    # Check whether the credential can get a token successfully.
    credential.get_token("https://management.azure.com/.default")
except Exception:
    # Fall back to InteractiveBrowserCredential if DefaultAzureCredential does not work
    credential = InteractiveBrowserCredential()


In [3]:
# Get a handle to the workspace
ml_client = MLClient.from_config(credential=credential)

Found the config file in: /config.json


## List all datastores

In [4]:
stores = ml_client.datastores.list()
for store in stores:
    print(store.name)

blob_images_datastore
azureml_globaldatasets
blob_training_data
workspaceworkingdirectory
workspacefilestore
workspaceblobstore
workspaceartifactstore


In [5]:
stores = ml_client.datastores.list()
for store in stores:
    # Print the full details of the first datastore only
    print(store)
    break

account_name: myazuremlwseast229913804
container_name: images-data
credentials: {}
description: Blob Storage for images training data
endpoint: core.windows.net
id: /subscriptions/a647c11c-fe4c-43c4-b672-963b71adab36/resourceGroups/my-machine-learning-rg-eastus2-112024/providers/Microsoft.MachineLearningServices/workspaces/my-azure-ml-ws-eastus2-112024/datastores/blob_images_datastore
name: blob_images_datastore
protocol: https
tags: {}
type: azure_blob
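
The `id` field in the output above packs the subscription, resource group, workspace, and datastore name into one path. As a rough sketch, it can be split into its parts with plain string handling. The helper `parse_resource_id` is hypothetical, and it assumes the ID consists of alternating key/value segments, as in the output shown here.

```python
def parse_resource_id(resource_id):
    """Split a resource ID into a dict keyed by lowercased segment name."""
    parts = resource_id.strip("/").split("/")
    # Segments alternate between a key and its value, e.g. subscriptions/<id>.
    if len(parts) % 2 != 0:
        raise ValueError(f"unexpected resource ID format: {resource_id!r}")
    return {key.lower(): value for key, value in zip(parts[0::2], parts[1::2])}


# The datastore ID printed in the output above.
datastore_id = (
    "/subscriptions/a647c11c-fe4c-43c4-b672-963b71adab36"
    "/resourceGroups/my-machine-learning-rg-eastus2-112024"
    "/providers/Microsoft.MachineLearningServices"
    "/workspaces/my-azure-ml-ws-eastus2-112024"
    "/datastores/blob_images_datastore"
)

parsed = parse_resource_id(datastore_id)
print(parsed["resourcegroups"])
print(parsed["workspaces"])
print(parsed["datastores"])
```

Keys are lowercased because the casing of segment names such as `resourceGroups` varies between IDs and `azureml://` paths.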



## List all data assets and their references

In [6]:
datasets = ml_client.data.list()
for dataset in datasets:
    print(f"name: {dataset.name}, type: {dataset.type}")

name: diabetes-local, type: uri_file
name: diabetes-datastore-path, type: uri_folder
name: diabetes-table, type: mltable
name: diabetes-training, type: mltable
name: oj-training, type: mltable
name: dataset, type: uri_folder
name: diabetes-data, type: uri_file
name: bankmarketing-retrain-data, type: mltable
name: images-data-asset, type: uri_folder
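
With more than a handful of data assets, it can help to group the listing by asset type. The sketch below relies only on the `name` and `type` attributes used in the cell above. The helper `group_by_type` is hypothetical, and the `SimpleNamespace` objects are stand-ins for the items returned by `ml_client.data.list()`, so the example runs without a workspace.

```python
from collections import defaultdict
from types import SimpleNamespace


def group_by_type(assets):
    """Map each asset type to the names of the assets of that type."""
    grouped = defaultdict(list)
    for asset in assets:
        grouped[asset.type].append(asset.name)
    return dict(grouped)


# Stand-in objects mirroring part of the listing above (not real SDK entities).
sample_assets = [
    SimpleNamespace(name="diabetes-local", type="uri_file"),
    SimpleNamespace(name="diabetes-datastore-path", type="uri_folder"),
    SimpleNamespace(name="diabetes-table", type="mltable"),
    SimpleNamespace(name="diabetes-training", type="mltable"),
    SimpleNamespace(name="images-data-asset", type="uri_folder"),
]

for asset_type, names in sorted(group_by_type(sample_assets).items()):
    print(f"{asset_type}: {', '.join(names)}")
```

In the notebook, the same helper could be called as `group_by_type(ml_client.data.list())`.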


In [7]:
datasets = ml_client.data.list()

for dataset in datasets:
    print(f"dataset: {dataset}")
    data_asset_reference = ml_client.data.get(dataset.name, version=dataset.latest_version)
    print(f"data_asset_reference: {data_asset_reference}")
    print("------")
    # Uncomment the next line to stop after the first data asset
    # break

dataset: creation_context:
  created_at: '2024-11-17T11:40:58.186156+00:00'
  created_by: System Administrator
  created_by_type: User
  last_modified_at: '2024-11-17T11:40:58.273106+00:00'
latest_version: '1'
name: diabetes-local
properties: {}
tags: {}
type: uri_file

data_asset_reference: creation_context:
  created_at: '2024-11-17T11:40:58.246186+00:00'
  created_by: System Administrator
  created_by_type: User
  last_modified_at: '2024-11-17T11:40:58.255065+00:00'
description: Data asset pointing to a local file, automatically uploaded to the default
  datastore
id: /subscriptions/a647c11c-fe4c-43c4-b672-963b71adab36/resourceGroups/my-machine-learning-rg-eastus2-112024/providers/Microsoft.MachineLearningServices/workspaces/my-azure-ml-ws-eastus2-112024/data/diabetes-local/versions/1
name: diabetes-local
path: azureml://subscriptions/a647c11c-fe4c-43c4-b672-963b71adab36/resourcegroups/my-machine-learning-rg-eastus2-112024/workspaces/my-azure-ml-ws-eastus2-112024/datastores/workspac