# Environment setup

In [1]:
import os  # noqa: I001
import sys
from pathlib import Path


# On Colab, download demo_utils.py from GitHub and execute it
if "COLAB_GPU" in os.environ:
    import urllib.request

    demo_utils_url = (
        "https://raw.githubusercontent.com/nics-tw/petsard/main/demo/demo_utils.py"
    )
    # Close the HTTP response explicitly once the source has been read
    with urllib.request.urlopen(demo_utils_url) as response:
        exec(response.read().decode("utf-8"))
else:
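    # Locally, search the current directory and up to 10 parent directories
    # for demo_utils.py
    for p in [Path.cwd()] + list(Path.cwd().parents)[:10]:
        utils_path = p / "demo_utils.py"
        if utils_path.exists() and "demo" in str(utils_path):
            sys.path.insert(0, str(p))
            # read_text() opens and closes the file for us
            exec(utils_path.read_text(encoding="utf-8"))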
            break

📂 Current working directory: demo/petsard-yaml/loader-yaml
✅ PETsARD demo_utils loaded. Use quick_setup() to initialize.
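
The local branch of the cell above walks from the working directory up through its parents until it finds `demo_utils.py`. A minimal sketch of that upward search as a reusable function follows. The name `find_upwards` and its `max_parents` parameter are hypothetical, not part of PETsARD or `demo_utils`, and the sketch leaves out the cell's extra check that `"demo"` appears in the path.

```python
from pathlib import Path
from typing import Optional


def find_upwards(filename: str, start: Path, max_parents: int = 10) -> Optional[Path]:
    """Return the nearest directory at or above `start` that contains `filename`.

    Checks `start` itself, then up to `max_parents` ancestors, nearest first.
    Returns None when no candidate directory contains the file.
    """
    # Path.parents is ordered from the immediate parent outward.
    candidates = [start, *list(start.parents)[:max_parents]]
    for directory in candidates:
        if (directory / filename).is_file():
            return directory
    return None
```

Calling `find_upwards("demo_utils.py", Path.cwd())` from inside `demo/petsard-yaml/loader-yaml` would be expected to return the `demo` directory if the file lives there, which is the directory the cell adds to `sys.path`.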


## Quick setup: benchmark://

In [2]:
from demo_utils import display_results, display_yaml_info, quick_setup  # noqa: I001
from petsard import Executor  # noqa: I001


is_colab, branch, yaml_path = quick_setup(
    yaml_file=[
        "loading-benchmark-dataset.yaml",
        "loading-benchmark-dataset-with-benchmark-schema.yaml",
    ],
    benchmark_data=[],
    petsard_branch="main",
)

✅ Changed working directory to demo: petsard/demo
   📁 Notebook location: demo/petsard-yaml/loader-yaml/
   🔍 YAML search priority: 
      1. demo/petsard-yaml/loader-yaml/
      2. demo/
   💾 Output files will be saved in: demo/
🚀 PETsARD v1.7.0
📅 2025-10-07 17:50:44 UTC+8
📁 Processing YAML files from subfolder: petsard-yaml/loader-yaml
✅ Found YAML (1/2): petsard/demo/petsard-yaml/loader-yaml/loading-benchmark-dataset.yaml
✅ Found YAML (2/2): petsard/demo/petsard-yaml/loader-yaml/loading-benchmark-dataset-with-benchmark-schema.yaml
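
The log above reports a YAML search priority: the notebook's own folder first, then `demo/`. The sketch below shows one way such a priority lookup could work. It is not the actual `quick_setup` implementation, and the helper name `resolve_in_priority` is hypothetical.

```python
from pathlib import Path
from typing import Iterable, List


def resolve_in_priority(names: Iterable[str], search_dirs: List[Path]) -> List[Path]:
    """Resolve each file name against `search_dirs`; the earliest directory wins."""
    resolved = []
    for name in names:
        for directory in search_dirs:
            candidate = directory / name
            if candidate.is_file():
                resolved.append(candidate)
                break
        else:
            # for-else: reached only when no directory contained the file.
            searched = [str(d) for d in search_dirs]
            raise FileNotFoundError(f"{name} not found in {searched}")
    return resolved
```

Failing loudly on a missing file is a design choice for this sketch; a setup helper could equally skip the file and print a warning.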


# Execution and Results

## Loading Benchmark Dataset

In [3]:
display_yaml_info(yaml_path[0])
# Named `executor` so the built-in exec() used in the first cell is not shadowed
executor = Executor(yaml_path[0])
executor.run()
display_results(executor.get_result())

📋 YAML Configuration Files / YAML 設定檔案

📄 File: loading-benchmark-dataset.yaml
📁 Path: petsard/demo/petsard-yaml/loader-yaml/loading-benchmark-dataset.yaml

⚙️ Configuration content / 設定內容:
----------------------------------------
Loader:
  load_benchmark:
    filepath: benchmark://adult-income

📊 Execution Results / 執行結果

[1] Loader[load_benchmark]
------------------------------------------------------------
📈 DataFrame: 48,842 rows × 15 columns
📋 Showing first 3 rows / 顯示前 3 行:

   age  workclass  fnlwgt   education  educational-num      marital-status         occupation relationship   race gender  capital-gain  capital-loss  hours-per-week native-country income
0   25    Private  226802        11th                7       Never-married  Machine-op-inspct    Own-child  Black   Male             0             0              40  United-States  <=50K
1   38    Private   89814     HS-grad                9  Married-civ-spouse    Farming-fishing      Husband  White   Male             0       
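
The `filepath` value in the configuration uses a `benchmark://` prefix followed by a dataset name. As an illustration of how such a reference can be split into its parts, here is a sketch built on the standard library's `urlsplit`. The function `parse_benchmark_uri` is hypothetical, and PETsARD's own parsing may work differently.

```python
from urllib.parse import urlsplit


def parse_benchmark_uri(uri: str) -> str:
    """Return the dataset name from a `benchmark://<name>` reference."""
    parts = urlsplit(uri)
    # urlsplit places the text after "//" in `netloc`.
    if parts.scheme != "benchmark" or not parts.netloc:
        raise ValueError(f"not a benchmark:// reference: {uri!r}")
    return parts.netloc
```

With this sketch, `parse_benchmark_uri("benchmark://adult-income")` yields the name `adult-income`, while an ordinary path such as `data/adult-income.csv` raises `ValueError`.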

## Loading Benchmark Dataset with Benchmark Schema

In [4]:
display_yaml_info(yaml_path[1])
# Named `executor` so the built-in exec() used in the first cell is not shadowed
executor = Executor(yaml_path[1])
executor.run()
display_results(executor.get_result())

📋 YAML Configuration Files / YAML 設定檔案

📄 File: loading-benchmark-dataset-with-benchmark-schema.yaml
📁 Path: petsard/demo/petsard-yaml/loader-yaml/loading-benchmark-dataset-with-benchmark-schema.yaml

⚙️ Configuration content / 設定內容:
----------------------------------------
Loader:
  load_benchmark_with_schema:
    filepath: benchmark://adult-income
    schema: benchmark://adult-income_schema

📊 Execution Results / 執行結果

[1] Loader[load_benchmark_with_schema]
------------------------------------------------------------
📈 DataFrame: 48,842 rows × 15 columns
📋 Showing first 3 rows / 顯示前 3 行:

   age  workclass  fnlwgt   education  educational-num      marital-status         occupation relationship   race gender  capital-gain  capital-loss  hours-per-week native-country income
0   25    Private  226802        11th                7       Never-married  Machine-op-inspct    Own-child  Black   Male             0             0              40  United-States  <=50K
1   38    Private   89814    