# Environment setting

In [1]:
import os  # noqa: I001
import sys
from pathlib import Path


# Load demo_utils.py: download it when running on Colab, otherwise find it locally
if "COLAB_GPU" in os.environ:
    import urllib.request

    demo_utils_url = (
        "https://raw.githubusercontent.com/nics-tw/petsard/main/demo/demo_utils.py"
    )
    exec(urllib.request.urlopen(demo_utils_url).read().decode("utf-8"))
else:
    # Search the current directory and up to 10 parent directories for demo_utils.py
    for p in [Path.cwd()] + list(Path.cwd().parents)[:10]:
        utils_path = p / "demo_utils.py"
        if utils_path.exists() and "demo" in str(utils_path):
            sys.path.insert(0, str(p))
            # read_text closes the file handle and makes the encoding explicit
            exec(utils_path.read_text(encoding="utf-8"))
            break

📂 Current working directory: demo/petsard-yaml/evaluator-yaml
✅ PETsARD demo_utils loaded. Use quick_setup() to initialize.


## Quick setup: Evaluation YAML - mpUCCs Privacy Risk Assessment

In [2]:
from demo_utils import display_results, display_yaml_info, quick_setup  # noqa: I001
from petsard import Executor  # noqa: I001


is_colab, branch, yaml_path = quick_setup(
    config_file=[
        "privacy-mpuccs.yaml",
    ],
    benchmark_data=None,
    petsard_branch="main",
)

✅ Changed working directory to demo: petsard/demo
   📁 Notebook location: demo/petsard-yaml/evaluator-yaml/
   🔍 YAML search priority: 
      1. demo/petsard-yaml/evaluator-yaml/
      2. demo/
   💾 Output files will be saved in: demo/
🚀 PETsARD v1.8.0
📅 2025-10-18 19:48:36 UTC+8
🔧 Added to Python path: petsard/demo/petsard-yaml/evaluator-yaml
📁 Processing configuration files from subfolder: petsard-yaml/evaluator-yaml
✅ Found configuration (1/1): petsard/demo/petsard-yaml/evaluator-yaml/privacy-mpuccs.yaml


# Execution and Result

## mpUCCs (Maximal Partial Unique Column Combinations)

In [3]:
display_yaml_info(yaml_path[0])
exec_now = Executor(yaml_path[0])
exec_now.run()
display_results(exec_now.get_result())

📋 YAML Configuration Files / YAML 設定檔案

📄 File: privacy-mpuccs.yaml
📁 Path: petsard/demo/petsard-yaml/evaluator-yaml/privacy-mpuccs.yaml

⚙️ Configuration content / 設定內容:
----------------------------------------
---
Splitter:
  external_split:
    method: custom_data
    filepath:
      ori: benchmark://adult-income_ori
      control: benchmark://adult-income_control
    schema:
      ori: benchmark://adult-income_schema
      control: benchmark://adult-income_schema
Synthesizer:
  external_data:
    method: custom_data
    filepath: benchmark://adult-income_syn
    schema: benchmark://adult-income_schema
Evaluator:
  mpuccs_assessment:
    method: mpuccs
    max_baseline_cols: 2        # Maximum columns to evaluate (default: null = all columns)
    min_entropy_delta: 0.01     # Minimum entropy gain threshold (default: 0.0)
    field_decay_factor: 0.5     # Field combination weighting decay (default: 0.5)
    renyi_alpha: 2.0            # Rényi entropy parameter (default: 2.0)
    nume

Field 15/15 - Processing capital-loss combinations: 100%|██████████| 120/120 [00:31<00:00,  3.76combo/s] 
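
The `renyi_alpha: 2.0` setting in the configuration above selects the order of the Rényi entropy. As background, the sketch below computes the Rényi entropy of a single column's empirical distribution using the textbook definition, H_α = log2(Σ p_i^α) / (1 − α); with α = 2 this is the collision entropy. The function name `renyi_entropy` and the toy inputs are hypothetical, and this is not taken from PETsARD's implementation, which may normalize or combine entropies differently.

```python
import math
from collections import Counter


def renyi_entropy(values, alpha=2.0):
    """Rényi entropy (in bits) of the empirical distribution of `values`.

    Illustrative helper only; not part of the PETsARD API.
    """
    counts = Counter(values)
    total = sum(counts.values())
    probs = [c / total for c in counts.values()]
    if alpha == 1.0:
        # The limit alpha -> 1 is the Shannon entropy.
        return -sum(p * math.log2(p) for p in probs)
    return math.log2(sum(p ** alpha for p in probs)) / (1.0 - alpha)


# Four equally likely values: the maximum entropy for four categories.
uniform = renyi_entropy(["a", "b", "c", "d"])
# One dominant value: lower entropy, so the column separates records less well.
skewed = renyi_entropy(["a", "a", "a", "b"])
print(uniform, skewed)
```

A column with higher entropy spreads records across more distinct values, which is why an entropy-based threshold such as `min_entropy_delta` can plausibly be used to skip fields that add little discriminating power.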


📊 Execution Results / 執行結果

[1] Splitter[external_split_[1-1]]_Synthesizer[external_data]_Evaluator[mpuccs_assessment]
------------------------------------------------------------
📦 Dictionary with 3 keys / 包含 3 個鍵的字典

  • global: DataFrame (1 rows × 7 columns)
    📋 Showing first 1 rows / 顯示前 1 行:
       privacy_risk_score  overall_protection  main_protection  baseline_protection  identification_rate  total_identified  total_syn_records
    0            0.066337            0.933663         0.933663                  1.0             0.066337              2592              39073
    📝 Columns / 欄位: privacy_risk_score, overall_protection, main_protection, baseline_protection, identification_rate, total_identified, total_syn_records
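
The figures in the `global` row above appear to be related by simple arithmetic: `identification_rate` matches `total_identified / total_syn_records`, and `overall_protection` matches `1 - privacy_risk_score`. The sketch below only checks that relationship against the reported numbers; it is inferred from this output and is not a statement of how PETsARD computes the metrics internally.

```python
# Values copied from the `global` result row.
total_identified = 2592
total_syn_records = 39073

# Share of synthetic records flagged as identifying an original record.
identification_rate = total_identified / total_syn_records
# Complement of the rate; matches the reported overall_protection.
overall_protection = 1.0 - identification_rate

print(round(identification_rate, 6))
print(round(overall_protection, 6))
```

The rounded values agree with the reported `0.066337` and `0.933663`, and the 2,592 identified records correspond to the 2,592 rows of the `details` table.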

  • details: DataFrame (2,592 rows × 7 columns)
    📋 Showing first 3 rows / 顯示前 3 行:
      risk_level  syn_idx  ori_idx  combo_size  field_combo value_combo  baseline_protection
    0       high        3    12608           1  ('fnlwgt',)   (208608,)          