# Environment Setup

This notebook initializes the infrastructure for the Retail Banking Data Pipeline. It defines the global parameters and the storage paths for the Medallion architecture, then creates the bronze, silver, and gold database schemas and the storage volume.

**Instructions**: Run this notebook once when setting up a new environment, and rerun it whenever the paths need to be reset.

### 1. Parameter Initialization

In [0]:
# Text widget holding the project name; it prefixes every schema name below.
dbutils.widgets.text("project_name", "dm_retail_fin")
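
The widget value is later interpolated into schema names, volume paths, and SQL statements, so it may be worth checking before use. The following is a minimal sketch, not part of the original notebook: `validate_project_name` and its naming rule (a lowercase letter followed by lowercase letters, digits, or underscores) are assumptions made for illustration.

```python
import re

# Hypothetical rule: a lowercase letter first, then lowercase letters, digits, or underscores.
_PROJECT_NAME_RE = re.compile(r"[a-z][a-z0-9_]*")


def validate_project_name(name: str) -> str:
    """Return the stripped name if it is safe to embed in identifiers, else raise ValueError."""
    candidate = name.strip()
    if not _PROJECT_NAME_RE.fullmatch(candidate):
        raise ValueError(f"Invalid project name: {name!r}")
    return candidate
```

In the notebook, this could wrap the widget read, for example `project_name = validate_project_name(dbutils.widgets.get("project_name"))`.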

### 2. Path Definitions

In [0]:
project_name = dbutils.widgets.get("project_name")
catalog = "workspace"
schema = f"{project_name}_bronze"  # the volume lives in the bronze schema
volume_name = "files"

# Root of the volume: /Volumes/<catalog>/<schema>/<volume>
volume_path = f"/Volumes/{catalog}/{schema}/{volume_name}"

paths = {
    # File locations inside the volume
    "raw_data": f"{volume_path}/raw",
    "archive_data": f"{volume_path}/archive",
    "checkpoints": f"{volume_path}/checkpoints",

    # Schema (database) names, one per Medallion layer
    "bronze_db": schema,
    "silver_db": f"{project_name}_silver",
    "gold_db": f"{project_name}_gold"
}
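
The path layout above can also be expressed as a pure function, which makes it easy to check outside Databricks. This is a sketch under assumptions: `build_paths` is a hypothetical helper, and its defaults simply mirror the `catalog` and `volume_name` values used in this notebook.

```python
def build_paths(project_name: str, catalog: str = "workspace", volume_name: str = "files") -> dict:
    """Build the volume paths and schema names for the three Medallion layers."""
    bronze_db = f"{project_name}_bronze"
    # The volume is assumed to live in the bronze schema, as in the cell above.
    volume_path = f"/Volumes/{catalog}/{bronze_db}/{volume_name}"
    return {
        "raw_data": f"{volume_path}/raw",
        "archive_data": f"{volume_path}/archive",
        "checkpoints": f"{volume_path}/checkpoints",
        "bronze_db": bronze_db,
        "silver_db": f"{project_name}_silver",
        "gold_db": f"{project_name}_gold",
    }


paths_example = build_paths("dm_retail_fin")
print(paths_example["raw_data"])  # /Volumes/workspace/dm_retail_fin_bronze/files/raw
```

Keeping the layout in one function means a change to the volume location only has to be made in one place.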

### 3. Database & Volume Creation

In [0]:
# Qualify every object with the catalog so it matches the /Volumes/{catalog}/... path above.
for layer_db in ("bronze_db", "silver_db", "gold_db"):
    spark.sql(f"CREATE DATABASE IF NOT EXISTS {catalog}.{paths[layer_db]}")

# The volume that holds the raw, archive, and checkpoint files lives in the bronze schema.
spark.sql(f"CREATE VOLUME IF NOT EXISTS {catalog}.{paths['bronze_db']}.{volume_name}")
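
To review the setup statements before running them, they can be generated as plain strings first. This is a minimal sketch, assuming a hypothetical helper `build_setup_statements`. It qualifies each name with the catalog so the objects line up with the `/Volumes/{catalog}/...` path, and it does not call Spark itself.

```python
def build_setup_statements(project_name: str, catalog: str = "workspace", volume_name: str = "files") -> list:
    """Return the DDL statements for the three layer schemas and the bronze volume."""
    statements = [
        f"CREATE DATABASE IF NOT EXISTS {catalog}.{project_name}_{layer}"
        for layer in ("bronze", "silver", "gold")
    ]
    # The volume is created last because it depends on the bronze schema existing.
    statements.append(
        f"CREATE VOLUME IF NOT EXISTS {catalog}.{project_name}_bronze.{volume_name}"
    )
    return statements


for statement in build_setup_statements("dm_retail_fin"):
    print(statement)
```

In the notebook, each returned string could then be passed to `spark.sql(...)` in order.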