# Session 6: Environmental Justice & Vector Data

**Goal:** Analyze the environmental justice implications of data center siting using real demographic and historical data.

Datasets:
1.  **Data Centers**: Locations from Cloud Regions.
2.  **Social Vulnerability Index (SVI)**: The CDC's ranking of community vulnerability, reported by census tract.
3.  **Redlining (HOLC)**: Historical maps grading neighborhoods for mortgage risk.

Key concepts:
1.  **Spatial Joins**: Linking indices to infrastructure.
2.  **Historical Context**: Overlaying past policies on current infrastructure.

### Action Item 1: Setup Environment

> **Prompt your Agent:**
> "Import ibis, pandas, anymap, geopandas, and altair. Connect to a local DuckDB instance and load the spatial extension."

## 1. Load Data

We need to load three datasets from the cloud.

**URLs**:
- Data Centers (CSV): `https://s3-west.nrp-nautilus.io/public-datacenters/data_centers.csv`
- SVI 2020 (Parquet): `https://s3-west.nrp-nautilus.io/public-social-vulnerability/svi2020_us_tract.parquet`
- Redlining (Parquet): `https://s3-west.nrp-nautilus.io/public-mappinginequality/mappinginequality.parquet`

### Action Item 2: Load Datasets

> **Prompt your Agent:**
> "Load the Data Centers CSV, SVI Parquet, and Redlining Parquet files into Ibis tables using the provided URLs. For Data Centers, filter for 'United States' and create a geometry column from longitude/latitude. For SVI, filter out any rows where `RPL_THEMES` is negative (missing data)."
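
The two filters in this prompt can be sketched in plain Python, without a database, to make the intent concrete. This is a toy illustration, not the code your agent will produce: the data center column names (`country`, `longitude`, `latitude`) and all row values are assumptions, so check the real CSV header. The one grounded detail is that the CDC SVI files use `-999` as the sentinel for missing values, which is why negative `RPL_THEMES` rows are dropped.

```python
# Toy rows standing in for the real tables. Column names for the data
# centers CSV (country, longitude, latitude) and all values are made up.
data_centers = [
    {"name": "dc-1", "country": "United States", "longitude": -77.49, "latitude": 39.04},
    {"name": "dc-2", "country": "Ireland", "longitude": -6.26, "latitude": 53.35},
]
svi = [
    {"FIPS": "tract-1", "RPL_THEMES": 0.42},
    {"FIPS": "tract-2", "RPL_THEMES": -999.0},  # CDC sentinel for missing data
]

# Keep only US sites and attach a WKT point.
# Note the order: x is longitude, y is latitude.
us_centers = [
    {**row, "geom_wkt": f"POINT ({row['longitude']} {row['latitude']})"}
    for row in data_centers
    if row["country"] == "United States"
]

# Drop tracts whose overall ranking is a negative sentinel.
svi_valid = [row for row in svi if row["RPL_THEMES"] >= 0]
```

Swapping longitude and latitude when building the point is a common mistake; a spatial join on swapped coordinates typically returns no matches rather than an error.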

## 2. Visualize SVI and Redlining Layers

The SVI dataset contains an overall percentile ranking, `RPL_THEMES`, that runs from 0 to 1; higher values indicate higher vulnerability.

**PMTiles URLs**:
- SVI: `https://s3-west.nrp-nautilus.io/public-social-vulnerability/svi2020_us_tract.pmtiles`
- Redlining: `https://s3-west.nrp-nautilus.io/public-mappinginequality/mappinginequality.pmtiles`

### Action Item 3: Visualize with AnyMap

> **Prompt your Agent:**
> "Create an interactive map using `anymap`. Add the SVI PMTiles as a fill layer (color by `RPL_THEMES`), the Redlining PMTiles as a fill layer (color by `grade`), and the Data Centers as red points. Save the map as '02-ej-analysis-map.html'."

## 3. Spatial Join: Vulnerability at Data Center Locations

We want to find the SVI score of the tract where each data center is located.
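
Conceptually, this is a point-in-polygon join: each data center is a point, each tract is a polygon, and a point inherits the attributes of the polygon that contains it. The sketch below shows the idea with a hand-written ray-casting test on made-up square "tracts". It is only an illustration; a spatial engine such as DuckDB's spatial extension does the same job with indexed, robust geometry predicates.

```python
def point_in_polygon(x, y, ring):
    """Ray-casting test: True if (x, y) lies inside the ring of (x, y) vertices."""
    inside = False
    n = len(ring)
    for i in range(n):
        x1, y1 = ring[i]
        x2, y2 = ring[(i + 1) % n]
        # Does this edge straddle the horizontal line through the point?
        if (y1 > y) != (y2 > y):
            # x coordinate where the edge crosses that line
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


# Made-up tracts (unit-less squares) and data center points.
tracts = [
    {"FIPS": "tract-a", "RPL_THEMES": 0.8, "ring": [(0, 0), (2, 0), (2, 2), (0, 2)]},
    {"FIPS": "tract-b", "RPL_THEMES": 0.3, "ring": [(2, 0), (4, 0), (4, 2), (2, 2)]},
]
centers = [("dc-1", 1.0, 1.0), ("dc-2", 3.0, 0.5), ("dc-3", 9.0, 9.0)]

# Left join: every data center is kept, with None when no tract contains it.
joined = []
for name, x, y in centers:
    match = next((t for t in tracts if point_in_polygon(x, y, t["ring"])), None)
    joined.append(
        (name, match["FIPS"] if match else None, match["RPL_THEMES"] if match else None)
    )
```

Keeping unmatched points (a left join) makes it easy to see how many data centers fall outside every tract, which usually signals a coordinate or projection problem.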

### Action Item 4: Spatial Join & Analysis

> **Prompt your Agent:**
> "Perform a spatial join to find which SVI tract each Data Center falls into. Calculate the average SVI (`RPL_THEMES`) for data centers and compare it to the national average. Create an Altair chart comparing the distribution of SVI scores for Data Centers vs a sample of the national data."
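
The comparison itself is simple once the join has produced one SVI score per data center. The sketch below uses made-up scores purely to show the arithmetic; it says nothing about the real result. Because the two groups differ greatly in size, the distributions are compared as shares per bin rather than raw counts.

```python
from statistics import mean

# Made-up scores for illustration only; real values come from the join.
dc_scores = [0.5, 0.75, 0.75, 1.0]
national_scores = [0.125, 0.375, 0.625, 0.875]


def histogram(scores, bins=4):
    """Share of scores in each equal-width bin over [0, 1]."""
    counts = [0] * bins
    for s in scores:
        # A score of exactly 1.0 belongs in the last bin.
        counts[min(int(s * bins), bins - 1)] += 1
    return [c / len(scores) for c in counts]


dc_mean = mean(dc_scores)
national_mean = mean(national_scores)
difference = dc_mean - national_mean

dc_shares = histogram(dc_scores)
national_shares = histogram(national_scores)
```

In the chart, the same normalization applies: plot densities or shares for each group so that the much larger national sample does not visually swamp the data centers.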

## 4. Redlining Overlay

The HOLC redlining maps graded neighborhoods from 'A' ("Best", colored green) to 'D' ("Hazardous", colored red, which is the origin of the term "redlining").

### Action Item 5: Redlining Analysis

> **Prompt your Agent:**
> "Perform a spatial join between Data Centers and the Redlining data. Count how many data centers fall into each HOLC grade (A, B, C, D). Create a bar chart visualizing these counts."
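
Once each data center carries a grade (or nothing, if it lies outside every mapped neighborhood), the counting step is a group-by. The sketch below uses a hypothetical list of join results to show two details worth asking your agent for: reporting all four grades even when a count is zero, and tracking the unmatched sites separately.

```python
from collections import Counter

# Hypothetical join output: the HOLC grade each data center fell in,
# or None when the site lies outside every mapped neighborhood.
joined_grades = ["C", "D", None, "D", None, None, "B"]

counts = Counter(g for g in joined_grades if g is not None)

# Report all four grades, including those with zero matches.
grade_counts = {grade: counts.get(grade, 0) for grade in "ABCD"}

# Sites with no grade are not errors; HOLC maps cover only selected cities.
unmatched = sum(g is None for g in joined_grades)
```

Reporting the unmatched count alongside the bar chart keeps the result honest: a small number of graded sites out of a large total is itself a finding.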

## Discussion

How does the geography of the cloud align with the geography of inequality? Note that many data centers sit in newer suburbs (e.g., Ashburn, Virginia) that postdate the redlining maps, so overlaps may be limited to urban interconnection hubs.