# Data Directory Preparation Notebook (`data_directory.ipynb`)

## Purpose

This notebook **organizes raw flight and weather datasets into a consistent project folder structure** under a local `Data/` directory.  
It copies each file from its raw location to a standardized path and renames it to a common `2016_<month>` pattern; file contents are not modified.
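As a rough illustration of the layout this notebook produces, the sketch below lists the destination paths implied by the flight and weather sections that follow. The helper name `expected_paths` and the constants `AIRPORTS` and `MONTHS` are hypothetical and do not appear in the notebook itself.

```python
import os

# Airports and months as listed in the weather and flight sections of this notebook.
AIRPORTS = ("ATL", "EWR", "JFK", "LAS", "LAX", "MCO", "MIA", "ORD", "SEA", "SFO")
MONTHS = (1, 2, 3)


def expected_paths(root="Data"):
    """Return the relative destination paths the notebook is expected to create."""
    flight = [os.path.join(root, "Flight_Data", f"2016_{m}.csv") for m in MONTHS]
    weather = [
        os.path.join(root, "Weather_Data", airport, f"2016_{m}.json")
        for airport in AIRPORTS
        for m in MONTHS
    ]
    return flight + weather


paths = expected_paths()
print(len(paths))  # 3 flight files + 10 airports * 3 months = 33
```

A list like this could also be compared against the file system after the copy cells run, to confirm that nothing is missing.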

---

### 1) Imports
- `os` for building paths and creating destination folders
- `shutil` for copying files

In [1]:
import os
import shutil

### 2) Flight data: copy Jan–Mar 2016 CSVs into `Data/Flight_Data/`

For each month in **[1, 2, 3]**, the cell:

1. Builds the **source** file path:
   - `<base>/On_Time_On_Time_Performance_2016_<month>/On_Time_On_Time_Performance_2016_<month>.csv`
2. Builds the **destination** path:
   - `Data/Flight_Data/2016_<month>.csv`
3. Creates the destination folder if it doesn’t exist
4. Copies the source file to the destination
5. Prints a confirmation for each copied file

In [2]:
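# Raw flight data location. This path is machine-specific; adjust it for your environment.
base = r"C:\Users\bgowt\OneDrive\Documents\AI_Eng_FV_assessment\AI_Eng_FV_assessment\flight_data"
months = [1, 2, 3]

for month in months:
    # Each month's CSV sits in a folder that shares its name.
    source = os.path.join(
        base,
        f"On_Time_On_Time_Performance_2016_{month}",
        f"On_Time_On_Time_Performance_2016_{month}.csv"
    )

    # Standardized destination: Data/Flight_Data/2016_<month>.csv
    destination = os.path.join("Data", "Flight_Data", f"2016_{month}.csv")

    # Create the destination folder if needed, then copy the file.
    os.makedirs(os.path.dirname(destination), exist_ok=True)
    shutil.copy(source, destination)
    print("Copied:", destination)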

Copied: Data\Flight_Data\2016_1.csv
Copied: Data\Flight_Data\2016_2.csv
Copied: Data\Flight_Data\2016_3.csv
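The cell above stops with an exception at the first source file that does not exist, because `shutil.copy` raises in that case. A minimal sketch of a more forgiving variant follows, assuming you would rather collect the missing sources and report them at the end. The helpers `copy_if_present` and `copy_all` are hypothetical names, not part of the notebook.

```python
import os
import shutil


def copy_if_present(source, destination):
    """Copy source to destination, returning False if the source file is absent."""
    if not os.path.isfile(source):
        return False
    parent = os.path.dirname(destination)
    if parent:  # os.makedirs("") would raise, so skip it for bare filenames
        os.makedirs(parent, exist_ok=True)
    shutil.copy(source, destination)
    return True


def copy_all(pairs):
    """Copy every (source, destination) pair and return the sources that were missing."""
    missing = []
    for source, destination in pairs:
        if copy_if_present(source, destination):
            print("Copied:", destination)
        else:
            missing.append(source)
    return missing
```

Calling `copy_all` with the `(source, destination)` pairs built in the loop would then return an empty list when every raw file was found.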

### 3) Weather data: copy JSON files for all airports into `Data/Weather_Data/<airport>/`

For each airport in `ATL, EWR, JFK, LAS, LAX, MCO, MIA, ORD, SEA, SFO` and each month in **1–3**, this cell:

1. Builds the **source** file path (the raw file name uses a hyphen):
   - `<base>/<airport>/2016-<month>.json`
2. Builds the **destination** path (the standardized name uses an underscore):
   - `Data/Weather_Data/<airport>/2016_<month>.json`
3. Creates the destination folder if it doesn’t exist
4. Copies the source file to the destination
5. Prints a confirmation for each copied file
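The source-to-destination mapping in the steps above can be sketched as a pure function, which makes the hyphen-to-underscore rename easy to check without touching the file system. The function name `weather_paths`, its `root` parameter, and the placeholder base `"weather_data"` are assumptions made for illustration.

```python
import os


def weather_paths(base, airport, month, root="Data"):
    """Return the (source, destination) pair for one airport and month."""
    # Raw files are named with a hyphen: 2016-<month>.json
    source = os.path.join(base, airport, f"2016-{month}.json")
    # Standardized files are named with an underscore: 2016_<month>.json
    destination = os.path.join(root, "Weather_Data", airport, f"2016_{month}.json")
    return source, destination


# "weather_data" is a placeholder base directory, not the path used in the notebook.
source, destination = weather_paths("weather_data", "ATL", 1)
print(source)
print(destination)
```

Keeping path construction separate from copying means the naming rules can be tested on their own.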

---

In [3]:
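# Raw weather data location. This path is machine-specific; adjust it for your environment.
base = r"C:\Users\bgowt\OneDrive\Documents\AI_Eng_FV_assessment\AI_Eng_FV_assessment\weather_data"
airports = ("ATL", "EWR", "JFK", "LAS", "LAX", "MCO", "MIA", "ORD", "SEA", "SFO")

for airport in airports:
    for month in range(1, 4):  # months 1 through 3
        # Raw name uses a hyphen; the standardized name uses an underscore.
        source = os.path.join(base, airport, f"2016-{month}.json")
        destination = os.path.join("Data", "Weather_Data", airport, f"2016_{month}.json")

        # Create the per-airport folder if needed, then copy the file.
        os.makedirs(os.path.dirname(destination), exist_ok=True)
        shutil.copy(source, destination)
        print("Copied:", destination)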

Copied: Data\Weather_Data\ATL\2016_1.json
Copied: Data\Weather_Data\ATL\2016_2.json
Copied: Data\Weather_Data\ATL\2016_3.json
Copied: Data\Weather_Data\EWR\2016_1.json
Copied: Data\Weather_Data\EWR\2016_2.json
Copied: Data\Weather_Data\EWR\2016_3.json
Copied: Data\Weather_Data\JFK\2016_1.json
Copied: Data\Weather_Data\JFK\2016_2.json
Copied: Data\Weather_Data\JFK\2016_3.json
Copied: Data\Weather_Data\LAS\2016_1.json
Copied: Data\Weather_Data\LAS\2016_2.json
Copied: Data\Weather_Data\LAS\2016_3.json
Copied: Data\Weather_Data\LAX\2016_1.json
Copied: Data\Weather_Data\LAX\2016_2.json
Copied: Data\Weather_Data\LAX\2016_3.json
Copied: Data\Weather_Data\MCO\2016_1.json
Copied: Data\Weather_Data\MCO\2016_2.json
Copied: Data\Weather_Data\MCO\2016_3.json
Copied: Data\Weather_Data\MIA\2016_1.json
Copied: Data\Weather_Data\MIA\2016_2.json
Copied: Data\Weather_Data\MIA\2016_3.json
Copied: Data\Weather_Data\ORD\2016_1.json
Copied: Data\Weather_Data\ORD\2016_2.json
Copied: Data\Weather_Data\ORD\2016