# Total Project — Orchestration

_Last updated: 2025-07-30_


**Purpose:** Run the entire pipeline end-to-end (data → features → baseline → attention → eval).  
**Do not** put heavy logic here; call functions from `src/` modules.

### Sections to keep
1. Data preparation (calls functions from `src/data/*`)
2. Feature extraction (`src/vision/*`)
3. Train baseline (`src/models/*`, `src/train/*`)
4. Train attention model (`src/models/*`, `src/train/*`)
5. Evaluation & qualitative results (`src/eval/*`)
6. (Optional) Demo entry point
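
One way to keep the sections above thin is to register each one as a named stage and let a small runner call them in order. The sketch below is only an illustration: `run_stages` and the placeholder lambdas are hypothetical and do not exist in `src/`; in practice each callable would wrap a function imported from the matching `src/` module.

```python
import time
from typing import Callable, Dict, Iterable, Optional


def run_stages(
    stages: Dict[str, Callable[[], object]],
    only: Optional[Iterable[str]] = None,
) -> Dict[str, float]:
    """Run stages in insertion order and return elapsed seconds per stage."""
    selected = set(only) if only is not None else None
    timings: Dict[str, float] = {}
    for name, fn in stages.items():
        if selected is not None and name not in selected:
            continue  # skip stages that were not requested
        start = time.perf_counter()
        fn()
        timings[name] = time.perf_counter() - start
        print(f"[{name}] finished in {timings[name]:.2f}s")
    return timings


# Hypothetical placeholders: replace each lambda with a call into src/.
stages = {
    "data": lambda: None,
    "features": lambda: None,
    "baseline": lambda: None,
    "attention": lambda: None,
    "eval": lambda: None,
}

timings = run_stages(stages)
```

Passing `only=["eval"]` would re-run a single stage, which can help when iterating on one section without repeating the earlier ones.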

**Tip:** Keep cells short. Save plots and logs to files under `artifacts/`.
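
A minimal sketch of the tip above, using only the standard library. The helper names (`artifact_path`, `get_file_logger`) and the `logs/` and `plots/` subfolders are assumptions made for illustration, not part of the repo.

```python
import logging
from pathlib import Path

ARTIFACTS = Path("artifacts")  # relative to the repo root, as in the tip


def artifact_path(*parts: str) -> Path:
    """Return a path under artifacts/, creating parent folders as needed."""
    path = ARTIFACTS.joinpath(*parts)
    path.parent.mkdir(parents=True, exist_ok=True)
    return path


def get_file_logger(name: str, filename: str) -> logging.Logger:
    """Return a logger that writes to artifacts/logs/<filename>."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    if not logger.handlers:  # avoid duplicate handlers when a cell is re-run
        handler = logging.FileHandler(artifact_path("logs", filename))
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(message)s")
        )
        logger.addHandler(handler)
    return logger


log = get_file_logger("orchestration", "run.log")
log.info("pipeline started")

# With matplotlib, a figure could be saved the same way (assumed usage):
# fig.savefig(artifact_path("plots", "loss_curve.png"))
```

Routing every output through one helper keeps file locations consistent across cells, so results can be compared between runs without searching the notebook output.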


In [None]:

# Mount Drive (optional; only available inside Google Colab)
try:
    from google.colab import drive  # imported here so non-Colab runs do not crash
    drive.mount('/content/drive')
except Exception as e:
    print("Drive mount skipped or failed:", e)

# Clone or update the repo
import os, sys
REPO = "Image-Captioning-Team-P-"
URL  = "https://github.com/kushal1989/" + REPO + ".git"

# Clone on first run, then always sync the `integrate` branch
if not os.path.exists(f"/content/{REPO}"):
    !git clone $URL /content/$REPO
%cd /content/$REPO
!git fetch origin
!git checkout integrate
!git pull --rebase origin integrate

# Autoreload and path for src/
%load_ext autoreload
%autoreload 2
SRC = f"/content/{REPO}/src"
if SRC not in sys.path:  # avoid duplicate entries when the cell is re-run
    sys.path.append(SRC)

# Load shared config if present; cfg stays None when it is missing or invalid
cfg = None
try:
    import yaml, pprint
    with open('configs/project.yaml') as f:
        cfg = yaml.safe_load(f)
    print("Loaded config:")
    pprint.pprint(cfg)
except Exception as e:
    print("Config not found yet or YAML error:", e)


In [None]:
# Start coding below
