# Baseline Model Training

Pretrain the baseline models **without** SWS compression.

Models:
- **LeNet-300-100** on MNIST
- **LeNet5** on MNIST
- **WRN-16-4** on CIFAR-10

All models are trained with `--seed 42` for reproducibility and saved to the `checkpoints/` directory.

---
## 1. LeNet-300-100 Training (MNIST)

In [None]:
!python run_sws.py \
    --preset lenet_300_100 \
    --pretrain-epochs 20 \
    --retrain-epochs 0 \
    --batch-size 128 \
    --num-workers 2 \
    --eval-every 5 \
    --run-name lenet300_baseline \
    --save-dir checkpoints \
    --seed 42

In [None]:
# Copy the checkpoint to the standard location; report success only if the copy worked
!cp checkpoints/lenet300_baseline/mnist_lenet_300_100_pre.pt checkpoints/mnist_lenet_300_100_pre.pt \
    && echo "✓ Saved to checkpoints/mnist_lenet_300_100_pre.pt"
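
The copy step can also be done in Python, which makes a missing checkpoint an explicit error rather than a shell message. This is a minimal sketch: `promote_checkpoint` is a hypothetical helper, not part of `run_sws.py`, and it assumes the `<save-dir>/<run-name>/<file>` layout seen in the cell above.

```python
import shutil
from pathlib import Path


def promote_checkpoint(run_name: str, filename: str, save_dir: str = "checkpoints") -> Path:
    """Copy <save_dir>/<run_name>/<filename> to <save_dir>/<filename>.

    Hypothetical helper; raises FileNotFoundError if the run produced no checkpoint.
    """
    src = Path(save_dir) / run_name / filename
    dst = Path(save_dir) / filename
    if not src.is_file():
        raise FileNotFoundError(f"expected checkpoint not found: {src}")
    shutil.copy2(src, dst)  # copy2 also preserves file timestamps
    return dst
```

For this section the call would be `promote_checkpoint("lenet300_baseline", "mnist_lenet_300_100_pre.pt")`.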

---
## 2. LeNet5 Training (MNIST)

In [None]:
!python run_sws.py \
    --preset lenet5 \
    --pretrain-epochs 20 \
    --retrain-epochs 0 \
    --batch-size 128 \
    --num-workers 2 \
    --eval-every 5 \
    --run-name lenet5_baseline \
    --save-dir checkpoints \
    --seed 42

In [None]:
# Copy the checkpoint to the standard location; report success only if the copy worked
!cp checkpoints/lenet5_baseline/mnist_lenet5_pre.pt checkpoints/mnist_lenet5_pre.pt \
    && echo "✓ Saved to checkpoints/mnist_lenet5_pre.pt"

---
## 3. WRN-16-4 Training (CIFAR-10)

In [None]:
!python run_sws.py \
    --preset wrn_16_4 \
    --pretrain-epochs 160 \
    --retrain-epochs 0 \
    --batch-size 128 \
    --num-workers 2 \
    --eval-every 20 \
    --optim-pre sgd \
    --run-name wrn16_baseline \
    --save-dir checkpoints \
    --seed 42

In [None]:
# Copy the checkpoint to the standard location; report success only if the copy worked
!cp checkpoints/wrn16_baseline/cifar10_wrn_16_4_pre.pt checkpoints/cifar10_wrn_16_4_pre.pt \
    && echo "✓ Saved to checkpoints/cifar10_wrn_16_4_pre.pt"
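
The three training cells differ only in preset, epoch count, evaluation interval, run name, and (for WRN-16-4) the pretraining optimizer. As a sketch, the commands could be assembled from one table so the differences are visible at a glance. `RUNS`, `COMMON`, and `build_command` are hypothetical names; only flags that already appear in the cells above are used, and the sketch assumes `run_sws.py` does not care about flag order.

```python
# Flags shared by all three baseline runs (values copied from the cells above).
COMMON = [
    "--retrain-epochs", "0",
    "--batch-size", "128",
    "--num-workers", "2",
    "--save-dir", "checkpoints",
    "--seed", "42",
]

# Per-model settings; "extra" holds flags used by only one run.
RUNS = [
    {"preset": "lenet_300_100", "epochs": 20, "eval_every": 5,
     "run_name": "lenet300_baseline", "extra": []},
    {"preset": "lenet5", "epochs": 20, "eval_every": 5,
     "run_name": "lenet5_baseline", "extra": []},
    {"preset": "wrn_16_4", "epochs": 160, "eval_every": 20,
     "run_name": "wrn16_baseline", "extra": ["--optim-pre", "sgd"]},
]


def build_command(run):
    """Return the argument list for one baseline run."""
    return [
        "python", "run_sws.py",
        "--preset", run["preset"],
        "--pretrain-epochs", str(run["epochs"]),
        "--eval-every", str(run["eval_every"]),
        "--run-name", run["run_name"],
        *run["extra"],
        *COMMON,
    ]
```

Each list can be passed to `subprocess.run(build_command(run), check=True)`, which raises if a training run exits with a non-zero status.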

---
## 4. Verify All Checkpoints

In [None]:
!echo "Pretrained checkpoints:"
!ls -lh checkpoints/*.pt
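
`ls` shows whatever `.pt` files exist, but it does not flag a missing one. A programmatic check is sketched below under the assumption that the three filenames listed in this notebook are the complete expected set. `EXPECTED` and `missing_checkpoints` are hypothetical names.

```python
from pathlib import Path

# Filenames taken from the copy cells in this notebook.
EXPECTED = [
    "mnist_lenet_300_100_pre.pt",
    "mnist_lenet5_pre.pt",
    "cifar10_wrn_16_4_pre.pt",
]


def missing_checkpoints(save_dir="checkpoints", expected=EXPECTED):
    """Return the expected files that are absent or empty in save_dir."""
    root = Path(save_dir)
    missing = []
    for name in expected:
        path = root / name
        # Treat a zero-byte file as missing: it usually means an interrupted write.
        if not path.is_file() or path.stat().st_size == 0:
            missing.append(name)
    return missing
```

An empty list from `missing_checkpoints()` means all three baselines are in place.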

---
## Summary

All baseline models are trained and saved to `checkpoints/`:
- `checkpoints/mnist_lenet_300_100_pre.pt`
- `checkpoints/mnist_lenet5_pre.pt`
- `checkpoints/cifar10_wrn_16_4_pre.pt`

Subsequent notebooks load these checkpoints as the starting point for the SWS compression experiments.