# SmolLM Training Setup - Colab Enterprise

This notebook sets up and runs SmolLM training on Colab Enterprise with an A100 80GB GPU.


## Step 1: Install Dependencies


In [None]:
# Install core dependencies
!pip install -q torch transformers datasets accelerate wandb pyyaml einops tqdm

# Install Flash Attention (requires CUDA)
!pip install -q flash-attn --no-build-isolation

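# Verify GPU (fail early if the runtime has no CUDA device)
import torch
assert torch.cuda.is_available(), "No CUDA GPU detected; check the runtime type"
print(f"GPU: {torch.cuda.get_device_name(0)}")
print(f"VRAM: {torch.cuda.get_device_properties(0).total_memory / 1e9:.1f} GB")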


## Step 2: Mount Google Drive (for checkpoints)


In [None]:
from google.colab import drive
drive.mount('/content/drive')

# Create checkpoint directory
!mkdir -p /content/drive/MyDrive/smol-lm-checkpoints
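
If the Drive mount is skipped or fails, the checkpoint path above will not point at Drive. The following is a minimal sketch of a fallback, assuming a hypothetical helper `resolve_checkpoint_dir` and a hypothetical local path `/content/checkpoints`; neither comes from the smolLM repo.

```python
from pathlib import Path

def resolve_checkpoint_dir(drive_root="/content/drive/MyDrive",
                           name="smol-lm-checkpoints",
                           fallback_root="/content/checkpoints"):
    """Return a checkpoint directory, preferring Google Drive.

    Hypothetical helper: falls back to local disk when the Drive
    root is not present (for example, when the mount was skipped).
    """
    drive_root = Path(drive_root)
    # Use Drive only if the mount point actually exists.
    base = drive_root if drive_root.is_dir() else Path(fallback_root)
    ckpt_dir = base / name
    ckpt_dir.mkdir(parents=True, exist_ok=True)
    return ckpt_dir
```

Calling `resolve_checkpoint_dir()` with no arguments returns the Drive directory created above when Drive is mounted. The local fallback lives on the runtime's disk, so anything saved there is lost when the runtime is deleted.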


## Step 3: Clone Repository


In [None]:
# Clone your repo
!git clone https://github.com/kalyaannnn/smolLM.git
%cd smolLM
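
Re-running the clone cell fails once the `smolLM` directory exists, because `git clone` refuses to write into a non-empty directory. One way to make the step safe to re-run is sketched below; `clone_if_missing` is a hypothetical helper, not something the repo provides.

```python
import subprocess
from pathlib import Path

def clone_if_missing(url, dest):
    """Clone `url` into `dest` unless `dest` already exists.

    Returns True if a clone was performed, False if it was skipped.
    """
    if Path(dest).exists():
        return False
    # check=True raises CalledProcessError if git exits non-zero.
    subprocess.run(["git", "clone", url, str(dest)], check=True)
    return True
```

In this notebook the call would be `clone_if_missing("https://github.com/kalyaannnn/smolLM.git", "smolLM")`, run from `/content` before changing into the directory.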


In [None]:
# The previous cell already changed into smolLM; make the repo importable
import os, sys
if os.getcwd() not in sys.path:
    sys.path.append(os.getcwd())

## Step 4: Run Sanity Checks


In [None]:
# Run sanity checks
!python sanity_check.py
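
A `!` shell command does not raise a Python exception when the command exits with a non-zero status, so a failed check will not stop "Run all". The sketch below shows one way to fail fast. It assumes `sanity_check.py` signals failure through its exit code, which this notebook does not confirm, and `run_checked` is a hypothetical helper.

```python
import subprocess
import sys

def run_checked(args):
    """Run a command and raise CalledProcessError if it exits non-zero."""
    result = subprocess.run(args, check=True)
    return result.returncode
```

For example, `run_checked([sys.executable, "sanity_check.py"])` would stop notebook execution at this cell if the script reports a failure.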


## Step 5: Setup W&B (Optional)


In [None]:
# Log in to W&B
# Get your API key from: https://wandb.ai/authorize
!wandb login
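
Since this step is optional, a run without credentials may need W&B switched to offline mode. The sketch below does that through the `WANDB_MODE` environment variable, which the `wandb` library reads. It assumes the key, if any, is supplied via `WANDB_API_KEY` rather than a prior `wandb login`. `configure_wandb_mode` is a hypothetical helper, and whether the training script needs anything more is not confirmed here.

```python
import os

def configure_wandb_mode(env=None):
    """Set WANDB_MODE to "offline" when no API key is in the environment.

    Assumption: credentials come from WANDB_API_KEY. An existing
    WANDB_MODE value is left untouched.
    """
    env = os.environ if env is None else env
    if not env.get("WANDB_API_KEY"):
        env.setdefault("WANDB_MODE", "offline")
    # "online" is wandb's default when WANDB_MODE is unset.
    return env.get("WANDB_MODE", "online")
```

Calling `configure_wandb_mode()` before the training cell affects the training process too, since commands started with `!` inherit the kernel's environment.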


## Step 6: Start Training


In [None]:
# Start pretraining
# This cell blocks until training finishes - check W&B for progress

!python train_pretrain.py --config configs/pretrain_160m.yaml
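
A wrong config path only shows up once the training script has started. As a rough sketch, the command can be built in Python with the path checked first. `build_train_command` is a hypothetical helper, and only the `--config` flag used above is assumed to exist.

```python
import sys
from pathlib import Path

def build_train_command(config_path, script="train_pretrain.py"):
    """Return the argv list for the pretraining run.

    Raises FileNotFoundError if the config file does not exist.
    """
    config = Path(config_path)
    if not config.is_file():
        raise FileNotFoundError(f"Config not found: {config}")
    # sys.executable keeps the run on the notebook's Python interpreter.
    return [sys.executable, script, "--config", str(config)]
```

The resulting list can be passed to `subprocess.run(..., check=True)`, for example with `build_train_command("configs/pretrain_160m.yaml")`, so that a failed run also raises in the notebook.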
