A personal PyTorch framework for training and evaluating deep learning models on image classification benchmarks. Built to be extended with new architectures over time.
This repository structure and implementation logic are based on the Deep Learning Tutorial by the Sabancı University (SU) Intelligent Systems Lab.
| Model | Flag | Dataset | Notes |
|---|---|---|---|
| MLP | `--model mlp` | MNIST, CIFAR-10 | Configurable depth, ReLU/GELU, BatchNorm, Dropout |
| CNN | `--model cnn` | MNIST, CIFAR-10 | LeNet-style (MNIST) / SimpleCNN with Kaiming init (CIFAR-10) |
| VGG | `--model vgg` | CIFAR-10 | VGG-11/13/16/19 with BatchNorm |
| ResNet | `--model resnet` | CIFAR-10 | Configurable blocks (default: ResNet-18) |
| ResNet-18 pretrained | `--transfer_mode resizeFreeze` | CIFAR-10 | ImageNet weights, resize to 224, frozen backbone, FC only |
| ResNet-18 pretrained | `--transfer_mode modifyFinetune` | CIFAR-10 | ImageNet weights, adapted conv1 for 32×32, full fine-tune |
| MobileNetV2 | `--model mobilenet` | CIFAR-10 | Inverted residuals, stride-1 stem for 32×32, Kaiming init |
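Several of the models above (SimpleCNN, MobileNetV2) use Kaiming initialization. As a hedged sketch of how that is typically wired up in PyTorch (the repo's actual init code may differ in mode or nonlinearity):

```python
import torch
import torch.nn as nn

def kaiming_init(module: nn.Module) -> None:
    """He (Kaiming) initialization for conv and linear layers."""
    if isinstance(module, (nn.Conv2d, nn.Linear)):
        nn.init.kaiming_normal_(module.weight, mode="fan_out", nonlinearity="relu")
        if module.bias is not None:
            nn.init.zeros_(module.bias)

# Toy network for illustration only (not a model from this repo)
net = nn.Sequential(nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.Flatten(), nn.Linear(16, 10))
net.apply(kaiming_init)  # .apply() visits every submodule recursively
```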
- Multi-dataset: MNIST and CIFAR-10 (auto-downloaded)
- Training utilities: Adam optimizer, L1 + L2 regularization, early stopping
- LR schedulers: StepLR, CosineAnnealingLR
- Reproducibility: global seed for `random`, `numpy`, `torch`, and `cudnn`
- GPU support: CUDA / MPS / CPU auto-detection (`--device auto`)
- Plotting (`--plot`): saves training curves and confusion matrix to `plots/`
- Structured logger (`--log`): formatted epoch table saved to `logs/`
- Transfer learning: ResNet-18 pretrained with freeze or full fine-tune modes
- Knowledge distillation (`--distill`, `--distill_mode`): Hinton KD (soft + hard loss with temperature scaling) or teacher_prob (dynamic label smoothing)
- CIFAR-10-C robustness evaluation (`--test_cifar10c`): tests the model across all 19 corruption types × 5 severity levels; saves a bar chart and heatmap when `--plot` is set
- AugMix training (`--augmix`): repeats fine-tuning with AugMix augmentation plus a Jensen-Shannon consistency loss (CE(clean) + λ·JSD(clean, aug1, aug2)); saves to a separate checkpoint (`best_model_augmix.pth`) to preserve the vanilla model
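The reproducibility feature above seeds four RNG sources. A minimal sketch of what such a global-seed helper usually looks like (the repo's exact implementation may differ):

```python
import random
import numpy as np
import torch

def set_seed(seed: int = 42) -> None:
    """Seed Python, NumPy, PyTorch, and cuDNN for repeatable runs."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)           # safe no-op on CPU-only machines
    torch.backends.cudnn.deterministic = True  # repeatable conv algorithms
    torch.backends.cudnn.benchmark = False     # disable autotuner nondeterminism
```

Note that cuDNN determinism trades some speed for bit-exact repeatability across runs on the same hardware.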
```bash
git clone https://github.com/Onurcn93/Deep-Learning.git
cd Deep-Learning
pip install -r requirements.txt
```

For GPU (CUDA 12.x):

```bash
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124
```

Run a quick first experiment:

```bash
python main.py --mode both --dataset mnist --model mlp
```

| Argument | Default | Description |
|---|---|---|
| `--mode` | `both` | `train`, `test`, or `both` |
| `--dataset` | `mnist` | `mnist` or `cifar10` |
| `--model` | `mlp` | `mlp`, `cnn`, `vgg`, `resnet`, `mobilenet` |
| `--device` | `auto` | `auto`, `cuda`, `mps`, or `cpu` |
| `--transfer_mode` | `none` | `none`, `resizeFreeze`, `modifyFinetune` |
| `--log` / `--no-log` | `True` | Save training log to `logs/` |
| `--epochs` | `10` | Number of training epochs |
| `--lr` | `1e-3` | Learning rate |
| `--batch_size` | `64` | Mini-batch size |
| `--scheduler` | `step` | `step`, `cosine`, or `none` |
| `--warmup_epochs` | `0` | Linear LR warmup epochs before cosine decay (0 = disabled) |
| `--patience` | `0` | Early stopping patience (0 = disabled) |
| `--weight_decay` | `1e-4` | L2 regularization coefficient |
| `--l1_lambda` | `0.0` | L1 regularization coefficient |
| `--label_smoothing` | `0.0` | Label smoothing epsilon for CrossEntropyLoss |
| `--plot` | `False` | Save training curves and confusion matrix to `plots/` |
| `--seed` | `42` | Global random seed |
| `--distill` | `False` | Train with knowledge distillation |
| `--distill_mode` | `hinton` | `hinton` (soft KL + hard CE) or `teacher_prob` (dynamic label smoothing) |
| `--teacher_path` | `teachers/resnet_teacher.pth` | Path to saved teacher weights |
| `--temperature` | `4.0` | Distillation temperature T |
| `--alpha` | `0.7` | Weight for soft KD loss (1-alpha for hard CE) |
| `--count_flops` | `False` | Print MACs and param count via ptflops |
| `--test_cifar10c` | `False` | Evaluate model on all CIFAR-10-C corruptions (19 types × 5 severities) |
| `--cifar10c_dir` | `data/CIFAR-10-C` | Path to extracted CIFAR-10-C folder containing `.npy` files |
| `--augmix` | `False` | Train with AugMix augmentation + JSD consistency loss |
| `--jsd_lambda` | `12.0` | Weight on the JSD consistency term (paper default) |
| `--augmix_save_path` | `best_model_augmix.pth` | Separate checkpoint path for AugMix-trained model |
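The `--warmup_epochs` option describes linear LR warmup followed by cosine decay. A minimal sketch of that schedule shape, assuming per-epoch stepping (the repo's actual scheduler wiring may differ):

```python
import math

def lr_at_epoch(epoch: int, base_lr: float = 1e-3, warmup_epochs: int = 5,
                total_epochs: int = 50, min_lr: float = 0.0) -> float:
    """Linear warmup from ~0 to base_lr, then cosine decay down to min_lr."""
    if epoch < warmup_epochs:
        return base_lr * (epoch + 1) / warmup_epochs
    # Progress through the cosine phase in [0, 1]
    t = (epoch - warmup_epochs) / max(1, total_epochs - warmup_epochs)
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * t))
```

With `--warmup_epochs 0` the warmup branch never triggers and the schedule reduces to plain cosine annealing.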
MLP:

```bash
--hidden_sizes 512 256 128   # hidden layer widths
--dropout 0.3
--activation relu            # relu or gelu
```

VGG:

```bash
--vgg_depth 16               # 11, 13, 16, or 19
```

ResNet:

```bash
--resnet_layers 2 2 2 2      # blocks per stage (default = ResNet-18)
```

```bash
# MLP on MNIST with GPU and plots
python main.py --mode both --dataset mnist --model mlp \
    --epochs 20 --lr 1e-3 --plot
```
```bash
# ResNet-18 on CIFAR-10 with cosine scheduler and early stopping
python main.py --mode both --dataset cifar10 --model resnet \
    --epochs 50 --lr 1e-3 --scheduler cosine \
    --patience 10 --plot
```

```bash
# VGG-16 on CIFAR-10
python main.py --mode both --dataset cifar10 --model vgg \
    --vgg_depth 16 --epochs 30 --plot
```

```bash
# Transfer learning: freeze backbone, train FC only (resize CIFAR to 224)
python main.py --mode both --dataset cifar10 --transfer_mode resizeFreeze \
    --epochs 10 --batch_size 128 --plot --log
```

```bash
# Transfer learning: adapt first conv for 32x32, fine-tune all layers
python main.py --mode both --dataset cifar10 --transfer_mode modifyFinetune \
    --epochs 10 --batch_size 128 --lr 1e-4 --plot --log
```

```bash
# Knowledge distillation: SimpleCNN student, ResNet teacher
# (copy best ResNet weights to teachers/resnet_teacher.pth first)
python main.py --dataset cifar10 --model cnn --distill \
    --teacher_path teachers/resnet_teacher.pth \
    --temperature 4.0 --alpha 0.7 \
    --epochs 20 --lr 1e-3 --batch_size 64 \
    --scheduler cosine --weight_decay 1e-4 \
    --mode both --plot --count_flops
```
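The `hinton` distillation objective blends a temperature-scaled soft KL term with the usual hard-label cross-entropy, weighted by `--alpha`. A minimal sketch of the standard Hinton KD loss (the repo's exact loss code may differ in detail):

```python
import torch
import torch.nn.functional as F

def hinton_kd_loss(student_logits: torch.Tensor, teacher_logits: torch.Tensor,
                   targets: torch.Tensor, T: float = 4.0, alpha: float = 0.7) -> torch.Tensor:
    """alpha * T^2 * KL(teacher_soft || student_soft) + (1 - alpha) * CE(hard)."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),  # kl_div expects log-probs as input
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)  # T^2 keeps gradient magnitudes comparable across temperatures
    hard = F.cross_entropy(student_logits, targets)
    return alpha * soft + (1.0 - alpha) * hard
```

With the defaults `--alpha 0.7 --temperature 4.0`, 70% of the training signal comes from the softened teacher distribution and 30% from the hard labels.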
```bash
# CIFAR-10-C robustness evaluation: all 19 corruptions × 5 severities
# (download CIFAR-10-C from https://zenodo.org/record/2535967, extract to data/CIFAR-10-C/)
python main.py --dataset cifar10 --transfer_mode modifyFinetune \
    --mode test --test_cifar10c --plot
```
```bash
# AugMix fine-tuning: same hyperparams as modifyFinetune, adds AugMix + JSD loss
# saves to best_model_augmix.pth (vanilla best_model.pth is preserved)
python main.py --dataset cifar10 --transfer_mode modifyFinetune \
    --epochs 20 --lr 1e-4 --batch_size 64 \
    --scheduler cosine --weight_decay 1e-4 \
    --augmix --mode both --plot
```
```bash
# AugMix model: test on clean + CIFAR-10-C
python main.py --dataset cifar10 --transfer_mode modifyFinetune \
    --augmix --mode test --test_cifar10c --plot
```

```
Deep-Learning/
├── main.py           # Entry point: model build, transfer learning, train/test dispatch
├── train.py          # Training loop, validation, LR schedulers, data loaders, transforms
├── test.py           # Test evaluation with per-class accuracy
├── plot.py           # Training curves and confusion matrix (saved to plots/)
├── logger.py         # Structured epoch table logger (terminal + logs/)
├── parameters.py     # Dataclasses and argparse for all hyperparameters
├── pretrained.py     # Standalone pretrained ResNet-18 eval script
├── NN_Visualizer.py  # torchviz architecture graph for MLP
├── models/
│   ├── MLP.py        # Multi-Layer Perceptron
│   ├── CNN.py        # LeNet-style CNN (MNIST) / SimpleCNN (CIFAR-10)
│   ├── VGG.py        # VGG-11/13/16/19
│   ├── ResNet.py     # ResNet with BasicBlock
│   └── MobileNet.py  # MobileNetV2 with stride-1 stem for 32×32
├── teachers/         # Gitignored; place teacher .pth weights here
└── requirements.txt
```
- Python 3.9+
- PyTorch >= 2.0
- torchvision >= 0.15
- numpy >= 1.24
- matplotlib >= 3.7
- ptflops >= 0.7 (for FLOPs counting)
- seaborn (optional, for nicer confusion matrix)