This project is a hands-on experimental lab designed to understand how different gradient-based optimizers and learning-rate schedules influence the performance of linear regression models. It focuses on building everything from scratch—loss functions, gradients, optimizers, and schedulers—to analyze convergence speed, training stability, and generalization.
This repository implements a complete workflow for regularized linear regression (MSE + L2).
It provides a reproducible environment to compare:
- Batch / Mini-batch / Stochastic Gradient Descent
- Optimizers: SGD, Momentum, Adagrad, RMSProp, Adam
- Learning-rate schedules: Inverse Time, Step Decay, Exponential, Polynomial, Cosine Annealing, SGDR
Key features:
- Fully from-scratch implementations (loss, gradients, optimizers, LR schedulers)
- Modular structure for rapid experimentation
- Consistent logging: train/test loss, best iteration, parameter stability
- Visualization tools for loss curves and learning rate evolution
- Experiment-driven workflow (each Ex1-xx isolates one technique)
Optimizers:
- SGD
- SGD with Momentum
- RMSProp
- Adam
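
For reference, the core update rules look roughly like this (a minimal NumPy sketch; function names, signatures, and defaults are illustrative and not necessarily those used in `gdlib`):

```python
import numpy as np

def momentum_step(w, grad, velocity, lr=0.01, beta=0.9):
    """One SGD-with-Momentum update: the velocity accumulates past gradients."""
    velocity = beta * velocity + grad
    return w - lr * velocity, velocity

def adam_step(w, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update; t is the 1-based step count used for bias correction."""
    m = beta1 * m + (1 - beta1) * grad          # first-moment estimate
    v = beta2 * v + (1 - beta2) * grad ** 2     # second-moment estimate
    m_hat = m / (1 - beta1 ** t)                # bias-corrected moments
    v_hat = v / (1 - beta2 ** t)
    return w - lr * m_hat / (np.sqrt(v_hat) + eps), m, v
```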
Gradient descent variants:
- Batch Gradient Descent
- Mini-batch Gradient Descent
- Stochastic Gradient Descent
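
These three variants differ only in how many examples feed each gradient step. The helper below (hypothetical, not part of `gdlib`) makes the distinction explicit:

```python
import numpy as np

def sample_batch(n_samples, variant, batch_size=32, rng=None):
    """Return the indices used for one gradient step under each variant."""
    rng = rng or np.random.default_rng()
    if variant == "batch":        # full dataset every step
        return np.arange(n_samples)
    if variant == "mini-batch":   # a random subset of fixed size
        return rng.choice(n_samples, size=batch_size, replace=False)
    if variant == "stochastic":   # a single random example
        return rng.choice(n_samples, size=1)
    raise ValueError(f"unknown variant: {variant}")
```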
Learning-rate schedules:
- Fixed learning rate
- Inverse Time Decay
- Step Decay
- Exponential Decay
- Polynomial Decay
- Cosine Annealing
- Cosine Annealing with Warm Restarts (SGDR)
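
Each schedule is just a function of the iteration count. A minimal sketch of a few of them, with illustrative names and default constants (not necessarily those used in the notebooks):

```python
import math

def inverse_time_decay(lr0, t, k=0.01):
    """lr = lr0 / (1 + k * t)."""
    return lr0 / (1 + k * t)

def step_decay(lr0, t, drop=0.5, every=100):
    """Multiply the initial rate by `drop` every `every` iterations."""
    return lr0 * (drop ** (t // every))

def exponential_decay(lr0, t, k=0.01):
    """Smooth exponential decay: lr = lr0 * exp(-k * t)."""
    return lr0 * math.exp(-k * t)

def polynomial_decay(lr0, t, total_iters, power=2.0, lr_min=0.0):
    """Decay from lr0 to lr_min following (1 - t/T)^power."""
    progress = min(t / total_iters, 1.0)
    return lr_min + (lr0 - lr_min) * (1 - progress) ** power

def cosine_annealing(lr0, t, total_iters, lr_min=0.0):
    """Anneal from lr0 down to lr_min along half a cosine wave."""
    progress = t / total_iters
    return lr_min + 0.5 * (lr0 - lr_min) * (1 + math.cos(math.pi * progress))

def sgdr(lr0, t, cycle_len=100, lr_min=0.0):
    """Warm restarts: repeat the cosine cycle every `cycle_len` iterations
    (the fixed-cycle variant; the original SGDR grows the cycle length)."""
    return cosine_annealing(lr0, t % cycle_len, cycle_len, lr_min)
```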
Regularization:
- L2 (Ridge)
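
The underlying objective in every experiment is the standard ridge (MSE + L2) loss. A from-scratch sketch of the loss and its gradient (illustrative names, not necessarily the `gdlib` API):

```python
import numpy as np

def ridge_loss(X, y, w, lam=0.1):
    """(1/n) * ||Xw - y||^2 + lam * ||w||^2."""
    residual = X @ w - y
    return np.mean(residual ** 2) + lam * np.sum(w ** 2)

def ridge_gradient(X, y, w, lam=0.1):
    """Gradient of the ridge loss with respect to w."""
    n = X.shape[0]
    residual = X @ w - y
    return (2.0 / n) * (X.T @ residual) + 2.0 * lam * w
```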
Project structure:

```
GradientDescentLab/
├── gdlib/       # Core implementations: optimizers, LR schedulers, utilities
├── notebooks/   # Experiment notebooks (Ex1-xx)
├── images/      # Exported plots and visual results
└── README.md
```
Install dependencies: `pip install -r requirements.txt`

Launch Jupyter: `jupyter lab`

Reproduce an experiment:
1. Open any notebook inside `notebooks/`
2. Select the optimizer and learning-rate schedule
3. Set hyperparameters (learning rate, decay, batch size, regularization)
4. Run training
5. Visualize the results (loss curves, learning-rate curve)
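
As an illustration of what a full run boils down to, here is a self-contained mini-batch training loop on synthetic data (plain NumPy with made-up hyperparameters; the notebooks use the `gdlib` implementations instead):

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic regression data (illustrative only).
X = rng.normal(size=(200, 3))
y = X @ np.array([2.0, -1.0, 0.5]) + 0.1 * rng.normal(size=200)

w = np.zeros(3)
lr0, lam, batch_size, n_iters = 0.1, 0.01, 32, 500

for t in range(n_iters):
    # Inverse time decay of the learning rate.
    lr = lr0 / (1 + 0.01 * t)

    # Sample a mini-batch.
    idx = rng.choice(len(X), size=batch_size, replace=False)
    Xb, yb = X[idx], y[idx]

    # Gradient of the MSE + L2 loss on the mini-batch.
    grad = (2.0 / batch_size) * Xb.T @ (Xb @ w - yb) + 2.0 * lam * w

    # Plain SGD step.
    w -= lr * grad

print("learned weights:", w)
```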
The following comparisons are selected from a broader set of 44 controlled experiments.
The groups highlighted here represent the most informative and practically relevant outcomes for understanding optimizer behavior in linear regression training.
- Combines Adam’s adaptive updates with SGD-style stochasticity
- Produces fast early convergence with moderate variance
- Useful for examining the trade-off between stability and exploration
- Shows clear improvements in escaping shallow minima
- Achieves better mid-training generalization compared to fixed learning rates
- Demonstrates strong performance in scenarios requiring periodic learning-rate resets
- Provides the most stable and predictable convergence among all tested setups
- Reduces gradient noise while retaining efficiency
- Achieves consistently strong generalization across experiments
More detailed plots and experiment logs are available in the images/ folder and notebooks.