# Day-66: CNN Architectures

If Day 65 was about learning how CNNs work (convolution, pooling, etc.),
then Day 66 is about seeing those ideas in action through famous architectures that shaped deep learning history.

I hope you're all feeling like true CNN master architects after tackling the Convolution and Pooling layers yesterday! We learned the building blocks, guys!

But here's the thing: having bricks and mortar isn't enough to build a skyscraper. You need the blueprint! And in deep learning, that blueprint is the architecture!

You’ll see how each model improved upon the previous one — becoming deeper, smarter, and more powerful.

We’ll go from:

- LeNet (handwritten digit recognition in 1998), to

- AlexNet (ImageNet champion of 2012, the start of the deep learning revolution!), to

- VGG16/VGG19 (deeper and cleaner architectures used even today).

## Topics Covered

- LeNet — The Foundation of CNNs

- AlexNet — The Model that Changed the Game

- VGG16/19 — Deep Yet Simple

## LeNet (1998): The Grandfather

LeNet was developed by Yann LeCun and his colleagues to read handwritten digits on checks. It was one of the first models to successfully train the Conv+Pool structure end to end with backpropagation.

- `Analogy`: Think of LeNet like the first-generation digital camera — simple, but revolutionary.
It could look at a handwritten “5” and say — “Yes, that’s a 5!”

Key Insight: Proved the concept of spatial feature learning, demonstrating a simple but complete CNN structure: Conv→Pool→Conv→Pool→FFNN (Dense).

### Structure:

- Input: 32×32 grayscale image

- Two convolution layers + pooling layers

- Two fully connected layers

- Output: 10 digits (0–9)

![image.png](attachment:image.png)

LeNet brought together local receptive fields, weight sharing, and subsampling in one trainable network, and these ideas are still the backbone of CNNs today.
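To make the Conv→Pool→Conv→Pool→Dense structure concrete, here is a small pure-Python sketch that traces the feature-map sizes and parameter counts through a LeNet-style network. The 5×5 kernels, the 6 and 16 filters, and the 120 and 84 dense units follow the commonly quoted LeNet-5 layout; the helper names (`conv_out`, `lenet_trace`, and so on) are made up for this illustration. It also assumes fully connected feature maps and parameter-free pooling, which is how modern re-implementations usually simplify the 1998 design, so treat the numbers as an approximation of the original.

```python
def conv_out(size, kernel, stride=1, padding=0):
    """Output width/height after sliding a square window over a square input."""
    return (size + 2 * padding - kernel) // stride + 1


def conv_params(kernel, in_ch, out_ch):
    """Kernel weights plus one bias per output channel."""
    return kernel * kernel * in_ch * out_ch + out_ch


def dense_params(in_units, out_units):
    """Weight matrix plus one bias per output unit."""
    return in_units * out_units + out_units


def lenet_trace(input_size=32):
    """Return (layer name, output shape, parameter count) for each layer."""
    trace = []
    size, channels = input_size, 1  # 32x32 grayscale input
    stages = [
        ("conv1", "conv", 5, 6),
        ("pool1", "pool", 2, 6),
        ("conv2", "conv", 5, 16),
        ("pool2", "pool", 2, 16),
    ]
    for name, kind, kernel, out_ch in stages:
        if kind == "conv":
            params = conv_params(kernel, channels, out_ch)
            size = conv_out(size, kernel)
        else:
            params = 0  # assumption: pooling has no trainable parameters
            size = conv_out(size, kernel, stride=kernel)
        channels = out_ch
        trace.append((name, (size, size, channels), params))

    units = size * size * channels  # flatten before the dense layers
    for name, out_units in [("dense1", 120), ("dense2", 84), ("output", 10)]:
        trace.append((name, (out_units,), dense_params(units, out_units)))
        units = out_units
    return trace


for name, shape, params in lenet_trace():
    print(f"{name:7s} {str(shape):13s} {params:>6d} params")
print("total:", sum(params for _, _, params in lenet_trace()))
```

Under these assumptions the image shrinks 32 → 28 → 14 → 10 → 5, and the whole network comes to roughly 62 thousand parameters, tiny by today's standards.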

## AlexNet (2012): The Deep Learning Revolution

Created by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton,
AlexNet shocked the world by winning the ImageNet competition (ILSVRC 2012) with a top-5 error of 15.3%, more than 10 percentage points ahead of the runner-up's 26.2%!

Structure Highlights:

- 5 convolutional layers

- 3 fully connected layers

- ReLU activation

- Dropout to prevent overfitting

![image.png](attachment:image.png)
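Here is a rough sketch of how the spatial size changes across AlexNet's five convolutional layers. The kernel sizes, strides, paddings, and filter counts below are the values usually quoted for the single-tower version of the network, and the 227×227 input is an assumption (the paper states 224×224, but 227 is the size that makes the arithmetic work out). The names `out_size`, `ALEXNET_STAGES`, and `alexnet_shapes` are just for this example.

```python
def out_size(size, kernel, stride=1, padding=0):
    """Output width/height of a convolution or pooling layer."""
    return (size + 2 * padding - kernel) // stride + 1


# (name, kernel, stride, padding, output channels)
ALEXNET_STAGES = [
    ("conv1", 11, 4, 0, 96),
    ("pool1", 3, 2, 0, 96),   # overlapping 3x3 max pooling with stride 2
    ("conv2", 5, 1, 2, 256),
    ("pool2", 3, 2, 0, 256),
    ("conv3", 3, 1, 1, 384),
    ("conv4", 3, 1, 1, 384),
    ("conv5", 3, 1, 1, 256),
    ("pool3", 3, 2, 0, 256),
]


def alexnet_shapes(input_size=227):
    """Trace (height, width, channels) after every conv/pool stage."""
    size = input_size
    shapes = []
    for name, kernel, stride, padding, channels in ALEXNET_STAGES:
        size = out_size(size, kernel, stride, padding)
        shapes.append((name, (size, size, channels)))
    return shapes


shapes = alexnet_shapes()
for name, shape in shapes:
    print(f"{name:6s} -> {shape}")

height, width, channels = shapes[-1][1]
flattened = height * width * channels  # input to the first of the 3 dense layers
print("flattened features:", flattened)
```

The three fully connected layers (4096, 4096, and 1000 units in the usual description) then sit on top of that flattened vector.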

Key Innovations:

- Depth: With 8 learned layers, it was much deeper than earlier CNNs such as LeNet.

- ReLU: It replaced Sigmoid/Tanh with ReLU in its hidden layers, which mitigates the vanishing gradient problem and lets deep nets train much faster.

- GPUs: It split training across two GPUs to handle the massive computation, showing how important GPU (CUDA) acceleration is for state-of-the-art vision models.
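To see why ReLU helps with vanishing gradients, here is a deliberately simplified sketch. It multiplies the activation's derivative once per layer and ignores the weights entirely, so it is an illustration of the trend rather than a real backpropagation pass. The function names are made up for this example.

```python
import math


def sigmoid_grad(x):
    """Derivative of the sigmoid; its largest possible value is 0.25 (at x = 0)."""
    s = 1.0 / (1.0 + math.exp(-x))
    return s * (1.0 - s)


def relu_grad(x):
    """Derivative of ReLU: 1 for positive inputs, 0 otherwise."""
    return 1.0 if x > 0 else 0.0


def chained_gradient(grad_fn, pre_activation, depth):
    """Multiply the same activation derivative once per layer (weights ignored)."""
    return grad_fn(pre_activation) ** depth


depth = 8  # AlexNet has 8 learned layers
sigmoid_signal = chained_gradient(sigmoid_grad, 0.0, depth)  # best case for sigmoid
relu_signal = chained_gradient(relu_grad, 1.0, depth)        # any active ReLU unit

print(f"sigmoid, {depth} layers: {sigmoid_signal:.2e}")
print(f"relu,    {depth} layers: {relu_signal:.2e}")
```

Even in the sigmoid's best case the gradient factor shrinks to about 0.25 to the power of 8, while an active ReLU path passes it through unchanged. ReLU units that are switched off pass no gradient at all, which is why this is a mitigation and not a complete cure.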

- `Analogy`:
If LeNet was a small hatchback, AlexNet was a supercar — powerful, fast, and capable of seeing thousands of object types.

## VGG16/VGG19 (2014): Depth Matters

Developed by Karen Simonyan and Andrew Zisserman (Oxford),
VGG (Visual Geometry Group) showed that depth alone can improve accuracy — with small (3×3) filters stacked deeper and deeper.
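Why stack small 3×3 filters instead of using one big filter? A quick back-of-the-envelope sketch: the code below compares the receptive field and the weight count of three stacked 3×3 convolutions against a single 7×7 convolution. It assumes stride 1, the same number of input and output channels in every layer, and ignores biases; the channel count of 256 is just an example value.

```python
def stacked_receptive_field(num_layers, kernel=3):
    """Receptive field of `num_layers` stride-1 convolutions stacked on each other."""
    return 1 + num_layers * (kernel - 1)


def conv_weights(kernel, channels):
    """Weights of one conv layer with `channels` inputs and outputs (no biases)."""
    return kernel * kernel * channels * channels


channels = 256  # example value, not tied to a specific VGG block
three_3x3 = 3 * conv_weights(3, channels)
one_7x7 = conv_weights(7, channels)

print("receptive field of three 3x3 layers:", stacked_receptive_field(3))
print("weights, three 3x3 layers:", three_3x3)
print("weights, one 7x7 layer:   ", one_7x7)
print("ratio:", round(three_3x3 / one_7x7, 2))
```

So the stack sees the same 7×7 region with roughly 55% of the weights, and it gets three non-linearities instead of one along the way.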

Structure:

- VGG16 → 16 layers (13 conv + 3 dense)

- VGG19 → 19 layers (16 conv + 3 dense)

- Uses only 3×3 convolutions and 2×2 max pooling

- About 138 million parameters in VGG16 (about 144 million in VGG19)

![image.png](attachment:image.png)
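Where does the parameter count come from? The sketch below adds it up by hand, assuming the standard ImageNet setup: a 224×224×3 input, 3×3 convolutions with biases, 2×2 pooling that halves the spatial size, and a 4096-4096-1000 dense head. The config lists and the `count_params` helper are written for this example and are not part of any library.

```python
# Numbers are output channels of a 3x3 conv layer; "M" is a 2x2 max-pooling step.
VGG16_CONFIG = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
                512, 512, 512, "M", 512, 512, 512, "M"]
VGG19_CONFIG = [64, 64, "M", 128, 128, "M", 256, 256, 256, 256, "M",
                512, 512, 512, 512, "M", 512, 512, 512, 512, "M"]


def count_params(config, input_size=224, in_channels=3,
                 dense_units=(4096, 4096, 1000)):
    """Count weights and biases of a VGG-style network."""
    total = 0
    size, channels = input_size, in_channels
    for item in config:
        if item == "M":
            size //= 2  # pooling halves height and width, adds no parameters
        else:
            total += 3 * 3 * channels * item + item  # 3x3 kernels + biases
            channels = item
    units = size * size * channels  # flatten: 7 * 7 * 512 for a 224x224 input
    for out_units in dense_units:
        total += units * out_units + out_units
        units = out_units
    return total


print(f"VGG16: {count_params(VGG16_CONFIG):,}")
print(f"VGG19: {count_params(VGG19_CONFIG):,}")
```

Most of the total sits in the first dense layer (7×7×512 inputs feeding 4096 units), not in the convolutions.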

- `Analogy`:
Imagine LEGO blocks 🧱 — simple building units stacked neatly.
That’s how VGG builds depth — one small filter at a time, leading to complex feature extraction.


## Code Example (Keras Implementation)

```python
from tensorflow.keras.applications import VGG16
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Flatten

# Load VGG16 with its ImageNet weights and the original 1000-class dense head
vgg_model = VGG16(weights='imagenet', include_top=True)

# Display model summary
vgg_model.summary()

# Or use it as a feature extractor: drop the dense head, keep the conv base
vgg_feature_extractor = VGG16(weights='imagenet', include_top=False, input_shape=(224, 224, 3))
vgg_feature_extractor.trainable = False  # freeze the pre-trained conv weights

# Add a small classifier on top (10 output classes, as an example)
model = Sequential([
    vgg_feature_extractor,
    Flatten(),
    Dense(256, activation='relu'),
    Dense(10, activation='softmax')
])

model.summary()
```




## Summary of Day 66

Today we travelled through time and mastered the blueprints of deep learning history:

- LeNet introduced the foundational CNN structure.

- AlexNet proved that deep learning and ReLU could win major competitions with GPU power.

- VGG showed us the power of depth by using small, uniform 3×3 kernels stacked together.

You now know how to combine the fundamental Conv and Pool layers into powerful, recognized architectures!

## What’s Next (Day 67 Preview)

Now that we’ve studied how CNN architectures evolved —
tomorrow, we’ll move into one of the most powerful modern techniques in deep learning:

Day 67: Transfer Learning with Pretrained Models

We’ll cover:

- What is Transfer Learning

- How to Fine-Tune pretrained CNNs

- Using ResNet and MobileNet for your own projects

You’ll learn how to use these massive models as feature extractors — and train them on your own dataset even with limited data!