🎉 Congrats on mastering Artificial Neural Networks! You've now reached **Part 8 - Section 2** of your deep learning journey: **Convolutional Neural Networks (CNNs)** — the backbone of modern computer vision. 📸🚀


In this part, we’ll build your very first CNN to classify images of **cats and dogs** — a classic deep learning problem.


---


## 👀 How Humans See vs. How CNNs Work


Before diving into code, let's explore **why CNNs exist**.


When we humans look at an image, we detect **features** — eyes, ears, fur, shape, etc. Depending on the features your brain picks up, your interpretation might change (remember the duck or rabbit illusion? 🐇🦆).


CNNs aim to mimic this by learning **features in layers**:
- Low-level: edges, textures
- Mid-level: shapes
- High-level: full objects (dog vs cat)


---


## 🔍 Step-by-Step: How CNNs See an Image


### 🧱 1. **Convolution Layer**
- Applies **filters** to extract patterns.
- Each filter detects specific features like edges or textures.
- The result is a **feature map**.
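
To make the sliding-filter idea concrete, here is a minimal pure-Python sketch. The helper name `convolve2d_valid`, the toy image, and the hand-picked filter are all assumptions for illustration; a real CNN learns its filter values during training. Like most deep learning libraries, the sketch slides the filter without flipping it.

```python
def convolve2d_valid(image, kernel):
    """Slide `kernel` over `image` (no padding, stride 1) and return the feature map."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    feature_map = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            # Multiply the window element-wise with the filter and sum the results
            total = 0
            for di in range(kh):
                for dj in range(kw):
                    total += image[i + di][j + dj] * kernel[di][dj]
            row.append(total)
        feature_map.append(row)
    return feature_map


# Toy 4x4 image: dark left half (0), bright right half (1)
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

# Hand-picked vertical-edge filter (illustrative only)
kernel = [
    [-1, 1],
    [-1, 1],
]

feature_map = convolve2d_valid(image, kernel)
print(feature_map)  # [[0, 2, 0], [0, 2, 0], [0, 2, 0]]
```

The feature map is non-zero only in the middle column, exactly where the image changes from dark to bright.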


### ⚡ 2. **ReLU Layer**
- Applies the non-linear function `max(0, x)` to every value in the feature map, so negative responses become 0.
- Helps the network learn complex patterns.
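
As a quick sketch, ReLU on a small feature map looks like this. The `relu` helper and the sample values are made up for illustration.

```python
def relu(feature_map):
    """Replace every negative value with 0 and keep positive values unchanged."""
    return [[max(0, value) for value in row] for row in feature_map]


feature_map = [
    [-3, 2],
    [0, -1],
]

activated = relu(feature_map)
print(activated)  # [[0, 2], [0, 0]]
```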


### 🔽 3. **Pooling Layer**
- Reduces the width and height of each feature map (downsampling), which cuts computation in later layers.
- Common method: **Max Pooling** — takes the max value in a window.
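
Here is a small sketch of max pooling with a 2x2 window and a stride of 2, the same settings used later in this notebook. The `max_pool2d` helper and the sample feature map are assumptions for illustration.

```python
def max_pool2d(feature_map, pool_size=2, stride=2):
    """Keep only the largest value in each pool_size x pool_size window."""
    pooled = []
    for i in range(0, len(feature_map) - pool_size + 1, stride):
        row = []
        for j in range(0, len(feature_map[0]) - pool_size + 1, stride):
            window = [
                feature_map[i + di][j + dj]
                for di in range(pool_size)
                for dj in range(pool_size)
            ]
            row.append(max(window))
        pooled.append(row)
    return pooled


feature_map = [
    [1, 3, 2, 0],
    [4, 2, 1, 1],
    [0, 1, 5, 6],
    [2, 2, 7, 8],
]

pooled = max_pool2d(feature_map)
print(pooled)  # [[4, 2], [2, 8]]
```

The 4x4 map shrinks to 2x2, and each output value is the strongest response in its window.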


### 🧾 4. **Flattening**
- Converts pooled feature maps into a 1D vector.
- This vector becomes the input to the fully connected layers.
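
A rough sketch of flattening: two pooled 2x2 feature maps become one vector of 8 values. The `flatten` helper and the numbers are illustrative, and the element order shown here (one map after another) is a simplification; Keras has its own ordering for image tensors.

```python
def flatten(feature_maps):
    """Turn a list of 2D feature maps into a single 1D list."""
    return [value for fmap in feature_maps for row in fmap for value in row]


pooled_maps = [
    [[4, 2], [2, 8]],  # pooled output of a first (hypothetical) filter
    [[1, 0], [3, 5]],  # pooled output of a second (hypothetical) filter
]

vector = flatten(pooled_maps)
print(vector)       # [4, 2, 2, 8, 1, 0, 3, 5]
print(len(vector))  # 8
```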


### 🔗 5. **Fully Connected Layers**
- Traditional ANN layers.
- Perform the actual classification.


### 🧠 Output
- For binary classification (cat vs dog), we use **1 output neuron** with **Sigmoid activation**.
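
The sigmoid squashes the output neuron's raw score into a probability between 0 and 1, which is then turned into a class by a threshold. Below is a minimal sketch; the 0.5 threshold is the usual convention rather than something fixed by the network, and the sample scores are made up.

```python
import math


def sigmoid(z):
    """Map any real number to the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def to_class(probability, threshold=0.5):
    """Return 1 if the probability is above the threshold, otherwise 0."""
    return 1 if probability > threshold else 0


for score in (-2.0, 0.0, 2.0):
    p = sigmoid(score)
    print(score, round(p, 3), to_class(p))
```

A score of 0 gives a probability of exactly 0.5; large positive scores approach 1 and large negative scores approach 0.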


---


## 🛠️ Building a CNN with TensorFlow


We’ll now build a real CNN with TensorFlow:
- 2 Convolution + Pooling layers.
- 1 Flatten layer.
- 2 Dense (fully connected) layers.


You’ll train it on cat/dog images using `ImageDataGenerator` for image preprocessing and augmentation.


---

### ⭐ Importing the libraries

In [None]:
import tensorflow as tf
from tensorflow.keras.preprocessing.image import ImageDataGenerator

In [None]:
tf.__version__

## ⭐ Part 1 - Data Preprocessing

### Preprocessing the Training set

In [None]:
train_datagen = ImageDataGenerator(rescale = 1./255,
                                   shear_range = 0.2,
                                   zoom_range = 0.2,
                                   horizontal_flip = True)
training_set = train_datagen.flow_from_directory('dataset/training_set',
                                                 target_size = (64, 64),
                                                 batch_size = 32,
                                                 class_mode = 'binary')
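
To see what two of these options do, here is a tiny pure-Python sketch of rescaling and a horizontal flip on a 2x2 grayscale "image". The helper names are hypothetical, and unlike `ImageDataGenerator`, which applies flips at random, this sketch always flips.

```python
def rescale(image, factor=1.0 / 255):
    """Multiply every pixel by `factor`, mapping 0..255 to roughly 0..1."""
    return [[pixel * factor for pixel in row] for row in image]


def horizontal_flip(image):
    """Mirror the image left to right."""
    return [row[::-1] for row in image]


image = [
    [0, 255],
    [51, 102],
]

scaled = rescale(image)
flipped = horizontal_flip(image)

print(scaled)
print(flipped)  # [[255, 0], [102, 51]]
```

Rescaling keeps the inputs in a small, consistent range, while flips, shears, and zooms show the network slightly different versions of each training image.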

### Preprocessing the Test set

In [None]:
test_datagen = ImageDataGenerator(rescale = 1./255)
test_set = test_datagen.flow_from_directory('dataset/test_set',
                                            target_size = (64, 64),
                                            batch_size = 32,
                                            class_mode = 'binary')

## ⭐ Part 2 - Building the CNN



| Layer | Description |
|-------|-------------|
| Conv2D | Extracts features using 32 filters of size 3x3 |
| MaxPooling2D | Downsamples features (pool size 2x2) |
| Conv2D (2nd) | Detects more complex patterns with another 32 filters of size 3x3 |
| MaxPooling2D | Further reduces size |
| Flatten | Converts to 1D array |
| Dense (128) | Fully connected hidden layer |
| Dense (1) | Output layer with Sigmoid activation |

### Initialising the CNN

In [None]:
cnn = tf.keras.models.Sequential()

### Step 1 - Convolution

In [None]:
cnn.add(tf.keras.layers.Conv2D(filters=32, kernel_size=3, activation='relu', input_shape=[64, 64, 3]))

### Step 2 - Pooling

In [None]:
cnn.add(tf.keras.layers.MaxPool2D(pool_size=2, strides=2))

### Adding a second convolutional layer

In [None]:
cnn.add(tf.keras.layers.Conv2D(filters=32, kernel_size=3, activation='relu'))
cnn.add(tf.keras.layers.MaxPool2D(pool_size=2, strides=2))

### Step 3 - Flattening

In [None]:
cnn.add(tf.keras.layers.Flatten())

### Step 4 - Full Connection

In [None]:
cnn.add(tf.keras.layers.Dense(units=128, activation='relu'))

### Step 5 - Output Layer

In [None]:
cnn.add(tf.keras.layers.Dense(units=1, activation='sigmoid'))
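
It can help to trace how the image shrinks on its way through the network. The sketch below redoes the arithmetic by hand for the layers above, assuming the Keras defaults of no padding and stride 1 for `Conv2D`; the helper names are made up for illustration.

```python
def conv_out(size, kernel=3, stride=1):
    """Output width/height of a convolution without padding."""
    return (size - kernel) // stride + 1


def pool_out(size, pool=2, stride=2):
    """Output width/height of a pooling layer without padding."""
    return (size - pool) // stride + 1


size = 64                  # input images are 64x64x3
size = conv_out(size)      # first Conv2D   -> 62
size = pool_out(size)      # first MaxPool  -> 31
size = conv_out(size)      # second Conv2D  -> 29
size = pool_out(size)      # second MaxPool -> 14
flat_units = size * size * 32  # Flatten: 14 * 14 * 32 values

# Trainable parameters: (weights per unit + 1 bias) * number of units
conv1_params = (3 * 3 * 3 + 1) * 32
conv2_params = (3 * 3 * 32 + 1) * 32
dense_params = (flat_units + 1) * 128
output_params = (128 + 1) * 1
total_params = conv1_params + conv2_params + dense_params + output_params

print(flat_units)    # 6272
print(total_params)  # 813217
```

If the assumptions hold, these numbers should match what `cnn.summary()` reports. Almost all of the parameters sit in the first dense layer.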

## ⭐ Part 3 - Training the CNN


- **Optimizer**: `adam`, which adapts the step size for each weight and usually converges quickly.
- **Loss function**: `binary_crossentropy`, the standard loss for a single sigmoid output.
- **Metrics**: `accuracy`
- **Epochs**: 25


As training runs through the epochs, the model gets better at telling cats from dogs, which typically shows up as rising accuracy.
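
For intuition about the loss, here is a minimal hand-rolled version of binary cross-entropy. It is a sketch, not the Keras implementation; the clipping value `eps` and the sample predictions are illustrative choices.

```python
import math


def binary_crossentropy(y_true, y_pred, eps=1e-7):
    """Average of -[t*log(p) + (1-t)*log(1-p)] over all samples."""
    total = 0.0
    for t, p in zip(y_true, y_pred):
        p = min(max(p, eps), 1 - eps)  # keep log() away from 0
        total += -(t * math.log(p) + (1 - t) * math.log(1 - p))
    return total / len(y_true)


labels = [1, 0]

good = binary_crossentropy(labels, [0.9, 0.1])  # confident and correct
bad = binary_crossentropy(labels, [0.1, 0.9])   # confident and wrong

print(round(good, 4), round(bad, 4))
```

Confident correct predictions give a small loss, while confident wrong ones are penalised heavily. That is what pushes the sigmoid output towards the true label during training.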



### Compiling the CNN

In [None]:
cnn.compile(optimizer = 'adam', loss = 'binary_crossentropy', metrics = ['accuracy'])

### Training the CNN on the Training set and evaluating it on the Test set

In [None]:
cnn.fit(x = training_set, validation_data = test_set, epochs = 25)

## ⭐ Part 4 - Making a single prediction

In [None]:
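import numpy as np
from tensorflow.keras.preprocessing import image

# Load the image at the same 64x64 size the network was trained on
test_image = image.load_img('dataset/single_prediction/cat_or_dog_1.jpg', target_size = (64, 64))
test_image = image.img_to_array(test_image) / 255.0  # match the 1./255 rescaling used in training
test_image = np.expand_dims(test_image, axis = 0)    # add the batch dimension: (1, 64, 64, 3)
result = cnn.predict(test_image)                     # sigmoid probability of the class with index 1
print(training_set.class_indices)                    # shows which folder maps to 0 and which to 1
# Threshold the probability at 0.5 rather than testing for exactly 1
# (assumes class_indices maps the dog folder to 1)
if result[0][0] > 0.5:
  prediction = 'dog'
else:
  prediction = 'cat'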

In [None]:
print(prediction)
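
Instead of hard-coding the labels, you could look them up from the class mapping. The sketch below uses a hypothetical dictionary: the folder names `cats` and `dogs` are assumed here, and in the notebook you would use `training_set.class_indices` instead.

```python
# Hypothetical mapping; in the notebook, use training_set.class_indices
class_indices = {'cats': 0, 'dogs': 1}

# Invert the mapping so an index can be turned back into a folder name
index_to_label = {index: name for name, index in class_indices.items()}


def label_for(probability, threshold=0.5):
    """Turn a sigmoid probability into the matching folder name."""
    return index_to_label[1 if probability > threshold else 0]


print(label_for(0.93))  # dogs
print(label_for(0.12))  # cats
```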

---

## ✅ Wrap-Up: What You’ve Learned


🎯 In this section, you:
- Understood how CNNs mimic human vision.
- Learned key concepts: convolution, pooling, flattening, fully connected layers.
- Built a CNN to classify cat and dog images.
- Made your own predictions on new images.