## **Transfer learning**

- when we are using pretrained model we must watch to domain and task of our own problem and pretrained model<br>
Full documentation link is here: https://www.learndatasci.com/tutorials/hands-on-transfer-learning-keras/

- github repo link: https://github.com/gabrielcassimiro17/object-detection/blob/main/transfer_learning.ipynb

- **Example :**VGG16 train on imagenet dataset contain 14million hight resolution images and 1000 class/label
Our goal is classification(flower) but pretained model VGG16 cannot contain flower images
- The VGG16 network was not trained to classify different kinds of flowers.

## **Install required libraries**

In [None]:
!pip install tensorflow_datasets tensorflow keras

## **Load dataset from Tensorflow**


In [4]:
import tensorflow_datasets as tfds
import tensorflow as tf
from tensorflow.keras.utils import to_categorical

## Loading images and labels
(train_ds, train_labels), (test_ds, test_labels) = tfds.load(
    "tf_flowers",
    split=["train[:70%]", "train[:30%]"], ## Train test split
    batch_size=-1,
    as_supervised=True,  # Include labels
)

## Resizing images
train_ds = tf.image.resize(train_ds, (150, 150))
test_ds = tf.image.resize(test_ds, (150, 150))

## Transforming labels to correct format
train_labels = to_categorical(train_labels, num_classes=5)
test_labels = to_categorical(test_labels, num_classes=5)

In [6]:
train_labels

array([[0., 0., 1., 0., 0.],
       [0., 0., 0., 1., 0.],
       [0., 0., 0., 1., 0.],
       ...,
       [1., 0., 0., 0., 0.],
       [0., 0., 1., 0., 0.],
       [1., 0., 0., 0., 0.]], dtype=float32)

1.   We use Include_top=False to remove the classification layer that was trained on the ImageNet dataset and set the model as not trainable
2.   Also, we used the preprocess_input function from VGG16 to normalize the input data.



In [7]:
from tensorflow.keras.applications.vgg16 import VGG16
from tensorflow.keras.applications.vgg16 import preprocess_input

##load VGG16 model

base_model = VGG16(weights='imagenet', include_top=False, input_shape=train_ds[0].shape)
base_model.trainable = False   # not trainable weights

# preprocess the input
train_ds = preprocess_input(train_ds)
test_ds = preprocess_input(test_ds)

Downloading data from https://storage.googleapis.com/tensorflow/keras-applications/vgg16/vgg16_weights_tf_dim_ordering_tf_kernels_notop.h5


**check model summary**


In [8]:
base_model.summary()

Model: "vgg16"
_________________________________________________________________
 Layer (type)                Output Shape              Param #   
 input_1 (InputLayer)        [(None, 150, 150, 3)]     0         
                                                                 
 block1_conv1 (Conv2D)       (None, 150, 150, 64)      1792      
                                                                 
 block1_conv2 (Conv2D)       (None, 150, 150, 64)      36928     
                                                                 
 block1_pool (MaxPooling2D)  (None, 75, 75, 64)        0         
                                                                 
 block2_conv1 (Conv2D)       (None, 75, 75, 128)       73856     
                                                                 
 block2_conv2 (Conv2D)       (None, 75, 75, 128)       147584    
                                                                 
 block2_pool (MaxPooling2D)  (None, 37, 37, 128)       0     

####**Two main points:**
- the model has over 14 Million trained parameters
- and ends with a maxpooling layer that belongs to the Feature Learning part of the network.

Now we add the last layers for our specific problem.<br>
"cutt-off" --> the fullyconnect layers which are called top model

In [9]:
from tensorflow.keras import layers, models

flatten_layers = layers.Flatten()
dense_layer1 = layers.Dense(50, activation = 'relu')
dense_layer2 = layers.Dense(20, activation = 'relu')
prediction_layers = layers.Dense(5, activation='softmax')


model = models.Sequential([
    base_model,
    flatten_layers,
    dense_layer1,
    dense_layer2,
    prediction_layers
])

**Add compile and fit the model**

In [14]:
from tensorflow.keras.callbacks import EarlyStopping

model.compile(
    optimizer='adam',
    loss = 'categorical_crossentropy',
    metrics=['accuracy']
)


# add EarlyStopping layers--> this layers monitor val_accuracy  if it maximum then stop our model training
# and store weights

early_stopping = EarlyStopping(monitor = 'val_accuracy', mode ="max", patience=5, restore_best_weights=True)



**Strat model training by using callbacks for monitoring**

In [16]:
model.fit(train_ds, train_labels, epochs=10, validation_split=0.2,  callbacks=[early_stopping], batch_size=32)

Epoch 1/10
Epoch 2/10
Epoch 3/10
Epoch 4/10
Epoch 5/10
Epoch 6/10
Epoch 7/10
Epoch 8/10
Epoch 9/10
Epoch 10/10


<keras.src.callbacks.History at 0x79673a97d060>

## **Evaluating this model on test data**

In [17]:
model.evaluate(test_ds, test_labels)



[0.054106153547763824, 0.9854677319526672]

## **Train the CNN model from scratch on the same dataset and compare it with pre-train model**

In [20]:
from tensorflow.keras import Sequential , layers
from tensorflow.keras.callbacks import EarlyStopping  # for monitoring
from tensorflow.keras.layers.experimental.preprocessing import Rescaling

In [23]:
own_model = Sequential()

own_model.add(Rescaling(1./255, input_shape=(150,150,3)))

own_model.add(layers.Conv2D(16, kernel_size=10, activation='relu'))
own_model.add(layers.MaxPooling2D(3))


own_model.add(layers.Conv2D(32, kernel_size=8, activation="relu"))
own_model.add(layers.MaxPooling2D(2))

own_model.add(layers.Conv2D(32, kernel_size=6, activation="relu"))
own_model.add(layers.MaxPooling2D(2))

own_model.add(layers.Flatten())
own_model.add(layers.Dense(50, activation='relu'))
own_model.add(layers.Dense(20, activation='relu'))
own_model.add(layers.Dense(5, activation = 'softmax'))


In [26]:

own_model.compile(
    optimizer='adam',
    loss='categorical_crossentropy',
    metrics=['accuracy'],
)


es = EarlyStopping(monitor='val_accuracy', mode='max', patience=5,  restore_best_weights=True)

own_model.fit(train_ds, train_labels, epochs=50, validation_split=0.2, batch_size=32, callbacks=[es])


Epoch 1/50
Epoch 2/50
Epoch 3/50
Epoch 4/50
Epoch 5/50
Epoch 6/50
Epoch 7/50
Epoch 8/50
Epoch 9/50
Epoch 10/50
Epoch 11/50
Epoch 12/50
Epoch 13/50
Epoch 14/50
Epoch 15/50
Epoch 16/50
Epoch 17/50


<keras.src.callbacks.History at 0x796702ecd600>

## **Again evaluate our own train model that we are train from scratch**

In [27]:
own_model.evaluate(test_ds, test_labels)



[0.42360004782676697, 0.8519527912139893]

- Compare the result of own train model that we made it from scrach and result of using pre-traine model<br>
-The result of pre-trian model is better the model which are made from scratch