T.S.Varshini : VIT Chennai (20BRS1125) : varshini.ts2020@vitstudent.ac.in : 9003064047

Build a CNN model for Bird species

Bird species classification is the process of using machine learning and computer vision techniques to identify and categorize different species of birds based on their visual characteristics. By analyzing images of birds, models can extract features and patterns to accurately classify bird species. This classification is vital for ecological research, wildlife monitoring, and conservation efforts. Advancements in deep learning and the availability of large annotated datasets have improved the accuracy of bird species classification models. Challenges include variations in lighting, pose, and background clutter. Ongoing research focuses on methods like transfer learning and data augmentation to enhance classification performance and contribute to avian biodiversity understanding and conservation.

Dataset Link: https://www.kaggle.com/datasets/akash2907/bird-species-classification 

In [1]:
import numpy as np
import pandas as pd

In [2]:
# Data Augmentation

from tensorflow.keras.preprocessing.image import ImageDataGenerator

In [3]:
train_gen = ImageDataGenerator(rescale=(1./255),horizontal_flip=True,shear_range=0.2)
test_gen = ImageDataGenerator(rescale=(1./255))  #--> (0 to 255) convert to (0 to 1)

In [4]:
train = train_gen.flow_from_directory('AI_Assignment3/train_data',
                                      target_size=(240, 240),
                                      class_mode='categorical', 
                                      batch_size=8)
test = test_gen.flow_from_directory('AI_Assignment3/test_data',
                                    target_size=(240, 240),
                                      class_mode='categorical', 
                                      batch_size=8)

Found 150 images belonging to 16 classes.
Found 157 images belonging to 16 classes.


In [5]:
train.class_indices

{'blasti': 0,
 'bonegl': 1,
 'brhkyt': 2,
 'cbrtsh': 3,
 'cmnmyn': 4,
 'gretit': 5,
 'hilpig': 6,
 'himbul': 7,
 'himgri': 8,
 'hsparo': 9,
 'indvul': 10,
 'jglowl': 11,
 'lbicrw': 12,
 'mgprob': 13,
 'rebimg': 14,
 'wcrsrt': 15}

In [6]:
# CNN

from tensorflow.keras.layers import Convolution2D,MaxPooling2D,Flatten,Dense
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import BatchNormalization, Dropout

In [7]:
# Initializing the seq model
model = Sequential()
# Adding conv layer with input
model.add(Convolution2D(24,(3,3),activation='relu',input_shape=(240, 240, 3)))
# Normalizing the conv layer output
model.add(BatchNormalization())
# Selecting the max values
model.add(MaxPooling2D(pool_size=(2,2)))
# Dropping the unwanted 20% of data
model.add(Dropout(0.2))

model.add(Convolution2D(24,(3,3),activation='relu'))
model.add(BatchNormalization())
model.add(MaxPooling2D(pool_size=(2,2)))
model.add(Dropout(0.1))

model.add(Convolution2D(36,(3,3),activation='relu'))
model.add(BatchNormalization())
model.add(MaxPooling2D(pool_size=(2,2)))
model.add(Flatten())

# Hiddern layers
model.add(Dense(62,activation='relu'))
model.add(Dense(32,activation='relu'))
model.add(Dense(16,activation='relu'))
# Output layer
model.add(Dense(16,activation='softmax'))
     

In [8]:
model.summary()

Model: "sequential"
_________________________________________________________________
 Layer (type)                Output Shape              Param #   
 conv2d (Conv2D)             (None, 238, 238, 24)      672       
                                                                 
 batch_normalization (BatchN  (None, 238, 238, 24)     96        
 ormalization)                                                   
                                                                 
 max_pooling2d (MaxPooling2D  (None, 119, 119, 24)     0         
 )                                                               
                                                                 
 dropout (Dropout)           (None, 119, 119, 24)      0         
                                                                 
 conv2d_1 (Conv2D)           (None, 117, 117, 24)      5208      
                                                                 
 batch_normalization_1 (Batc  (None, 117, 117, 24)     9

In [9]:
model.compile(optimizer='adam',loss='categorical_crossentropy',metrics=['accuracy'])

In [10]:
model.fit(train,batch_size=8,validation_data=test,epochs=50)

Epoch 1/50
Epoch 2/50
Epoch 3/50
Epoch 4/50
Epoch 5/50
Epoch 6/50
Epoch 7/50
Epoch 8/50
Epoch 9/50
Epoch 10/50
Epoch 11/50
Epoch 12/50
Epoch 13/50
Epoch 14/50
Epoch 15/50
Epoch 16/50
Epoch 17/50
Epoch 18/50
Epoch 19/50
Epoch 20/50
Epoch 21/50
Epoch 22/50
Epoch 23/50
Epoch 24/50
Epoch 25/50
Epoch 26/50
Epoch 27/50
Epoch 28/50
Epoch 29/50
Epoch 30/50
Epoch 31/50
Epoch 32/50
Epoch 33/50
Epoch 34/50
Epoch 35/50
Epoch 36/50
Epoch 37/50
Epoch 38/50
Epoch 39/50
Epoch 40/50
Epoch 41/50
Epoch 42/50
Epoch 43/50
Epoch 44/50
Epoch 45/50
Epoch 46/50
Epoch 47/50
Epoch 48/50
Epoch 49/50
Epoch 50/50


<keras.callbacks.History at 0x1fd332b5750>

In [11]:
# Testing

import numpy as np
from tensorflow.keras.preprocessing import image

In [48]:
# Testing 1 Class - 'himgri'
img1 = image.load_img('AI_Assignment3/test_data/himgri/IMG_5384.JPG',target_size=(240,240))
img1 = image.img_to_array(img1)
img1 = np.expand_dims(img1,axis=0)
pred1 = np.argmax(model.predict(img1))
print(pred1)
img1_output = ['blasti','bonegl','brhkyt','cbrtsh',
               'cmnmyn','gretit','hilpig','himbul',
               'himgri','hsparo','indvul','jglowl',
               'lbicrw','mgprob','rebimg','wcrsrt']
print(img1_output[pred1])

8
himgri


In [58]:
# Testing 2 Class - 'cmnmyn'
img2 = image.load_img('AI_Assignment3/test_data/cmnmyn/DSC_5137.jpg',target_size=(240,240))
img2 = image.img_to_array(img2)
img2 = np.expand_dims(img2,axis=0)
pred2 = np.argmax(model.predict(img2))
print(pred2)
img2_output = ['blasti','bonegl','brhkyt','cbrtsh',
               'cmnmyn','gretit','hilpig','himbul',
               'himgri','hsparo','indvul','jglowl',
               'lbicrw','mgprob','rebimg','wcrsrt']
print(img2_output[pred2])

4
cmnmyn


In [61]:
# Testing 3 Class - 'gretit'
img3 = image.load_img('AI_Assignment3/test_data/gretit/8537646712_0b282c4c6a_o.jpg',target_size=(240,240))
img3 = image.img_to_array(img3)
img3 = np.expand_dims(img3,axis=0)
pred3 = np.argmax(model.predict(img3))
print(pred3)
img3_output = ['blasti','bonegl','brhkyt','cbrtsh',
               'cmnmyn','gretit','hilpig','himbul',
               'himgri','hsparo','indvul','jglowl',
               'lbicrw','mgprob','rebimg','wcrsrt']
print(img3_output[pred3])

4
cmnmyn


USING TRANSFER LEARNING

In [1]:
from tensorflow.keras.layers import Dense,Flatten,Input
from tensorflow.keras.models import Model
from tensorflow.keras.preprocessing import image
from tensorflow.keras.preprocessing.image import ImageDataGenerator, load_img
import numpy as np

In [2]:
train_gen = ImageDataGenerator(rescale=(1./255),horizontal_flip=True,shear_range=0.2)
test_gen = ImageDataGenerator(rescale=(1./255))  #--> (0 to 255) convert to (0 to 1)

In [3]:
train = train_gen.flow_from_directory('AI_Assignment3/train_data',
                                      target_size=(224, 224),
                                      class_mode='categorical', 
                                      batch_size=8)
test = test_gen.flow_from_directory('AI_Assignment3/test_data',
                                    target_size=(224, 224),
                                      class_mode='categorical', 
                                      batch_size=8)

Found 150 images belonging to 16 classes.
Found 157 images belonging to 16 classes.


In [4]:
train.class_indices

{'blasti': 0,
 'bonegl': 1,
 'brhkyt': 2,
 'cbrtsh': 3,
 'cmnmyn': 4,
 'gretit': 5,
 'hilpig': 6,
 'himbul': 7,
 'himgri': 8,
 'hsparo': 9,
 'indvul': 10,
 'jglowl': 11,
 'lbicrw': 12,
 'mgprob': 13,
 'rebimg': 14,
 'wcrsrt': 15}

In [5]:
from tensorflow.keras.applications.vgg16 import VGG16, preprocess_input

In [6]:
# Adding the preprocessing layer to the front of vgg

vgg = VGG16(include_top=False,weights='imagenet',input_shape=(224,224,3))

In [7]:
# Train model with existing weights

for layer in vgg.layers:
  print(layer)

<keras.engine.input_layer.InputLayer object at 0x000001CC86DFA650>
<keras.layers.convolutional.conv2d.Conv2D object at 0x000001CC88AD59D0>
<keras.layers.convolutional.conv2d.Conv2D object at 0x000001CC8897BD90>
<keras.layers.pooling.max_pooling2d.MaxPooling2D object at 0x000001CC88B0E2D0>
<keras.layers.convolutional.conv2d.Conv2D object at 0x000001CC88A9C110>
<keras.layers.convolutional.conv2d.Conv2D object at 0x000001CC88B5D6D0>
<keras.layers.pooling.max_pooling2d.MaxPooling2D object at 0x000001CC88AD6C50>
<keras.layers.convolutional.conv2d.Conv2D object at 0x000001CC88B0DBD0>
<keras.layers.convolutional.conv2d.Conv2D object at 0x000001CC88B2B890>
<keras.layers.convolutional.conv2d.Conv2D object at 0x000001CC88B2BE50>
<keras.layers.pooling.max_pooling2d.MaxPooling2D object at 0x000001CC86B45A90>
<keras.layers.convolutional.conv2d.Conv2D object at 0x000001CC88B5EBD0>
<keras.layers.convolutional.conv2d.Conv2D object at 0x000001CC88AD4D90>
<keras.layers.convolutional.conv2d.Conv2D object

In [8]:
# Train model with existing weights

for layer in vgg.layers:
  layer.trainable=False

In [9]:
x = Flatten()(vgg.output)

In [10]:
# output layer

prediction = Dense(16,activation='softmax')(x)

In [11]:
# Create Vgg16 model

model1 = Model(inputs=vgg.input,outputs=prediction)

In [12]:
model1.summary()

Model: "model"
_________________________________________________________________
 Layer (type)                Output Shape              Param #   
 input_1 (InputLayer)        [(None, 224, 224, 3)]     0         
                                                                 
 block1_conv1 (Conv2D)       (None, 224, 224, 64)      1792      
                                                                 
 block1_conv2 (Conv2D)       (None, 224, 224, 64)      36928     
                                                                 
 block1_pool (MaxPooling2D)  (None, 112, 112, 64)      0         
                                                                 
 block2_conv1 (Conv2D)       (None, 112, 112, 128)     73856     
                                                                 
 block2_conv2 (Conv2D)       (None, 112, 112, 128)     147584    
                                                                 
 block2_pool (MaxPooling2D)  (None, 56, 56, 128)       0     

In [13]:
model1.compile(loss='categorical_crossentropy',optimizer='adam',metrics=['accuracy'])

In [14]:
model1.fit_generator(train,validation_data=test,epochs=4,steps_per_epoch=len(train),
                    validation_steps=len(test))

  model1.fit_generator(train,validation_data=test,epochs=4,steps_per_epoch=len(train),


Epoch 1/4
Epoch 2/4
Epoch 3/4
Epoch 4/4


<keras.callbacks.History at 0x1cc88a92dd0>

In [30]:
# Testing

import numpy as np
from tensorflow.keras.preprocessing import image

In [45]:
# Testing 1 Class - 'blasti'
img1 = image.load_img('AI_Assignment3/test_data/blasti/DSC_6397.jpg',target_size=(224,224))
img1 = image.img_to_array(img1)
img1 = np.expand_dims(img1,axis=0)
pred1 = np.argmax(model1.predict(img1))
print(pred1)
img1_output = ['blasti','bonegl','brhkyt','cbrtsh',
               'cmnmyn','gretit','hilpig','himbul',
               'himgri','hsparo','indvul','jglowl',
               'lbicrw','mgprob','rebimg','wcrsrt']
print(img1_output[pred1])

0
blasti


In [36]:
# Testing 2 Class - 'cmnmyn'
img2 = image.load_img('AI_Assignment3/test_data/cmnmyn/DSC_5137.jpg',target_size=(224,224))
img2 = image.img_to_array(img2)
img2 = np.expand_dims(img2,axis=0)
pred2 = np.argmax(model1.predict(img2))
print(pred2)
img2_output = ['blasti','bonegl','brhkyt','cbrtsh',
               'cmnmyn','gretit','hilpig','himbul',
               'himgri','hsparo','indvul','jglowl',
               'lbicrw','mgprob','rebimg','wcrsrt']
print(img2_output[pred2])

4
cmnmyn


In [44]:
# Testing 3 Class - 'himbul'
img3 = image.load_img('AI_Assignment3/test_data/himbul/6154954471_eefe6e00d1_o.jpg',target_size=(224,224))
img3 = image.img_to_array(img3)
img3 = np.expand_dims(img3,axis=0)
pred3 = np.argmax(model1.predict(img3))
print(pred3)
img3_output = ['blasti','bonegl','brhkyt','cbrtsh',
               'cmnmyn','gretit','hilpig','himbul',
               'himgri','hsparo','indvul','jglowl',
               'lbicrw','mgprob','rebimg','wcrsrt']
print(img3_output[pred3])

7
himbul
