Applied Machine Learning - Project: Feather in Focus! (BIRDeep)

This is the GitHub repository for the Applied Machine Learning Project: Feather in Focus! This project is associated with a private Kaggle competition, and you can find more details about the competition here.

Description

The goal of the challenge is to classify each bird image by species! The dataset contains 200 bird species, and the aim is to predict them with high accuracy.

Introduction

BIRDeep's model strives for increased robustness by decoupling foreground and background image subsets. This is achieved through background removal, complemented by other accuracy-enhancing features, including:

Data Balancing
Data Augmentation
Hyperparameter Tuning
Use of pre-trained Models

Research

The paper "Noise or signal: The role of image backgrounds in object recognition" (Xiao et al., 2020) presents several background removal methods. BIRDeep uses the Only-FG format shown in Figure 1. This choice eliminates background interactions, enabling the model to concentrate solely on bird characteristics.

Figure 1: Obtained from Xiao, K., Engstrom, L., Ilyas, A., & Madry, A. (2020). Noise or signal: The role of image backgrounds in object recognition. arXiv preprint arXiv:2006.09994.

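The featured pre-trained model is Xception. This model relies on depthwise separable convolutions, which split a standard convolution into a 1x1 pointwise convolution that mixes channels and a depthwise convolution that filters each channel spatially. Figure 2 shows the pointwise step first, followed by the depthwise step.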

Figure 2: Xception convolution procedure. A 1x1 pointwise convolution is applied first, followed by nxn depthwise convolutions.
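To give a feel for why this factorization is attractive, the sketch below compares weight counts for a regular convolution and a depthwise separable one. It is an illustration only: the channel sizes (128 in, 256 out) and the helper names are assumptions, biases are ignored, and the count follows the depthwise-then-pointwise ordering common in library implementations. With the pointwise-first ordering of Figure 2, the depthwise term would use the output channel count instead.

```python
def standard_conv_params(kernel, c_in, c_out):
    """Weights in a regular kernel x kernel convolution (bias ignored)."""
    return kernel * kernel * c_in * c_out


def separable_conv_params(kernel, c_in, c_out):
    """Weights in a depthwise separable convolution (bias ignored)."""
    depthwise = kernel * kernel * c_in   # one spatial filter per input channel
    pointwise = c_in * c_out             # 1x1 convolution mixing channels
    return depthwise + pointwise


# Illustrative layer sizes, not taken from the BIRDeep notebooks.
standard = standard_conv_params(3, 128, 256)
separable = separable_conv_params(3, 128, 256)
print(standard, separable, round(standard / separable, 1))
# prints: 294912 33920 8.7
```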

Data Augmentation

To emulate the Only-FG image format, BIRDeep extracts the foreground with GrabCut, an interactive foreground selection method. Instead of user input, a predefined cutting area is used, which keeps the method simple but occasionally yields suboptimal results. In BIRDeep, a rectangular boundary (50, 50, 200, 200) is set over 299x299 pixel images, as illustrated in Figure 3.

Figure 3: BIRDeep's foreground and background extraction process based on GrabCut masking.
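The extraction step could look roughly like the sketch below. It is not the notebook code: it assumes OpenCV's `cv2.grabCut` is the GrabCut implementation, that the rectangle follows OpenCV's (x, y, width, height) convention, and that five iterations are enough. The names `rect_corners` and `extract_foreground` are hypothetical.

```python
RECT = (50, 50, 200, 200)   # rectangle quoted in the text, read as (x, y, w, h)
IMAGE_SIZE = (299, 299)     # (width, height) quoted in the text


def rect_corners(rect, image_size):
    """Return (x0, y0, x1, y1) for an (x, y, w, h) rectangle, checking it fits."""
    x, y, w, h = rect
    width, height = image_size
    x1, y1 = x + w, y + h
    if x < 0 or y < 0 or w <= 0 or h <= 0 or x1 > width or y1 > height:
        raise ValueError(f"rectangle {rect} does not fit in image {image_size}")
    return x, y, x1, y1


def extract_foreground(image_bgr, rect=RECT, iterations=5):
    """Run rectangle-initialised GrabCut and black out the background."""
    # Imported lazily so rect_corners stays usable without OpenCV installed.
    import cv2
    import numpy as np

    height, width = image_bgr.shape[:2]
    rect_corners(rect, (width, height))  # fail early on a bad rectangle

    mask = np.zeros((height, width), np.uint8)
    bgd_model = np.zeros((1, 65), np.float64)  # scratch buffers required by OpenCV
    fgd_model = np.zeros((1, 65), np.float64)
    cv2.grabCut(image_bgr, mask, rect, bgd_model, fgd_model,
                iterations, cv2.GC_INIT_WITH_RECT)

    # Keep pixels labelled definite or probable foreground.
    keep = (mask == cv2.GC_FGD) | (mask == cv2.GC_PR_FGD)
    return image_bgr * keep[:, :, None].astype(image_bgr.dtype)
```

Under these assumptions the rectangle spans pixels 50 to 250 on both axes, so a bird that extends beyond that window is partly cut off, which matches the tradeoff described above.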

Each foreground-only image is added to the training dataset alongside four augmented copies:

Image horizontal flip
Image rotation by 90 degrees
Image contrast increased by 50%
Image contrast decreased by 50%

Figure 4: BIRDeep's data augmentation processes applied over a single image.
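The four augmentations can be sketched with NumPy as follows. This is a minimal illustration, not the notebook implementation: it assumes images are HxWxC `uint8` arrays, that the rotation is counter-clockwise, and that a 50% contrast change means scaling each pixel's deviation from the mean intensity by 1.5 or 0.5. The function names are hypothetical.

```python
import numpy as np


def adjust_contrast(image, factor):
    """Scale pixel deviations from the mean intensity by `factor`."""
    pixels = image.astype(np.float32)
    mean = pixels.mean()
    scaled = mean + factor * (pixels - mean)
    return np.clip(scaled, 0, 255).astype(np.uint8)


def augment(image):
    """Return four augmented copies of an HxWxC uint8 image."""
    return {
        "flip": np.fliplr(image),                      # horizontal flip
        "rotate90": np.rot90(image),                   # 90 degrees, counter-clockwise
        "contrast_up": adjust_contrast(image, 1.5),    # contrast +50%
        "contrast_down": adjust_contrast(image, 0.5),  # contrast -50%
    }
```

Each foreground-only image plus its four copies would then give five training samples per original image.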

Results

Baseline Model

The Baseline Model achieved a test accuracy of 58%.

Mixed Model

The Mixed Model approach can be found here and involves three Jupyter notebooks:

Generating Subfolder - Generates the needed folder structure for the Mixed Model
Data Balancing - Performs data balancing to use only 27 files
The Mixed Model achieved a test accuracy of 60%. This approach mainly uses augmented Only-FG images, along with some raw images.
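The balancing step is described only as limiting the data to 27 files. A minimal sketch of one way to do this, assuming the cap applies per class and that samples are (path, label) pairs; the function name `balance` and the seeded random selection are hypothetical and not taken from the notebooks:

```python
import random
from collections import defaultdict


def balance(samples, per_class=27, seed=0):
    """Keep at most `per_class` randomly chosen files for every label."""
    by_label = defaultdict(list)
    for path, label in samples:
        by_label[label].append(path)

    rng = random.Random(seed)  # fixed seed so the selection is repeatable
    balanced = []
    for label in sorted(by_label):
        paths = by_label[label]
        chosen = rng.sample(paths, min(per_class, len(paths)))
        balanced.extend((path, label) for path in chosen)
    return balanced
```

Classes with fewer files than the cap are kept whole in this sketch rather than oversampled.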

All the other branches were created for testing purposes and can be ignored regarding the submission for Applied Machine Learning.

Contact

Name	Email
Alina Baciu	alina.baciu@student.uva.nl
Thomas Erhard	thomas.erhard@student.uva.nl
Jaime Pons	jaime.pons@student.uva.nl
Leonardo Provenzano	leonardo.provenzano@student.uva.nl

Project Link: https://github.com/Jaime47/BIRDeep

Images Loading

To facilitate the efficient loading of images, we developed a function that creates distinct subfolders for each class and organizes the images accordingly. To streamline the process of transferring the dataset across different machines, such as Colab and Snellius, we uploaded the directory containing both train and test images to Roboflow. Leveraging Roboflow's export capabilities, we were able to easily move the images to our desired destinations whenever needed.
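The folder-organizing function is not reproduced in this README. A minimal sketch of the idea, assuming a flat directory of images and a mapping from file name to class label; the name `organize_by_class` and its parameters are hypothetical:

```python
import shutil
from pathlib import Path


def organize_by_class(image_dir, labels, output_dir):
    """Copy each image into output_dir/<label>/, creating folders as needed."""
    image_dir, output_dir = Path(image_dir), Path(output_dir)
    for filename, label in labels.items():
        class_dir = output_dir / str(label)
        class_dir.mkdir(parents=True, exist_ok=True)
        shutil.copy2(image_dir / filename, class_dir / filename)
    # Return the class folder names that now exist under output_dir.
    return sorted(p.name for p in output_dir.iterdir() if p.is_dir())
```

A one-folder-per-class layout like this is what many image loading utilities expect, which is likely why it makes loading more efficient.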
