MultiFlare

The dataset comes with an Excel sheet containing the columns image_id, split_type (train or test), and the target classes (class 1 to class 29). The target columns are a binarized encoding of the labels, very similar to one-hot encoding.
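As a rough illustration of the binarized target, the sketch below turns a set of class numbers into a 29-element 0/1 row. It assumes an image may carry more than one class (which is what distinguishes this from strict one-hot encoding); the helper name `binarize` and the example labels are hypothetical and not taken from the repository.

```python
NUM_CLASSES = 29  # class 1 .. class 29, as in the spreadsheet


def binarize(labels, num_classes=NUM_CLASSES):
    """Return a 0/1 list with a 1 at the position of every class in `labels`."""
    row = [0] * num_classes
    for label in labels:
        if not 1 <= label <= num_classes:
            raise ValueError(f"class {label} is outside 1..{num_classes}")
        row[label - 1] = 1  # classes are 1-based, list indices are 0-based
    return row


# Hypothetical image tagged with class 3 and class 29.
example_row = binarize([3, 29])
print(len(example_row), sum(example_row))
```

With a single label per image the row reduces to ordinary one-hot encoding.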

The training dataset has 64674 images and the test dataset has 7186 images. All preprocessing is done on the training dataset, and validation is done on the test dataset. The training dataset is highly imbalanced, and the images come in different sizes (for example 1024x1024x3 and 475x475x3), so every image is resized to a standard size of 400 x 400 x 3.
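The resizing step could look roughly like the sketch below. It assumes Pillow and NumPy are available and uses Pillow's default resampling filter; the function name `resize_to_standard` and the synthetic images are stand-ins, since the repository's actual preprocessing code is not shown here.

```python
import numpy as np
from PIL import Image

TARGET_SIZE = (400, 400)  # (width, height) stated in the text


def resize_to_standard(image, size=TARGET_SIZE):
    """Convert a PIL image to RGB, resize it, and return an HxWx3 uint8 array."""
    resized = image.convert("RGB").resize(size)
    return np.asarray(resized, dtype=np.uint8)


# Synthetic stand-ins for the two source sizes mentioned above.
big = Image.new("RGB", (1024, 1024), color=(255, 0, 0))
small = Image.new("RGB", (475, 475), color=(0, 255, 0))
shapes = [resize_to_standard(img).shape for img in (big, small)]
print(shapes)
```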

dataframe snippet

Histogram showing the level of imbalance

Evaluation Metrics

The primary metrics for the project are F1 micro and F1 macro. The best-fit model has an F1 macro score of 0.459 and an F1 micro score of 0.39 on the eval set.
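To make the difference between the two averages concrete, here is a small pure-Python sketch for binarized targets. Macro F1 averages the per-class F1 scores, while micro F1 pools true positives, false positives, and false negatives across all classes first. The toy labels are invented for illustration, and treating F1 as 0 for a class with no positives is a convention chosen here, not something stated in the project.

```python
def f1(tp, fp, fn):
    """F1 from raw counts; defined as 0.0 when there is nothing to score."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def f1_micro_macro(y_true, y_pred):
    """Return (micro, macro) F1 for lists of equal-length 0/1 rows."""
    n_classes = len(y_true[0])
    counts = []
    for c in range(n_classes):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t[c] == 1 and p[c] == 1)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t[c] == 0 and p[c] == 1)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t[c] == 1 and p[c] == 0)
        counts.append((tp, fp, fn))
    macro = sum(f1(*c) for c in counts) / n_classes
    micro = f1(*(sum(col) for col in zip(*counts)))
    return micro, macro


# Toy data: 4 images, 3 classes; the two rare classes are never predicted.
y_true = [[1, 0, 0], [1, 0, 0], [1, 1, 0], [0, 0, 1]]
y_pred = [[1, 0, 0], [1, 0, 0], [1, 0, 0], [0, 0, 0]]
micro, macro = f1_micro_macro(y_true, y_pred)
print(micro, macro)
```

In this toy case the frequent class is predicted perfectly and the two rare classes are missed, so the macro score is pulled down much further than the micro score.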

Modeling

I trained two models: a pretrained ResNet model and a custom CNN model. The best fit is a convolutional neural network with 7 convolutional layers. Every Conv2d layer is paired with a BatchNorm layer and a pooling layer. Hyperparameters such as the activation function, batch size, and learning rate were experimented with and tuned over the course of the project.
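A minimal PyTorch sketch of such a network is shown below: seven blocks, each with a Conv2d, a BatchNorm2d, and a pooling layer, followed by a linear head with 29 outputs. The channel widths, 3x3 kernels, ReLU activation, max pooling, and the global-average-pooling head are all assumptions made for illustration; the text only fixes the number of convolutional layers and the Conv2d/BatchNorm/pooling pattern.

```python
import torch
from torch import nn

NUM_CLASSES = 29
# Assumed channel widths; the source only states that there are 7 conv layers.
CHANNELS = (16, 32, 64, 128, 128, 256, 256)


def conv_block(in_ch, out_ch):
    """Conv2d -> BatchNorm2d -> ReLU -> 2x2 max pooling (halves height and width)."""
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
        nn.MaxPool2d(kernel_size=2),
    )


class SketchCNN(nn.Module):
    """Hypothetical 7-block CNN; not the repository's actual architecture."""

    def __init__(self, num_classes=NUM_CLASSES, channels=CHANNELS):
        super().__init__()
        blocks, in_ch = [], 3
        for out_ch in channels:
            blocks.append(conv_block(in_ch, out_ch))
            in_ch = out_ch
        self.features = nn.Sequential(*blocks)
        self.pool = nn.AdaptiveAvgPool2d(1)  # collapse the remaining 3x3 map
        self.classifier = nn.Linear(in_ch, num_classes)

    def forward(self, x):
        x = self.features(x)
        x = self.pool(x).flatten(1)
        return self.classifier(x)  # raw logits, one per class


model = SketchCNN().eval()
with torch.no_grad():
    logits = model(torch.zeros(1, 3, 400, 400))  # one 400 x 400 RGB image
print(logits.shape)
```

If the targets are multi-label, the logits would typically be paired with `nn.BCEWithLogitsLoss` during training and passed through a sigmoid at inference; that pairing is likewise an assumption rather than something the text confirms.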

