Glomeruli Detection and Semantic Segmentation (kaggle competition)

The created dataset has 132K images
Time Taken to create 132K images and labels of size 224px is 9mins
Total 162K images of size 224px of which 128K are glomeruli images
Something weird is happening transposing img on the axis (2,0,1) rather than reshaping inc the accuracy by 15%.

can Convolution 3D be added to UNet
IMPORTANT reduce parameters present in the model in a strategic way because model is always overfitting.
Do changing the brightness of img is really need?
Should we change the size of the image?

[ X ] the model was overfitting because there was to many parameters to just classify wheter an image has glomeruli or not.
Current model accuracy is 84% with 3 epochs with Augmentation 3 images.
[ ] Goal accuracy is 95%.
Model is not able to recognize Aug 3 images i.e. images that has different brightness.

can we add conv Nets to VIT? yes.
Instead of Dot product between Keys and queries there should be matrix multiplication i.e. use vector attention.
Added Conv to VIT for upsampling and got acc of 88.37% and loss of 0.27 with 2 epochs without Aug 3 Images.
Which loss function will be the best ?
1. BCELossWithLogits
2. Focal Loss
3. Dice Loss
4. Weighted BCELossWithLogits
Ideas to improve the current VIT-Conv model? -> Nothing work.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
VIT_semantic_segmentation		VIT_semantic_segmentation
basic-unet		basic-unet
classification-segmentation		classification-segmentation
convAttention		convAttention
ensemble_stuff		ensemble_stuff
multiscale_fcn_attn		multiscale_fcn_attn
myModel		myModel
.gitignore		.gitignore
10		10
create_dataset.py		create_dataset.py
create_dataset_v2.py		create_dataset_v2.py
readme.md		readme.md
testing_grounds.ipynb		testing_grounds.ipynb

IAmPara0x/GlomeruliSegmentation