Skip to content

Kaushal28/Global-Wheat-Detection

Repository files navigation

The challenge in this Kaggle competition is to detect the wheat heads and draw bounding boxes around all the detected wheat heads in the given image. Here are a few example of the predictions generated using FasterRCNN model:

Here are a few approaches I tried to approach the problem with brief description and achieved score on public leader board (score is mAP (mean average precision) in this competitions).

Experiment #1 (0.6426 on public LB)

  • Used model: COCO Pre-trained FasterRCNN with ResNet50 backbone
  • No image augmentations
  • Other settings:
    • Image size:1024 x 1024
    • No CV
    • 5 Epochs
    • Batch size: 16 images

Experiment #2 (0.6547 on public LB)

  • Used model: COCO Pre-trained FasterRCNN with ResNet50 backbone
  • No image augmentations
  • 1 round of training on pseudo Labels
  • 5 epochs to train on original data, 4 epochs for training on pseudo labels

Experiment #3 (0.6575 on public LB)

  • Used model: COCO Pre-trained FasterRCNN with ResNet50 backbone
  • No image augmentations
  • 2 rounds of training on pseudo Labels
  • 5 epochs to train on original data, 4 epochs/round for training on pseudo labels

Experiment #4 (0.6528 on public LB)

  • Used model: COCO Pre-trained FasterRCNN with ResNet50 backbone
  • No image augmentations
  • 3 rounds of training on pseudo Labels
  • 5 epochs to train on original data, 4 epochs/round for training on pseudo labels

Experiment #5 (0.6498 on public LB)

Experiment #6 (0.6826 on public LB)

  • Used model: COCO Pre-trained FasterRCNN with ResNet50 backbone
  • Random Brightness, Contrass, Horizontal and Vertical flips and Hue Saturation Value Image Augmentations
  • 2 rounds of psuedo labeling
  • 12 epochs on augmented dataset (mentioned on exp #5) and 4 epochs/round for training on pseudo labels

Experiment #7 (0.6582 on public LB)

  • Used model: COCO Pre-trained FasterRCNN with ResNet50 backbone
  • Random Brightness, Contrass, Horizontal and Vertical flips, Hue Saturation Value Image Augmentations and Random Cutout (with handling of bounding boxes)
  • No pseudo labeling
  • 20 epochs on augmented dataset: https://www.kaggle.com/kaushal2896/gwdaugmented/version/4

Experiment #8 (0.6624 on public LB)

Experiment #9 (0.6817 on public LB)

  • Ensemble of Exp. #8 and Exp. #7
  • Used WBF (Weighted Boxes Fusion) as an ensemble technique.

Experiment #10 (0.6824 on public LB)

  • Used model: COCO Pre-trained FasterRCNN with ResNet101 backbone
  • Random Brightness, Contrass, Horizontal and Vertical flips, Hue Saturation Value Image Augmentations
  • 5-fold Cross Validation with WBF ensemble
  • 5 epochs per fold

Experiment #11 (0.6910 on public LB)

  • Used model: COCO Pre-trained FasterRCNN with ResNet101 backbone
  • Horizontal and Vertical flips, Only one of Random Brightness Contrass or Hue Saturation Value Image Augmentations
  • 5-fold Cross Validation with WBF ensemble
  • 20 epochs per fold

Experiment #12 (0.6859 on public LB)

  • Used model: Pretrained EfficientDet D5
  • Horizontal and Vertical flips, Only one of Random Brightness Contrass or Hue Saturation Value Image Augmentations
  • No CV
  • 3 epochs

Experiment #13 (0.7147 on public LB)

  • Used model: Pretrained EfficientDet D5
  • Horizontal and Vertical flips, Only one of Random Brightness Contrass or Hue Saturation Value Image Augmentations
  • No CV
  • 20 epochs

Experiment #14 (0.7194 on public LB)

  • Used model: Pretrained EfficientDet D5
  • Horizontal and Vertical flips, Only one of Random Brightness Contrass or Hue Saturation Value Image Augmentations
  • 5 fold CV with EBF ensemble
  • 30 epochs per fold

Experiment #15 (0.7490 on public LB)

  • Pretraied two EfficientDet D5 models trained on full dataset for 30 epochs
  • Horizontal and Vertical flips, Only one of Random Brightness Contrass or Hue Saturation Value Image Augmentations
  • Pseudo labeling + TTA + Ensemble

Experiment #16 (0.7432 on public LB)

  • Pretrained YOLOv5x
  • Default yolov5 augmentations (includes all above augmentations)
  • 5 Fold CV
  • 35 epochs per fold

About

Object Detection for wheat heads in the given images

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published