Skip to content

Recipes for computer vision - educational and research use

License

Notifications You must be signed in to change notification settings

mawady/cv-recipes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Recipes for computer vision - educational and research use


Filenaming: Task-Stage-Model-Dataset-Framework

  • Task: ObjectClassification, ObjectDetection, etc...
  • Stage: Train, FineTune, Eval, Infer
  • Model: ResNet, YoloV3, etc...
  • Dataset: ImageNet, COCO, etc...
  • Library: PyTorch, Tensorflow, Keras

Basic level

Object classification

  • ObjectClassificationBinary_FineTune_ResNet_CatsVsDogs_Pytorch Open In Colab
  • ObjectClassificationBinary_Infer_ResNet_CatsVsDogs_Pytorch Open In Colab Open In HuggingFace - Spaces
  • ObjectClassificationMultiClass_FineTune_ResNet_MNIST_Pytorch Open In Colab

Object detection

  • ObjectDetection_Infer_YoloV3_COCO_OpenCV Open In Colab

Object segmentation

  • ObjectSegmentationPanoptic_Infer_DETR_COCO_PyTorch Open In Colab
  • ObjectSegmentationInstance_Infer_MaskRCNN_COCO_PyTorch Open In Colab
  • ObjectSegmentationPanopticAndSemantic_Infer_Mask2Former_Cityscapes_PyTorch Open In Colab

Notes

  • For single-label classification, mirco F1 and accruacy are equal. For multi-label, they are not.
  • For imbalance dataset, use weighted F1 instead of macro F1.
  • Training hacks/Fastai: progressive resizing, gradual unfreezing
  • Baseline - Object Classification [single/multi class, single/multi label]: ResNet, ViT, EfficientNet
  • Baseline - Object Detection [single/multi class, single/multi label]: YoloV3-5, RetinaNet, FasterRCNN
  • Baseline - Object Segementation [semantic / instance]: MaskRCNN, U-NET, FCN, DeepLabV3
  • MultilabelStratifiedKFold: https://github.com/trent-b/iterative-stratification