This project investigated the effect that self-supervised backbones on the Faster RCNN based object detection model. This work was done as part of the Master Practical course on Data Innovation Lab in collabaration with the Machine Learning team of Precibake.
The models were implemented using the Detectron2 framework from Facebook AI.The full report and final presentation are attached in the repository.
On training a ResNet-50 feature extractor for a Faster RCNN detector using the self-supervised method of DINO and DetCO as a pretext for object detection, it was found that only those contrastive self-supervsion techniques that preserve global and contextual embeddings from a image can improve the performance on detection!
Using FPN with ResNet speeds up training significantly, while giving similar performance for the same amount of epochs.
This is really useful to speed up network training without loss of accuracy. FPN also help detecting object at different scales easier.
Checkout the report and presentation for a more detailed and complete overview!

