Single Shot Multibox Detector (SSD) (paper) implementation in TensorFlow, in development.
Results of some hand-picked test images through an experimental run with MS COCO, some good and some bad:
Just looking through them, the results are okay but not good enough.
However, there are still major things needed to do that was done in the original paper for COCO but not here:
- Train on 500x500 images (this was 300x300)
- Use COCO trainval (this was only train)
- Use batch size 32 (this was only 8)
Other major improvements needed:
- Implement proper evaluation (mAP)
- Optimize training (currently pretty slow)
- Simple momentum optimizer stopped working (stopped converging) at some point during development, but adding batch normalization made it work again
- MS COCO tools
- This project is still under development--it's especially slow, but here are some instructions anyway.
You need vgg16.npy from this repository, which is what I used for the base network. Unfortunately, it's a big file, and for now it's just uploaded to Mega, so you might need an account. Direct link to npy
For now, the code uses and depends on MS COCO. You need the MS COCO dataset from here. You should at least have the 2014 training images and corresponding 2014 train/val object instance annotations.
Download and install COCO tools from here
Change the COCO paths in coco_loader.py
Test or train with trainer.py