
RetinaNet (pytorch-lightning)

RetinaNet is a one-stage object detection architecture introduced in 2017 in the paper Focal Loss for Dense Object Detection (Lin et al.). Please refer to the original paper for more details.

Training

Prerequisites:

  • A CUDA-enabled GPU
  • CUDA version 10.1 or higher
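
Once PyTorch is installed (step 2 below), you can verify that it sees your GPU and CUDA version with a quick check like the following (a convenience snippet, not part of the repo):

import torch
print(torch.cuda.is_available())  # should print True on a CUDA-enabled machine
print(torch.version.cuda)         # should report 10.1 or higher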
  1. Clone the repo:
git clone https://github.com/bishwarup307/retinanet-lightning.git
  2. Install the requirements (preferably in a virtual environment):
cd retinanet-lightning
pip install -r requirements.txt
  3. Format your dataset:

Your data needs to be in COCO format and have the following directory structure:

root
    ├── images
    │    ├── train
    │    ├── val
    │    ├── test
    │
    └── annotations
         ├── train.json
         ├── val.json
         ├── test.json
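
If you want to sanity-check that the annotation files are valid COCO JSON, a quick look with pycocotools (a convenience, not something this repo requires) could look like the sketch below; the paths assume the layout shown above:

from pycocotools.coco import COCO

coco = COCO("root/annotations/train.json")  # builds the index if the file parses correctly
print(len(coco.getImgIds()), "images")
print(len(coco.getCatIds()), "categories")
print(len(coco.getAnnIds()), "annotations")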
  4. Make changes in retinanet/config.yaml. All your training parameters (e.g., learning rate, batch size, augmentations and more) can be configured there.
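
The config is plain YAML, so it can also be inspected programmatically; the snippet below only lists the top-level sections and makes no assumption about the exact keys inside them:

import yaml

with open("retinanet/config.yaml") as f:
    cfg = yaml.safe_load(f)
print(list(cfg.keys()))  # top-level sections, e.g. the Trainer section referenced later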

  5. Run the training:

python train.py
  6. Run TensorBoard:
tensorboard --logdir <logdir>/lightning_logs

where <logdir> is the logdir you specified in config.yaml.

Distributed Training

If you want to run the training in distributed mode, just specify the number of GPUs (gpus) you want to use in the Trainer section of config.yaml. By default it uses PyTorch's DistributedDataParallel mode. See the PyTorch Lightning multi-GPU training documentation for more options.
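
For reference, the gpus value maps onto the Lightning Trainer's multi-GPU settings. With an older (1.x-style) Lightning API this is roughly equivalent to the sketch below; newer Lightning versions spell the DDP flag as strategy="ddp", and this is not the repo's exact code:

from pytorch_lightning import Trainer

# roughly what gpus: 4 in the Trainer section of config.yaml enables,
# with DistributedDataParallel as the default distributed mode
trainer = Trainer(gpus=4, accelerator="ddp")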

FP-16 Training

If you want to take advantage of 16-bit precision training, set amp to True in the Trainer section of config.yaml. We recommend native as your amp_backend, which uses PyTorch's native automatic mixed precision. However, in case you want to use APEX, that can be configured as well.
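
In Lightning terms, amp: True with the native backend corresponds roughly to the Trainer settings sketched below (again assuming a 1.x-style Lightning API, not the repo's exact code):

from pytorch_lightning import Trainer

# 16-bit training with PyTorch's native automatic mixed precision;
# amp_backend="apex" would select NVIDIA APEX instead
trainer = Trainer(gpus=1, precision=16, amp_backend="native")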

Testing

python test.py \
--root /directory/with/test/images \
--image-size 512 \
--weights path/to/checkpoint.ckpt \
--batch-size 8 \
--output-dir path/to/output/dir

Export to ONNX

The best model weights (best epoch) are exported to ONNX as best.onnx inside the logdir as part of the training procedure.

The ONNX model outputs anchors, logits and offsets where:

  • anchors are the anchor boxes. Shape: (A, 4), where A is the number of anchors for a given image
  • logits are the class confidences. Shape: (A, C), where C is the number of classes
  • offsets are the normalized offsets to the anchors. Shape: (A, 4)
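
A minimal sketch of running the exported model with onnxruntime is shown below. The input preprocessing, the output shapes, and the box-decoding constants are assumptions (the decoding follows the common RetinaNet convention of offsets scaled by anchor width and height); check the repo's decoding code for the exact normalization:

import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession("logdir/best.onnx")        # illustrative path
inp = sess.get_inputs()[0]
image = np.zeros((1, 3, 512, 512), dtype=np.float32)   # replace with a preprocessed image
anchors, logits, offsets = sess.run(None, {inp.name: image})  # squeeze a batch dim if present

# decode offsets -> boxes in (x1, y1, x2, y2); standard RetinaNet-style decoding (assumed)
aw, ah = anchors[:, 2] - anchors[:, 0], anchors[:, 3] - anchors[:, 1]
ax, ay = anchors[:, 0] + aw / 2, anchors[:, 1] + ah / 2
cx, cy = ax + offsets[:, 0] * aw, ay + offsets[:, 1] * ah
w, h = np.exp(offsets[:, 2]) * aw, np.exp(offsets[:, 3]) * ah
boxes = np.stack([cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2], axis=-1)
scores = 1.0 / (1.0 + np.exp(-logits))                 # per-class confidences via sigmoid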
