Running Colab code does not give output. #4

samjoy · 2020-12-13T07:59:07Z

When I run the Colab demo code, it does not produce correct predictions. The image titled 'Predictions' shows no bounding boxes or labels.

benihime91 · 2020-12-13T08:26:36Z

@samjoy What was your IOU score? If it's low, you need to set the threshold low as well.
To increase the IOU score you probably need to train for more epochs. For good results the IOU score should be at least 0.4.
Try a different optimizer/scheduler, disable early stopping, and tune the hyperparameters for better results.
The demo was not created for best results; it's just a notebook that shows how to use this repo.
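
For reference, IoU (intersection over union) is the overlap measure between a predicted box and a ground-truth box, and COCO-style AP is computed by sweeping IoU thresholds from 0.50 to 0.95. A minimal sketch of the computation follows, assuming boxes in (x_min, y_min, x_max, y_max) pixel format; `iou_xyxy` is a hypothetical helper written for illustration, not a function from this repo.

```python
def iou_xyxy(box_a, box_b):
    """Intersection over union of two boxes in (x_min, y_min, x_max, y_max) format."""
    # Corners of the overlapping rectangle (if the boxes overlap at all)
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])

    # Width and height are clamped at zero so disjoint boxes give zero overlap
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter

    # Guard against degenerate (zero-area) boxes
    return inter / union if union > 0 else 0.0
```

A prediction that is offset from the ground truth by half its width and height scores only about 0.14 here, which is why slightly misaligned boxes can still give an AP near zero at the stricter thresholds.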

samjoy · 2020-12-13T09:18:02Z

Ok, I am running your code once more. The dataset you used in the demo is in Pascal VOC XML format, right?

samjoy · 2020-12-13T09:53:25Z

Ok my AP is 0 when running the demo. Any idea why?

benihime91 · 2020-12-13T11:25:17Z

Can you give me a link to the Colab notebook?

samjoy · 2020-12-13T11:26:42Z

I mean I did nothing but run your notebook directly. No change

benihime91 · 2020-12-13T11:29:45Z

Wait, let me check. I think I made some changes to the API but forgot to update the notebook.

benihime91 · 2020-12-13T11:31:17Z

Did you change these?

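# INSTANTIATE LIGHTNING-TRAINER with CALLBACKS:
# ============================================================ #
# NOTE:
# For the full list of Trainer arguments see:
# https://pytorch-lightning.readthedocs.io/en/latest/trainer.html
from pytorch_lightning import Trainer
from pytorch_lightning.callbacks import EarlyStopping, LearningRateMonitor

# Log the learning rate at every optimizer step
lr_logger  = LearningRateMonitor(logging_interval="step")
# Stop training once val_loss has not improved for 8 consecutive validation epochs
early_stop = EarlyStopping(mode="min", monitor="val_loss", patience=8)

# Instantiate the Lightning Trainer (argument names as of pytorch-lightning 1.0.x)
trainer    = Trainer(precision=16, gpus=1, callbacks=[lr_logger, early_stop], max_epochs=50, weights_summary="full")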

benihime91 · 2020-12-13T11:31:45Z

What's your loss?

samjoy · 2020-12-13T11:31:58Z

I did make one change. Instead of litModel = RetinaNetModel(hparams=hparams), I used litModel = RetinaNetModel(hparams)

samjoy · 2020-12-13T11:33:33Z

The loss is 5.2

benihime91 · 2020-12-13T11:33:50Z

That's okay, I changed the name of the argument to conf anyway...
I think it will be better if you give me a link to your Colab.

Save the Colab as a GitHub gist and give me the link... I will get back to you.

benihime91 · 2020-12-13T11:34:10Z

5.2 after how many epochs? That's too high...

samjoy · 2020-12-13T11:34:48Z

In 10 epochs with early stopping

samjoy · 2020-12-13T11:35:15Z

I am now running without early stopping, with max_epochs=50

samjoy · 2020-12-13T11:36:05Z

I am using Pascal XML VOC format from roboflow.

samjoy · 2020-12-13T11:40:14Z

I just ran it again without early stopping, with max_epochs=50, and I am getting a loss of 5.28.

benihime91 · 2020-12-13T11:42:51Z

My loss is less than 2 even after 1 epoch with the same basic params. Let it train for some more time and I'll share the gist.

samjoy · 2020-12-13T12:04:56Z

Are you using the BCCD dataset in the Pascal VOC XML format?

benihime91 · 2020-12-13T12:05:50Z

Yes

benihime91 · 2020-12-13T12:07:36Z

So I did a bit of tuning; my optimizer config looks like this now:

hparams.optimizer = {
    "class_name": "torch.optim.SGD", 
    "params"    : {"lr": 0.005, "weight_decay": 0.0001, "momentum":0.9},
    }

Currently at epoch 14 with loss=0.37.

I'm training for 40 epochs, so once that's over I'll share the notebook.

samjoy · 2020-12-13T12:13:38Z

ok thanks

benihime91 · 2020-12-13T12:24:13Z

Please check this: https://colab.research.google.com/gist/benihime91/00996411c8174a81f6c1389750012103/github-retinanet-demo.ipynb

40 epochs: loss=0.543, classification_loss=0.233, regression_loss=0.153, val_loss=0.435

coco-evaluation results:

IoU metric: bbox
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.322
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.751
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.154
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.333
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.321
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.491
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.272
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.405
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.451
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.433
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.398
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.567

samjoy · 2020-12-13T12:24:57Z

I will run it now and let you know about the results

benihime91 · 2020-12-13T12:28:58Z

If it still doesn't work, try installing pytorch-lightning==1.0.0 (but I don't think that should be an issue 😌) and share your notebook with me.

samjoy · 2020-12-13T12:31:03Z

Yeah sure :)

samjoy · 2020-12-13T13:12:35Z

I ran your notebook exactly as is and I am getting poor results.
40 epochs: loss=2.91, v_num=0, classification_loss=2.39, regression_loss=0.518, val_loss=2.88

DATALOADER:0 TEST RESULTS
{'AP': tensor(0., dtype=torch.float64),
'val_loss': tensor(2.8816, device='cuda:0')}

[{'AP': 0.0, 'val_loss': 2.8816070556640625}]

samjoy · 2020-12-13T13:13:56Z

I just opened the link and ran the notebook and I am getting the above poor results

benihime91 · 2020-12-13T13:39:09Z

Can you try with pytorch-lightning version 1.0.0? Just run pip install pytorch-lightning==1.0.0

benihime91 · 2020-12-13T13:40:07Z

If it doesn't work, please share your notebook with me. Otherwise I'm afraid I won't be able to do anything more.

samjoy · 2020-12-13T13:45:21Z

Did you run your notebook on Colab or some other platform?

benihime91 · 2020-12-13T13:46:24Z

On Colab itself

samjoy · 2020-12-13T13:48:35Z

Can you list the versions of all the essential libraries that you used, such as pytorch-lightning, pytorch, torchvision, etc.?

benihime91 · 2020-12-13T13:52:39Z

The only library that may cause conflicts is pytorch-lightning, because it had some massive changes. That's why I am saying try with pytorch-lightning version 1.0.0. The other libraries are all installed by default in Colab, and OmegaConf and albumentations should not cause conflicts.

!pip install pytorch-lightning==1.0.0
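
After pinning the package, it can help to confirm which version is actually installed, since a notebook that has already imported the library typically keeps the old one in memory until the runtime is restarted. A minimal sketch, assuming Python 3.8+ for `importlib.metadata`; `installed_version` and `is_pinned` are hypothetical helper names, not part of this repo or of pytorch-lightning.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string of `package`, or None if it is missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def is_pinned(package, expected):
    """True only if `package` is installed at exactly the `expected` version."""
    return installed_version(package) == expected


# Report what the environment really has before training
print("pytorch-lightning:", installed_version("pytorch-lightning"))
print("pinned to 1.0.0:", is_pinned("pytorch-lightning", "1.0.0"))
```

Inside a running notebook, printing `pytorch_lightning.__version__` after the import shows the version that is actually loaded, which is the one that matters for the training run.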

samjoy · 2020-12-13T13:54:19Z

ok I will do that now

benihime91 · 2020-12-13T13:59:44Z

And also please share your notebook, or I am out of options.

samjoy · 2020-12-13T15:10:58Z

It's working now. You are right, it was due to the pytorch-lightning version. Thanks for your help.
I was comparing the original image and the predictions, and some of the bounding boxes do not align. Any tips on how to fix this?

benihime91 · 2020-12-13T15:13:40Z

That's to be expected. You need to do hyper-parameter tuning: try using the Adam/AdamW optimizer and train for more epochs. As your AP increases, the accuracy of the bounding boxes will also increase.
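
As a rough illustration of that suggestion, the sketch below mirrors the shape of the SGD optimizer config posted earlier in this thread, swapped to AdamW. It assumes the repo resolves `class_name` to an optimizer class the same way for any `torch.optim` optimizer, which is not confirmed here, and the `lr` and `weight_decay` values are placeholders to tune, not tested settings.

```python
# Hypothetical AdamW counterpart of the SGD config shown earlier in the thread.
# The key names ("class_name", "params") follow that config; the values are
# placeholders, not results from an actual run.
adamw_optimizer = {
    "class_name": "torch.optim.AdamW",
    "params": {"lr": 1e-4, "weight_decay": 1e-4},
}

# In the notebook this would be assigned the same way as the SGD config:
# hparams.optimizer = adamw_optimizer
```

AdamW takes no `momentum` argument, so that key is dropped from `params`.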

samjoy · 2020-12-13T15:41:13Z

Ok I will look into that. Again thanks for your help

benihime91 · 2020-12-13T15:45:02Z

I have updated the Colab notebook and requirements.txt, so be sure to get the latest ones.

eapolo · 2021-12-02T23:43:34Z

Hey man:
I get this error
ValueError: Expected y_max for bbox (0.009765625, 0.94140625, 0.05859375, 1.001953125, 1) to be in the range [0.0, 1.0], got 1.001953125.
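
This message looks like a normalized bounding-box range check (albumentations raises errors of this form): `y_max` is 1.001953125, slightly above 1.0, which suggests an annotation that extends just past the image edge. One possible workaround, sketched below under that assumption, is to clip normalized coordinates to [0, 1] before the transform runs. `clip_normalized_bbox` is a hypothetical helper, not part of this repo or of albumentations, and cleaning the annotations themselves may be the better fix.

```python
def clip_normalized_bbox(bbox):
    """Clip the four normalized coordinates of a bbox to the range [0.0, 1.0].

    `bbox` is (x_min, y_min, x_max, y_max, *extra), where any extra fields
    (for example a class label) are passed through unchanged.
    """
    x_min, y_min, x_max, y_max, *extra = bbox

    def clip(value):
        return min(max(value, 0.0), 1.0)

    return (clip(x_min), clip(y_min), clip(x_max), clip(y_max), *extra)


# The box from the error message above, with its label as the fifth field
bad_box = (0.009765625, 0.94140625, 0.05859375, 1.001953125, 1)
print(clip_normalized_bbox(bad_box))
```

Clipping only hides small overshoots like this one; boxes that lie far outside the image usually point to a mismatch between the annotation file and the image size.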

benihime91 closed this as completed Dec 13, 2020

benihime91 pinned this issue Dec 13, 2020
