
Evaluation always produces mAP of 0.0 when using backbones other than Resnet50 #647

Open
jpxrc opened this issue Aug 26, 2018 · 48 comments

@jpxrc commented Aug 26, 2018

First and foremost, thank you for the awesome package! The dataset I am using consists of satellite images with 29 different classes. I have been able to train and evaluate a retinanet model on this dataset using the default 'resnet50' backbone on a subset of the 29 classes.

However, when I switch over to training and evaluating a model with a different backbone network such as 'densenet121', the mAP score for every class is zero. I don't get any errors during training (I also use the random-transform flag for each epoch) or when converting the model (I also supply the --backbone='densenet121' flag), and it converts successfully. I can also see the losses being optimized during training, so it is definitely detecting and classifying the objects in the images.

I even tried using the original resnet50 model trained on a subset of classes to see if it would pick up those classes on the full dataset with 29 classes, and it still produces an output of zero. I looked at the validation_annotations.csv file for both cases and the formatting is identical, so I don't think the issue is with the annotation files.

I have attached the validation_annotations.csv and classes.csv files (converted to .txt files in order to attach them here):
common_classes.txt
common_validation_annotations.txt

Any ideas what could be going on?

EDIT: I just compared a Resnet50 model and a Densenet121 model, both trained on the same dataset that I know works, and the problem is definitely with the densenet121 implementation, because the Resnet50 model does produce output during evaluation.

jpxrc changed the title from "Evaluation always produces an mAP of 0.0" to "Evaluation always produces mAP of 0.0" on Aug 26, 2018
jpxrc changed the title from "Evaluation always produces mAP of 0.0" to "Evaluation always produces mAP of 0.0 when using models other than Resnet50" on Aug 26, 2018
jpxrc changed the title from "Evaluation always produces mAP of 0.0 when using models other than Resnet50" to "Evaluation always produces mAP of 0.0 when using backbones other than Resnet50" on Aug 26, 2018
@hgaiser (Contributor) commented Aug 30, 2018

Densenet is a community contribution and I have never really used it. If you find out what the problem is then a PR will be welcome :)

@birham-red-bd

I have the same problem since I changed the backbone to ShuffleNet. The loss is decreasing, but the mAP is always zero. However, when I trained the model with the resnet50 backbone, everything was okay. I still have not found the problem. Can anyone give me some advice?

@hgaiser (Contributor) commented Sep 4, 2018

Do you evaluate during training, or after training using the evaluate tool? For densenet and mobilenet I noticed there is a bug when preprocessing the images in the evaluate tool. Those backbones require a mode='tf' in the preprocess_image call which isn't there currently.

@birham-red-bd

Yeah, I evaluate the mAP during training. The dataset I used is Pascal VOC2007. And I did not use a pretrained model, because I did not find pretrained ImageNet weights for ShuffleNet-V2.

@hgaiser (Contributor) commented Sep 4, 2018

@songhuizhong I'm not sure what you mean by shufflenet-V2; we don't have that backbone in our repository.

@birham-red-bd

It is a model I built myself. Here is the paper introducing ShuffleNet-V2: https://arxiv.org/abs/1807.11164

@hgaiser (Contributor) commented Sep 7, 2018

In that case I can't help. I only have experience with ResNet backbones. If you find out the solution to this then a PR is welcome.

@birham-red-bd

Well, I found the problem; it was my fault. The reason the mAP is 0 is that the feature maps fed into the feature pyramid network were in the wrong order.
This is the wrong one:
layer_names = ['1x1conv5_out','stage4/block1/relu_1x1conv_1','stage3/block1/relu_1x1conv_1']
and this is the correct one:
layer_names = ['stage3/block1/relu_1x1conv_1','stage4/block1/relu_1x1conv_1','1x1conv5_out']
But I could only train this model to a best mAP of 44.59% on the VOC2007 test set. The training set is VOC2012 trainval. The input size is changed to 512*512 because I want to use a larger batch size of 28. And I trained this model without using pretrained ImageNet weights.
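For reference, the ordering matters because keras-retinanet's model builder expects the backbone feature maps from shallowest to deepest, i.e. [C3, C4, C5] (strides 8, 16, 32). A minimal sketch of wiring a custom backbone that way; the ShuffleNet layer names are the ones from the comment above, and the retinanet() signature with backbone_layers is assumed from the library's resnet backbone and may differ between versions:

from keras_retinanet.models import retinanet

def shufflenet_retinanet(num_classes, inputs, shufflenet_model, **kwargs):
    # C3, C4, C5 in increasing-stride order (8, 16, 32)
    layer_names = ['stage3/block1/relu_1x1conv_1',   # C3
                   'stage4/block1/relu_1x1conv_1',   # C4
                   '1x1conv5_out']                   # C5
    backbone_layers = [shufflenet_model.get_layer(name).output for name in layer_names]
    return retinanet.retinanet(inputs=inputs, num_classes=num_classes,
                               backbone_layers=backbone_layers, **kwargs)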

@hgaiser (Contributor) commented Sep 18, 2018

Alright, I'll assume this issue is resolved then.

hgaiser closed this as completed Sep 18, 2018
@hgaiser (Contributor) commented Sep 18, 2018

Actually, the original issue was something else entirely (DenseNet), so I'm reopening.

hgaiser reopened this Sep 18, 2018
@jyu-theartofml

I ran into the same issue using resnet50 as the backbone (training and then using evaluate.py). I fixed it by specifying image-min-side and image-max-side; otherwise the default values of those arguments don't match my image dimensions (256x256). An example call is shown below.
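For example (the two flags are standard retinanet-evaluate options; the CSV and model paths are placeholders):

retinanet-evaluate --image-min-side 256 --image-max-side 256 csv val_annotations.csv classes.csv model_inference.h5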

@MAGI003769 commented Dec 7, 2018

Above all, thanks for your awesome work @hgaiser. I met the same problem as @yyannikb. With 'densenet121' as the backbone, I get a massive number of detections and all the box scores are 1; yes, 1 is the only score value that appears. Consequently, it leads to a mAP of zero.

My project uses mammography images from the DDSM dataset. For training, I set image_max_side=666 and image_min_side=400, start from the pretrained model from the keras repo, and use a learning rate of 0.003. Has anyone resolved this problem? I would appreciate it if you could share your experience. Thanks a lot.

@wxinbeings

When I trained with mobilenet224, I got the same issue. Has anyone resolved this problem? I would appreciate it if you could share your experience. Thanks a lot.

@ozyilmaz commented Feb 2, 2019

Do you evaluate during training, or after training using the evaluate tool? For densenet and mobilenet I noticed there is a bug when preprocessing the images in the evaluate tool. Those backbones require a mode='tf' in the preprocess_image call which isn't there currently.

This was the solution for mobilenet. You have to hack the 'evaluate_coco' method in coco_eval.py.
Change this line: image = generator.preprocess_image(image)
To this: image = generator.preprocess_image(image, mode='tf')
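For background, the modes mirror keras' imagenet_utils preprocessing: mode='caffe' converts RGB to BGR and subtracts the ImageNet channel means (what the resnet50 backbone expects), while mode='tf' scales pixel values to the range [-1, 1] (what the mobilenet and densenet backbones expect). For CSV or Pascal VOC evaluation, the analogous call is in the _get_detections function of keras_retinanet/utils/eval.py (as a later comment in this thread also points out); a sketch of the same one-line change there:

# keras_retinanet/utils/eval.py, inside _get_detections()
image = generator.preprocess_image(image, mode='tf')  # was: generator.preprocess_image(image)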

@tommysugiarto

Hi, I also got the same problem when trying to run inference with a densenet121 backbone model. Does anyone have an idea how to solve it?
Thanks a lot

@wxinbeings

Hi, I also got the same problem when trying to run inference with a densenet121 backbone model. Does anyone have an idea how to solve it?
Thanks a lot

It's the same issue as with mobilenet; just change the same place as @ozyilmaz commented.

@tonmoyborah

@ozyilmaz when I do this, it throws an error that preprocess_image got an unexpected keyword argument 'mode'. Is it only me, or is there an obvious step I am missing?

@ozyilmaz

@ozyilmaz when I do this, it throws an error that preprocess_image got an unexpected keyword argument 'mode'. Is it only me, or is there an obvious step I am missing?

@tonmoyborah, it is hard to guess, but it seems like the generator object does not have the correct "preprocess_image" method.
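One way that error can happen: in the versions of the library from around this time, the Generator class in keras_retinanet/preprocessing/generator.py has its own preprocess_image wrapper around the utility in keras_retinanet/utils/image.py, and that wrapper does not accept a mode keyword. A rough sketch of threading it through, assuming that layout (later versions instead let the backbone's preprocess_image be passed into the generators, which is the cleaner fix; see the common_args discussion further down):

# keras_retinanet/preprocessing/generator.py -- sketch, not the exact upstream code
from keras_retinanet.utils.image import preprocess_image as _preprocess_image

class Generator(object):
    def preprocess_image(self, image, mode='caffe'):
        # forward the mode so the evaluation code can request 'tf'-style preprocessing
        return _preprocess_image(image, mode=mode)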

@Cospel commented Apr 30, 2019

The same happened to me when using mobilenetv1/v2.
However, when I set the image to a fixed 800x800 input size for training and evaluation, convergence works great. If I change it to any other resolution, like 801x801, the model does not converge. If I set the input to the mobilenet as inputs = keras.layers.Input(shape=(size, size, 3)) (and not (None, None, 3)), then the model works as expected.

Can anyone explain this strange behaviour?
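For reference, the place where that shape is normally set is where the backbone builds its input tensor when none is supplied; a tiny sketch of the two alternatives (plain keras, the fixed size is the one the comment above reports as working):

import keras

# (None, None, 3) lets the network accept arbitrary image sizes:
inputs = keras.layers.Input(shape=(None, None, 3))
# ...whereas the comment above reports mobilenet only converged with a fixed shape:
inputs = keras.layers.Input(shape=(800, 800, 3))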

@IntelligentIndia7

@tonmoyborah @ozyilmaz Hey, did you solve the error? I am trying to run RetinaNet with the Mobilenet224_1.0 backbone and I got a mAP of 0. When I try to train with the change in eval.py -> _get_detections (the line image = generator.preprocess_image(image) changed to image = generator.preprocess_image(image, mode='tf')) as mentioned by @ozyilmaz, I get the same error about the unexpected keyword mode.

@IntelligentIndia7

when I train normally I get 0 mAP as shown below. Can anyone help me on this?

10000/10000 [==============================] - 3158s 316ms/step - loss: 5.4151 - regression_loss: 2.5757 - classification_loss: 2.8393 - val_loss: 5.3902 - val_regression_loss: 2.5642 - val_classification_loss: 2.8260
Running network: 100% (12704 of 12704) |#################################################################| Elapsed Time: 0:16:48 Time: 0:16:48
Parsing annotations: 100% (12704 of 12704) |#############################################################| Elapsed Time: 0:00:00 Time: 0:00:00
('6066 instances of class', 'M', 'with average precision: 0.0000')
('8803 instances of class', 'W', 'with average precision: 0.0000')
mAP: 0.0000

@hgaiser (Contributor) commented May 21, 2019

Please use the keras-retinanet slack channel for usage questions, or read the readme to find out possible issues.

@thusinh1969 commented May 21, 2019

I have the same issue. Backbone densenet201, with weights downloaded from the keras GitHub. Training with --freeze-backbone on a custom CSV dataset which worked well with any TensorFlow object detection model. Batch size 16, dataset of 27,000 images, one single class.

Up to epoch 3 (-> 81,000 iterations), retinanet-evaluate produces NO predicted bounding box on any of the 3000 evaluation images! Can someone help... please.

Just to add, MobileNet224_2 also does NOT produce any mAP at all. Very tiring ... :( ...

Steve

@liminghuiv

For mobilenet, I saw keras-retinanet being used for vehicle detection:
https://github.com/yangliupku/retinanet_detection
Can someone merge it?
@hgaiser ?

@mariaculman18

Dear all, I came upon the same issue with DenseNet-121. While training, the mAP is estimated correctly, but when using retinanet-evaluate it is just 0. I know this is a community to contribute to, and I would like to help resolve this, but I am just a beginner. So, has anyone found a way around this? I am using my own CSV dataset for training.

@hgaiser (Contributor) commented Aug 20, 2019

Dear all, I came upon the same issue with DenseNet-121. While training, the mAP is estimated correctly, but when using retinanet-evaluate it is just 0. I know this is a community to contribute to, and I would like to help resolve this, but I am just a beginner. So, has anyone found a way around this? I am using my own CSV dataset for training.

The reply below is still valid:

Densenet is a community contribution and I have never really used it. If you find out what the problem is then a PR will be welcome :)

You could use resnet50, that should work.

@hgaiser (Contributor) commented Aug 20, 2019

Could the issue be related to this?

@mariaculman18 commented Aug 22, 2019

Dear all, I came upon the same issue with DenseNet-121. While training, the mAP is estimated correctly, but when using retinanet-evaluate it is just 0. I know this is a community to contribute to, and I would like to help resolve this, but I am just a beginner. So, has anyone found a way around this? I am using my own CSV dataset for training.

The reply below is still valid:

Densenet is a community contribution and I have never really used it. If you find out what the problem is then a PR will be welcome :)

You could use resnet50, that should work.

Yes, I already use ResNet-50 as the backbone and wanted to make a comparison with DenseNet-121.

@uttaransinha

Hi,
I've looked into this problem in detail, and it seems like the problem lies in the model itself. Any backbone other than ResNet-50 predicts box coordinates of -1, labels of -1 and scores of -1. In other words, the model cannot predict anything at all. But since the training phase of the model goes smoothly, I suspect that the conversion of a trained model to an inference model is bugged. Please take a look at the inference conversion code.
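Worth noting: -1 is also the padding value that the inference model's filter_detections step uses when fewer than max_detections boxes pass the score threshold, so an output that is all -1 can simply mean that no detection scored above the threshold, rather than that the conversion itself broke. A quick check, assuming model is a converted inference model and image is already preprocessed and resized:

import numpy as np

# the outputs are padded with -1 up to max_detections; count the real detections
boxes, scores, labels = model.predict_on_batch(np.expand_dims(image, axis=0))
print('detections above the score threshold:', int((labels[0] != -1).sum()))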

@mariaculman18

Hi,
I've looked into this problem in detail, and it seems like the problem lies in the model itself. Any backbone other than ResNet-50 predicts box coordinates of -1, labels of -1 and scores of -1. In other words, the model cannot predict anything at all. But since the training phase of the model goes smoothly, I suspect that the conversion of a trained model to an inference model is bugged. Please take a look at the inference conversion code.

Hi @Uttaran-IITH, please see this, where with the guidance of @hgaiser and @ikerodl96 I found a way around this issue.

@uttaransinha

@mariaculman18, thank you for the suggestion. Unfortunately, the solution works only for Densenet, not for Resnet101 or Resnet152. In the case of Densenet121, the mAP is very low even on the training data. I assume that the model you have used in the code is the inference model.

python3 ./keras-retinanet/keras_retinanet/bin/evaluate.py --backbone=resnet101 csv train.csv class.csv retinanet_resnet101_inference.h5

@mariaculman18

@Uttaran-IITH yes, I only tried the solution for DenseNet-121. I cannot give you any suggestions for other backbones, sorry :(

In my case, I got a mAP of 96% with ResNet-50 and 90% with DenseNet-121, on the training data.

@Jorbo19 commented Aug 30, 2019

Hi, I use resnet101, but the loss is always around 1. How can I reduce it?

@Jorbo19 commented Aug 30, 2019

I only have one class

@beibeiZ commented Sep 5, 2019

when I train normally I get 0 mAP as shown below. Can anyone help me on this?

10000/10000 [==============================] - 3158s 316ms/step - loss: 5.4151 - regression_loss: 2.5757 - classification_loss: 2.8393 - val_loss: 5.3902 - val_regression_loss: 2.5642 - val_classification_loss: 2.8260
Running network: 100% (12704 of 12704) |#################################################################| Elapsed Time: 0:16:48 Time: 0:16:48
Parsing annotations: 100% (12704 of 12704) |#############################################################| Elapsed Time: 0:00:00 Time: 0:00:00
('6066 instances of class', 'M', 'with average precision: 0.0000')
('8803 instances of class', 'W', 'with average precision: 0.0000')
mAP: 0.0000

retinanet-evaluate --convert-model ./model/resnet50_csv_100.h5 csv ./train.csv ./class.csv
Using TensorFlow backend.
usage: retinanet-evaluate [-h] [--convert-model] [--backbone BACKBONE]
[--gpu GPU] [--score-threshold SCORE_THRESHOLD]
[--iou-threshold IOU_THRESHOLD]
[--max-detections MAX_DETECTIONS]
[--save-path SAVE_PATH]
[--image-min-side IMAGE_MIN_SIDE]
[--image-max-side IMAGE_MAX_SIDE] [--config CONFIG]
{coco,pascal,csv} ... model
retinanet-evaluate: error: argument dataset_type: invalid choice: './model/resnet50_csv_100.h5' (choose from 'coco', 'pascal', 'csv')
Why do I get this error? Can you help me?
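Judging from the usage string above, the positional model argument has to come after the dataset arguments, so the call would look something like this (same paths as in the command above):

retinanet-evaluate --convert-model csv ./train.csv ./class.csv ./model/resnet50_csv_100.h5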

@ihunhh commented Sep 12, 2019

I guess it is caused by this:
(two screenshots attached in the original comment; not reproduced here)

@yx-wu commented Oct 12, 2019

@MAGI003769 Hello, I met the same problem as you. With 'densenet201' as the backbone, I got strange detection results and all the scores are 1. Has your problem been solved? Thanks a lot.

@hgaiser (Contributor) commented Nov 4, 2019

Has your problem been solved?

I would also like to know if this problem still persists.

@yx-wu commented Nov 4, 2019 via email

@bhavesh907

I'm also facing a very strange problem. I get a non-zero mAP when evaluating during training, but when I use convert_model.py and evaluate.py, I get zero mAP values. I'm facing this issue with the EfficientNet backbones.

@mngata commented Jan 10, 2020

I also encountered the same problem. I changed the backbone to detnet59, which is basically a modification of resnet50. I can see the loss decrease while I am training; however, the per-epoch evaluation on the test set is always 0. The resnet50 backbone works well. I was wondering if there is an error in the evaluation function.

@foghegehog (Contributor)

@mariaculman18

Hi @Uttaran-IITH, please see this, where with the guidance of @hgaiser and @ikerodl96 I found a way around this issue.

Your solution helped me as well. Why not make a PR? (And use **common_args in all generators.)

@mariaculman18

@mariaculman18

Hi @Uttaran-IITH, please see this, where with the guidance of @hgaiser and @ikerodl96 I found a way around this issue.

Your solution helped me as well. Why not make a PR? (And use **common_args in all generators.)

Happy to know it worked for you :)

I guess the contributors are aware of the problem. It would be better if they made the PR; I don't know how to do that :/

@foghegehog (Contributor) commented Feb 25, 2020

@mariaculman18

Happy to know it worked for you :)

I guess the contributors are aware of the problem. It would be better if they made the PR; I don't know how to do that :/

Well, I dared to create the PR: #1290. Let's see if it suits.
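For context, the change under discussion is roughly of this shape: evaluate.py builds a common_args dict that includes the backbone-specific preprocess_image, and that dict should be passed to every generator it creates. A sketch assuming the generator constructors take these keyword arguments; the actual contents of #1290 may differ:

# keras_retinanet/bin/evaluate.py, inside create_generator() -- sketch
common_args = {
    'preprocess_image': backbone.preprocess_image,
    'image_min_side'  : args.image_min_side,
    'image_max_side'  : args.image_max_side,
}

validation_generator = CSVGenerator(
    args.annotations,
    args.classes,
    **common_args
)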

@amindehnavi

when I train normally I get 0 mAP as shown below. Can anyone help me on this?

10000/10000 [==============================] - 3158s 316ms/step - loss: 5.4151 - regression_loss: 2.5757 - classification_loss: 2.8393 - val_loss: 5.3902 - val_regression_loss: 2.5642 - val_classification_loss: 2.8260
Running network: 100% (12704 of 12704) |#################################################################| Elapsed Time: 0:16:48 Time: 0:16:48
Parsing annotations: 100% (12704 of 12704) |#############################################################| Elapsed Time: 0:00:00 Time: 0:00:00
('6066 instances of class', 'M', 'with average precision: 0.0000')
('8803 instances of class', 'W', 'with average precision: 0.0000')
mAP: 0.0000

I have the same problem. Have you found a solution?

@mariaculman18

when I train normally I get 0 mAP as shown below. Can anyone help me on this?
10000/10000 [==============================] - 3158s 316ms/step - loss: 5.4151 - regression_loss: 2.5757 - classification_loss: 2.8393 - val_loss: 5.3902 - val_regression_loss: 2.5642 - val_classification_loss: 2.8260
Running network: 100% (12704 of 12704) |#################################################################| Elapsed Time: 0:16:48 Time: 0:16:48
Parsing annotations: 100% (12704 of 12704) |#############################################################| Elapsed Time: 0:00:00 Time: 0:00:00
('6066 instances of class', 'M', 'with average precision: 0.0000')
('8803 instances of class', 'W', 'with average precision: 0.0000')
mAP: 0.0000

I have the same problem. Have you found a solution?

Use the solution here for Densenet.

@pleaseRedo commented May 23, 2020

I also encountered the same problem. I changed the backbone to detnet59, which is basically a modification of resnet50. I can see the loss decrease while I am training; however, the per-epoch evaluation on the test set is always 0. The resnet50 backbone works well. I was wondering if there is an error in the evaluation function.

@mngata Have you managed to fix this mAP 0.0 issue? I changed the backbone to detnet59 and experienced the same issue as well.

@amindehnavi

when I train normally I get 0 mAP as shown below. Can anyone help me on this?
10000/10000 [==============================] - 3158s 316ms/step - loss: 5.4151 - regression_loss: 2.5757 - classification_loss: 2.8393 - val_loss: 5.3902 - val_regression_loss: 2.5642 - val_classification_loss: 2.8260
Running network: 100% (12704 of 12704) |#################################################################| Elapsed Time: 0:16:48 Time: 0:16:48
Parsing annotations: 100% (12704 of 12704) |#############################################################| Elapsed Time: 0:00:00 Time: 0:00:00
('6066 instances of class', 'M', 'with average precision: 0.0000')
('8803 instances of class', 'W', 'with average precision: 0.0000')
mAP: 0.0000

I have the same problem. Have you found a solution?

Use the solution here for Densenet.

Thanks, it worked.
