
How do I interpret evaluation results on my custom dataset? #663

Closed
Suvi-dha opened this issue Jun 11, 2018 · 26 comments

@Suvi-dha

After training, I ran the evaluation code on 500 images and received the following results.
Can anyone help me understand them? Any help is much appreciated.

Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.061
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.075
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.059
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.000
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.094
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.099
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.099
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.099
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.000
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.149
@qmzsuxing

@Suvi-dha have you solved your problem? I also wonder what this means...

@Suvi-dha
Author

Yes, I read about it. The values are average precision and average recall at IoU (intersection over union) thresholds between 0.5 and 0.95, with a maximum of 100 detections per image. Precision is the number of true positive detections (IoU above the threshold) divided by the total number of detections (TP + FP), and the per-threshold precision values are averaged to obtain the final value in the last column.
Similarly for recall.
They also report Average Precision (AP) and AR for different object areas (all, small, medium and large).
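To make that averaging concrete, here is a minimal sketch (not code from this repo) of how the AP@[0.50:0.95] number is formed from per-threshold APs; ap_at_iou is a placeholder for whatever computes AP at a single threshold:

import numpy as np

def coco_style_ap(ap_at_iou):
    # ap_at_iou(t) returns the AP at one IoU threshold t,
    # e.g. utils.compute_ap(..., iou_threshold=t)[0].
    thresholds = np.arange(0.5, 1.0, 0.05)  # 0.50, 0.55, ..., 0.95
    return np.mean([ap_at_iou(t) for t in thresholds])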

@ashnair1

ashnair1 commented Jan 24, 2019

@Suvi-dha So how exactly would this result be reported? Is your mAP = 0.061? Isn't this an extremely low value? I'm facing a similar issue where my AP score for all IOU thresholds is 0.075. I don't understand how to actually report the score of my model.

@Suvi-dha
Author

Mask R-CNN doesn't work well on my dataset due to several issues; that's why the results are low. I tried training with different hyperparameters to see the effect, so you can also try changing them and observe the results. Maybe they will improve.
I don't completely understand what you mean by reporting the score of your model.
If you are comparing your model's performance with Mask R-CNN, then the metrics you use for reporting your results should also be calculated for Mask R-CNN, or the other way round.

@ashnair1

ashnair1 commented Jan 24, 2019

@Suvi-dha By reporting the score I mean reporting the mAP score, since that's the common evaluation metric. I tried calculating the mAP using the compute_ap function as in train_shapes.ipynb, and that gave me a somewhat reasonable mAP of 0.272, i.e. 27.2%. I guess I'll just go with that.

For reference, my dataset consists of satellite images.

@eyildiz-ugoe

eyildiz-ugoe commented Mar 29, 2019

I am having exactly the same problem. What functions do you use to evaluate your model? Where are these functions located? train_shapes.ipynb only has mAP, and that's not enough. We need to evaluate using the metrics given by COCO (AP50, AP75, AP_S, etc.).

@banafsh89

@Suvi-dha Could you please tell me which part of the code gives you Average Recall? I can only calculate AP using utils.compute_ap. I don't know how to calculate Average Recall!

@Suvi-dha
Author

Suvi-dha commented Jul 29, 2019

@Suvi-dha Could you please tell me which part of the code gives you Average Recall? I can only calculate AP using utils.compute_ap. I don't know how to calculate Average Recall!

You can calculate average recall from utils.compute_recall. Also, compute_ap returns the IoU overlaps, precisions and recalls in addition to the AP.
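For reference, a rough sketch of how these helpers are typically called on a single image; the variable names are placeholders, and the exact signatures may differ between versions of the repo, so check utils.py in your copy:

from mrcnn import utils

# gt_* come from the dataset annotations, pred_* from model.detect() on one image.
AP, precisions, recalls, overlaps = utils.compute_ap(
    gt_bbox, gt_class_id, gt_mask,
    pred_bbox, pred_class_id, pred_score, pred_mask,
    iou_threshold=0.5)

# Recall of the predicted boxes against the ground truth at a given IoU.
recall, positive_ids = utils.compute_recall(pred_bbox, gt_bbox, iou=0.5)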

@Suvi-dha
Author

I am having exactly the same problem. What functions do you guys use to evaluate your model? Where are these functions located? train_shapes.ipynb only has mAP, that's not enough. We need to evaluate it using the metrics given by COCO. (AP50, AP75, AP_S...etc)

Search in utils.py.

@banafsh89

Thanks, but that is only the recall at a single IoU threshold! How should I find the average recall? For example, if you look at compute_ap, it computes mAP as "Compute mean AP over recall range". Do you have any idea about that?

@Suvi-dha
Author

Suvi-dha commented Aug 1, 2019

Thanks, but that is only the recall at a single IoU threshold! How should I find the average recall? For example, if you look at compute_ap, it computes mAP as "Compute mean AP over recall range". Do you have any idea about that?

You have to use the pycocotools library if you want to calculate results on your dataset like the ones shown in my first comment.

from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval
from pycocotools import mask as maskUtils

Check the code for cocoeval.
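For anyone looking for a concrete starting point, a minimal sketch of the standard pycocotools evaluation loop; the annotation and results file paths below are placeholders for your own files:

from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval

coco_gt = COCO("annotations/instances_val.json")      # ground-truth annotations (placeholder path)
coco_dt = coco_gt.loadRes("detection_results.json")   # your model's detections in COCO results format

coco_eval = COCOeval(coco_gt, coco_dt, iouType="bbox")  # use "segm" to evaluate masks
coco_eval.evaluate()
coco_eval.accumulate()
coco_eval.summarize()  # prints the AP/AR table shown in the first comment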

@sineagles

Did you solve this problem? I have the same problem as yours. Can you tell me how to correct it?

@jdmelendez

@Suvi-dha So how exactly would this result be reported? Is your mAP = 0.061? Isn't this an extremely low value? I'm facing a similar issue where my AP score for all IoU thresholds is 0.075. I don't understand how to actually report the score of my model.

If you're training with COCO, the mAP score is the AP; perhaps this is what you are looking for. On http://cocodataset.org/#detection-eval , under "2. Metrics", there is the following line:

"AP is averaged over all categories. Traditionally, this is called "mean average precision" (mAP). We make no distinction between AP and mAP (and likewise AR and mAR) and assume the difference is clear from context."


@WillianaLeite

@Suvi-dha Could you please tell me which part of the code gives you Average Recall? I can only calculate AP using utils.compute_ap. I don't know how to calculate Average Recall!

You can calculate average recall from utils.compute_recall. Also, compute_ap returns the IoU overlaps, precisions and recalls in addition to the AP.

Could you please explain to me how to calculate mAR from the "utils.compute_recall" function? I understand that it returns the AR, but how should I calculate the mAR? Please help me!

@Joanne513

After training, I ran the evaluation code on 500 images and received the following results.
Can anyone help me understand them? Any help is much appreciated.

Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.061
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.075
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.059
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.000
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.094
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.099
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.099
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.099
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.000
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.149

How did you get this result? I can only use the compute_ap function to calculate the AP at IoU = 0.5. How did you get so many IoUs? Can you help me?

@elia86

elia86 commented Jul 2, 2020

@Suvi-dha Could you please tell me which part of the code gives you Average Recall? I can only calculate AP using utils.compute_ap. I don't know how to calculate Average Recall!

You can calculate average recall from utils.compute_recall. Also, compute_ap returns the IoU overlaps, precisions and recalls in addition to the AP.

Could you please explain to me how to calculate mAR from the "utils.compute_recall" function? I understand that it returns the AR, but how should I calculate the mAR? Please help me!

Hi @WillianaLeite ! Have you found a way to calculate mAR? I have the same issue here :)

@MALLI7622

After fine-tuning Faster R-CNN on my custom data, I was getting all-zero results:
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.000

@Suvi-dha @ashnair1 Can anybody help to resolve this issue...?

@keremtatlici

@MALLI7622 Can you show us a sample of your dataset? I mean one sample's mask and the original image.

@Olamilekan002

It seems the model is not learning at all.

@MGardien

MGardien commented Apr 5, 2022

After fine-tuning Faster R-CNN on my custom data, I was getting all-zero results:
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.000

@Suvi-dha @ashnair1 Can anybody help to resolve this issue...?

I had a similar issue. You should check the mask format. It could contain gray values where it should contain class indices.
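As a rough illustration of that check (the file path and the gray-value-to-class mapping below are hypothetical and depend on how your masks were exported):

import numpy as np
from PIL import Image

mask = np.array(Image.open("sample_mask.png"))  # hypothetical path to one training mask
print(np.unique(mask))  # should be small class ids (0, 1, 2, ...), not arbitrary gray levels

# If the mask was exported with gray values, remap them to class ids first.
gray_to_class = {0: 0, 128: 1, 255: 2}          # hypothetical mapping
class_mask = np.vectorize(gray_to_class.get)(mask)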

@babanthierry94

After fine-tuning Faster R-CNN on my custom data, I was getting all-zero results:
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.000

@Suvi-dha @ashnair1 Can anybody help to resolve this issue...?

I have the same issue. Did someone solve it?

@Ann-ad

Ann-ad commented Jun 2, 2022

@babanthierry94 @MGardien What is your number of training steps?

@Ann-ad

Ann-ad commented Jun 2, 2022

@babanthierry94 @MGardien, try increasing the number of your training steps.

@nimakasipour

@babanthierry94 The metric works if you run detection on all images in the COCO dataset. I found this out when I saved the ground-truth bounding boxes from "instances_val2017.json" and evaluated them with these metrics, expecting to get 1.0 for everything. If some images are missing, the result is lower than 1.0. I used the FiftyOne metrics instead, which are also suggested on the homepage of the COCO dataset; with FiftyOne it was possible to evaluate the results even if they were not produced for all images.
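A sketch of that FiftyOne route, assuming a dataset that already has ground-truth and prediction detection fields loaded; the dataset and field names are placeholders, so check the FiftyOne documentation for the exact API:

import fiftyone as fo

dataset = fo.load_dataset("my_dataset")  # hypothetical dataset with both fields below

results = dataset.evaluate_detections(
    "predictions",           # field holding the model's detections
    gt_field="ground_truth",
    method="coco",
    compute_mAP=True,
)
print(results.mAP())
results.print_report()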

@andycga

andycga commented May 30, 2023

Can anybody help to resolve this issue...?
Accumulating evaluation results...
DONE (t=0.23s).
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.988
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.991
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.991
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = -1.000
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.988
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.993
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.993
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.993
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = -1.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.993

What does the -1 mean?
