The evaluation result is not good. #2

Closed
suyanzhou626 opened this issue Aug 11, 2021 · 11 comments

@suyanzhou626

I ran the command:

python Expr.py --config configs/UACANet-L.yaml

However, my results are as follows:
[image: Screenshot from 2021-08-11 22-05-30]
The official results for UACANet-L listed in the README are:

dataset              meanDic    meanIoU    wFm     Sm    meanEm    mae    maxEm    maxDic    maxIoU    meanSen    maxSen    meanSpe    maxSpe
-----------------  ---------  ---------  -----  -----  --------  -----  -------  --------  --------  ---------  --------  ---------  --------
CVC-300                0.910      0.849  0.901  0.937     0.977  0.005    0.980     0.913     0.853      0.940     1.000      0.993     0.997
CVC-ClinicDB           0.926      0.880  0.928  0.943     0.974  0.006    0.976     0.929     0.883      0.943     1.000      0.992     0.996
Kvasir                 0.912      0.859  0.902  0.917     0.955  0.025    0.958     0.915     0.862      0.923     1.000      0.983     0.987
CVC-ColonDB            0.751      0.678  0.746  0.835     0.875  0.039    0.878     0.753     0.680      0.754     1.000      0.953     0.957
ETIS-LaribPolypDB      0.766      0.689  0.740  0.859     0.903  0.012    0.905     0.769     0.691      0.813     1.000      0.932     0.936

Why are my results so much worse? I didn't change any configuration file.

@plemeri
Owner

plemeri commented Aug 12, 2021

Can you tell me your device setup? Also, please try our checkpoint from this URL.

@suyanzhou626
Author

PyTorch: 1.8
CUDA: 11.0
GPU: RTX 3090

@plemeri
Owner

plemeri commented Aug 12, 2021

I don't think this is a device problem, although I don't have much experience with the RTX 3090. I have also occasionally seen poor results myself, though not often, so please try another experiment with the same settings. Please reopen this issue if you still have the problem.
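
If run-to-run variation is the suspect, one way to check is to fix the random seeds before repeating the experiment. The following is only a minimal sketch: `seed_everything` is a hypothetical helper, not part of this repository, and the thread does not say whether `Expr.py` already sets seeds.

```python
import random

try:  # NumPy and PyTorch are optional so the sketch also runs without them
    import numpy as np
except ImportError:
    np = None
try:
    import torch
except ImportError:
    torch = None


def seed_everything(seed: int = 0) -> None:
    """Hypothetical helper: seed the common RNGs before a training run."""
    random.seed(seed)
    if np is not None:
        np.random.seed(seed)
    if torch is not None:
        torch.manual_seed(seed)
        torch.cuda.manual_seed_all(seed)  # silently ignored without CUDA
        # Trade some speed for repeatable cuDNN algorithm selection.
        torch.backends.cudnn.deterministic = True
        torch.backends.cudnn.benchmark = False


seed_everything(0)
```

Even with fixed seeds, some GPU operations can remain nondeterministic, so small differences between runs may persist.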

@plemeri plemeri closed this as completed Aug 12, 2021
@FrankWuuu

Hello, sorry to bother you.
I ran the code as described in the README, but my results are also not that good. The only thing I changed is the batch size, which is 8 in my run. My evaluation results are below, and I don't know where the problem is.
[image: result]

@plemeri
Owner

plemeri commented Sep 23, 2021 via email

@plemeri
Owner

plemeri commented Sep 24, 2021

Hi, I ran two additional experiments with batch sizes 16 and 8, and here are the results.

[batchsize 16]
[image: Screen Shot 2021-09-24 at 11 04 00 AM]

[batchsize 8]
[image: Screen Shot 2021-09-24 at 11 04 28 AM]

I never knew that a small batch size would affect the ETIS dataset this much, but it turns out it does.
Let me know if you still have problems after increasing the batch size. Until then, I'll reopen this issue for convenience.

@plemeri plemeri reopened this Sep 24, 2021
@FrankWuuu

FrankWuuu commented Sep 24, 2021 via email

@FrankWuuu

FrankWuuu commented Sep 25, 2021

Hello, I ran it again with the batch size set to 8. Here is the result.
[image]
The results seem to have improved in some places.
Thanks again.

@sandeepsinghsengar

Hi,
I am writing my comment here instead of by email so that others can benefit too, as I am facing the same problem described in the comments above. I am now running three experiments: (1) batch size 32, 24 GB RAM, titanrtx, total nodes = 3; (2) batch size 32, 24 GB RAM, titanrtx, total nodes = 1; (3) batch size 32, 24 GB RAM, titanx, total nodes = 2. I will update you on the results soon.

@sandeepsinghsengar

Following are the results:
dataset              meanDic    meanIoU    wFm     Sm    meanEm    mae    maxEm    maxDic    maxIoU    meanSen    maxSen    meanSpe    maxSpe
-----------------  ---------  ---------  -----  -----  --------  -----  -------  --------  --------  ---------  --------  ---------  --------
CVC-300                0.909      0.846  0.895  0.937     0.977  0.005    0.980     0.913     0.850      0.960     1.000      0.992     0.996
CVC-ClinicDB           0.923      0.878  0.926  0.938     0.971  0.007    0.974     0.927     0.881      0.931     1.000      0.993     0.997
Kvasir                 0.898      0.844  0.891  0.910     0.941  0.028    0.944     0.901     0.847      0.899     1.000      0.974     0.978
CVC-ColonDB            0.741      0.672  0.734  0.829     0.856  0.038    0.859     0.743     0.674      0.759     1.000      0.914     0.918
ETIS-LaribPolypDB      0.684      0.617  0.659  0.812     0.863  0.019    0.865     0.686     0.619      0.733     1.000      0.849     0.853

dataset              meanDic    meanIoU    wFm     Sm    meanEm    mae    maxEm    maxDic    maxIoU    meanSen    maxSen    meanSpe    maxSpe
-----------------  ---------  ---------  -----  -----  --------  -----  -------  --------  --------  ---------  --------  ---------  --------
CVC-300                0.909      0.846  0.895  0.937     0.976  0.005    0.979     0.912     0.849      0.958     1.000      0.992     0.996
CVC-ClinicDB           0.933      0.886  0.933  0.943     0.981  0.006    0.984     0.936     0.889      0.946     1.000      0.992     0.996
Kvasir                 0.901      0.847  0.890  0.910     0.947  0.028    0.950     0.904     0.850      0.907     1.000      0.981     0.985
CVC-ColonDB            0.756      0.687  0.749  0.837     0.872  0.036    0.875     0.758     0.689      0.766     1.000      0.911     0.914
ETIS-LaribPolypDB      0.713      0.641  0.690  0.829     0.857  0.012    0.859     0.715     0.644      0.751     1.000      0.888     0.891

These results are still not as good as those claimed in the paper, although batch size 32 does bring an improvement.
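
To make the remaining gap concrete, here is a small sketch that compares the meanDic column of the two tables above with the README values quoted at the top of this thread. The numbers are copied from the thread; the names `README`, `RUN_A`, `RUN_B` and `gaps` are just labels chosen for this example, and the thread does not say which of the three experiments each table belongs to.

```python
# meanDic values copied from the tables in this thread.
README = {"CVC-300": 0.910, "CVC-ClinicDB": 0.926, "Kvasir": 0.912,
          "CVC-ColonDB": 0.751, "ETIS-LaribPolypDB": 0.766}
RUN_A = {"CVC-300": 0.909, "CVC-ClinicDB": 0.923, "Kvasir": 0.898,
         "CVC-ColonDB": 0.741, "ETIS-LaribPolypDB": 0.684}  # first table
RUN_B = {"CVC-300": 0.909, "CVC-ClinicDB": 0.933, "Kvasir": 0.901,
         "CVC-ColonDB": 0.756, "ETIS-LaribPolypDB": 0.713}  # second table


def gaps(run, reference):
    """Return run minus reference per dataset, rounded to 3 decimals."""
    return {name: round(run[name] - reference[name], 3) for name in reference}


for label, run in (("first table", RUN_A), ("second table", RUN_B)):
    print(label)
    for name, gap in gaps(run, README).items():
        print(f"  {name:<18} {gap:+.3f}")
```

On meanDic, ETIS-LaribPolypDB is 0.082 and 0.053 below the README value in the two tables, while the other datasets are within about 0.015 of it.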

@plemeri
Owner

plemeri commented Oct 25, 2021

I'm closing this issue since there are no other updates.

@plemeri plemeri closed this as completed Oct 25, 2021
@plemeri plemeri mentioned this issue May 23, 2022