
Questions about ZegCLIP training #7

Closed
Qyizos opened this issue May 1, 2023 · 4 comments

Comments

@Qyizos

Qyizos commented May 1, 2023

I was very happy to see your work on ZegCLIP; it is very interesting and has been very helpful to me. However, I am having some trouble reproducing your results with the code.

When I use the pth file you provided for inference, I get results consistent with the paper. But when I train from scratch using the same Docker environment and source code, the inference results deviate. Under the inductive setting on the VOC dataset, my results differ from yours by less than 2 points; under the inductive setting on the COCO dataset, however, the gap is as large as 7 points. The experimental results are shown in the attachments.

Can you help answer this question?

COCO_Inductive
VOC_Inductive

@ZiqinZhou66
Owner

I appreciate your interest in our work.

Could you please confirm whether the mask weight you used in the config is 20 or 100? I noticed that the parameter in the original config was 20, but I actually used 100 for training, so I uploaded a new config. Using the old version may affect the performance.
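For context, in MMSeg-style configs this kind of weight normally sits in the decode head's loss settings. The fragment below is purely illustrative; the loss type and field names (`mask_weight` etc.) are assumptions, not ZegCLIP's actual config keys:

```python
# Illustrative MMSeg-style config fragment; all key names here are
# assumptions for illustration, not ZegCLIP's actual config.
model = dict(
    decode_head=dict(
        loss_decode=dict(
            type='MaskLoss',     # hypothetical loss name
            mask_weight=100.0,   # the owner reports training with 100, not 20
        )))
```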

Besides, a trick widely used in many previous works, slightly reducing the logits of seen classes, is helpful in the inductive setting; I set that factor to 0.1. Was it also applied in your inference? You could also try changing the factor to see the difference.
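The suppression trick described above can be sketched as follows. This is a minimal NumPy sketch, not ZegCLIP's actual implementation; the class indices and where the 0.1 factor is applied are assumptions:

```python
import numpy as np

def suppress_seen_logits(logits, seen_class_ids, factor=0.1):
    """Subtract a small constant from the logits of seen classes so that
    unseen classes are less likely to be overridden at inference time.

    logits: array of shape (num_classes, H, W) with per-pixel scores
    seen_class_ids: indices of classes seen during training
    factor: amount subtracted from seen-class logits (0.1, per this thread)
    """
    out = logits.copy()
    out[seen_class_ids] -= factor  # bias predictions toward unseen classes
    return out

# Toy example: 3 classes on a 1x1 "image"; classes 0 and 1 are seen, 2 unseen.
logits = np.array([[[2.05]], [[1.0]], [[2.0]]])
pred_raw = logits.argmax(axis=0)                               # seen class 0 wins
pred_adj = suppress_seen_logits(logits, [0, 1]).argmax(axis=0) # unseen class 2 wins
```

A larger factor pushes predictions more aggressively toward unseen classes, which is why the owner suggests sweeping it.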

@Qyizos
Author

Qyizos commented May 14, 2023

Thank you for your prompt reply. This discrepancy was caused by my overlooking the number of training iterations.

In MMSeg, batchSize = GPUNum * samples_per_gpu. Your paper mentions training with 4 GPUs, a condition I had overlooked: I was using only a single card, so the amount of training data per iteration was only 1/4 of yours. After making up the full amount of training, the method's performance improved significantly.
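The arithmetic behind this can be written out explicitly. The 4-GPU / samples_per_gpu=4 split below is an assumption, chosen to be consistent with the total batch size of 16 mentioned later in this thread:

```python
# In MMSegmentation, the effective batch size per training iteration is:
#   batch_size = num_gpus * samples_per_gpu
def effective_batch_size(num_gpus, samples_per_gpu):
    return num_gpus * samples_per_gpu

paper_bs = effective_batch_size(num_gpus=4, samples_per_gpu=4)   # 16
single_bs = effective_batch_size(num_gpus=1, samples_per_gpu=4)  # 4

# With the same iteration count, a single card therefore sees only
# single_bs / paper_bs = 1/4 of the training samples.
```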

However, it is still 1-2 points below the paper. I think this is because the number of iterations increased several-fold and I did not adjust hyperparameters such as the learning rate accordingly.
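One common adjustment when the effective batch size changes is the linear scaling rule for the learning rate. The numbers below are placeholders, not ZegCLIP's actual hyperparameters:

```python
def scale_lr(base_lr, base_batch_size, actual_batch_size):
    """Linear scaling rule: scale the learning rate in proportion
    to the change in effective batch size."""
    return base_lr * actual_batch_size / base_batch_size

# Placeholder numbers: a config tuned for batch size 16 at lr=1e-4,
# run on a single card with an effective batch size of 4:
single_card_lr = scale_lr(base_lr=1e-4, base_batch_size=16, actual_batch_size=4)
```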

Thank you once again!

@ZiqinZhou66
Owner

Thank you for your feedback.
I should be more specific: make sure the total batch size is 16 when reproducing the results of our work.
Best of luck with your research.

@aliman80

@Qyizos Hi, I am trying to validate the results on cocostuff164k. I got very good results for 11 classes, but for all the rest the score is zero. Can you guide me on what I am missing here? I ran the code with only the data path updated; the rest of the repo code was unchanged.
