Global query: wrong repeating #10
Comments
@shkarupa-alex Thanks for your comment. I tested it again but got the correct order of images in the batch for both training and validation. We will test again using different PyTorch versions and hardware to ensure consistency.
@ahatamiz you can reproduce the issue as follows:

```python
batch_size = input.shape[0]
batch_range = np.arange(batch_size, dtype='int32')
np.random.shuffle(batch_range)
input = input[batch_range]
target = target[batch_range]
```

You will always get different top-1 and top-5 accuracy.
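The shuffle test above relies on a simple invariance: a model that treats batch elements independently must produce permuted outputs for permuted inputs, so accuracy cannot change. A minimal sketch of that invariance check, using a placeholder per-sample function (`per_sample_model` is hypothetical, not from the repo):

```python
import numpy as np

rng = np.random.default_rng(0)

def per_sample_model(x):
    # Placeholder computation that is independent across batch elements,
    # as a correctly implemented model should be at test time.
    return x * 2.0

x = rng.normal(size=(16, 4))
perm = rng.permutation(16)

out = per_sample_model(x)
out_shuffled = per_sample_model(x[perm])

# Permuting the inputs must only permute the outputs; a batch-order bug
# (like the global-query repeat) breaks this equality.
assert np.allclose(out[perm], out_shuffled)
```

With the repeating bug present, the analogous check on GCViT logits fails for batch sizes greater than 1.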
I ran into the same issue when playing with the current GCViT model code. It does seem that repeating along the batch dimension produces the wrong element ordering.
We indeed refactored our internal version of the code and introduced some bugs while trying to simplify things. Thanks @tridao @shkarupa-alex for helping to identify those issues. We have fixed them and are retraining models to share in the repo. ETA is EOW.
Hi @tridao We have updated our models and re-trained them. In all cases, the accuracy is better than the numbers reported in the paper from our internal checkpoints. In addition, we have conducted a set of stress tests and verified that there is no inconsistency in model performance across different batch sizes at test time. Please check the new model weights and let us know if you face any issues.
Model code has an issue in adapting global query for attention estimation: https://github.com/NVlabs/GCVit/blob/main/models/gc_vit.py#L223
Repeating along the batch dimension results not in the [1, 1, 1, 2, 2, 2, 3, 3, 3] batched element order, but in [1, 2, 3, 1, 2, 3, 1, 2, 3].
At the same time, after window_partition the key and value "windows" are ordered as [1, 1, 1, 2, 2, 2, 3, 3, 3].
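The ordering mismatch is easy to see with numpy, where `np.tile` behaves like torch's `Tensor.repeat` along a dimension and `np.repeat` behaves like `torch.repeat_interleave`:

```python
import numpy as np

# Three batch elements, three windows each.
batch = np.array([1, 2, 3])

# Tensor.repeat-style expansion concatenates whole copies of the batch,
# giving the interleaved order the queries end up in.
tiled = np.tile(batch, 3)

# window_partition stacks all windows of element 1 first, then element 2,
# and so on, which matches repeat_interleave-style expansion.
contiguous = np.repeat(batch, 3)

print(tiled.tolist())       # [1, 2, 3, 1, 2, 3, 1, 2, 3]
print(contiguous.tolist())  # [1, 1, 1, 2, 2, 2, 3, 3, 3]
```

So the repeated global queries line up against keys and values belonging to different batch elements.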
You can verify this by evaluating on ImageNet with batch sizes 1 and 16; the results will differ.
In my tests, the results for batch size 1 are always equal, while the results for batch size 16 are always different if we add stochasticity (shuffling batch items and labels simultaneously).
During training this bug leads to a partially wrong attention matrix estimation, because around a (B_//B - 1) / (B_//B) portion of the windows will try to attend to the wrong batch elements.
I suppose that fixing this bug and fine-tuning the existing checkpoints should yield even better metrics.
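A minimal sketch of the fix, again in numpy for illustration: the helper names are hypothetical, and in the actual PyTorch code this corresponds to replacing a `Tensor.repeat` on the batch dimension with `torch.repeat_interleave(..., dim=0)` so the expanded queries match the window ordering:

```python
import numpy as np

def expand_query_buggy(q_global, B_):
    # B_ is the total number of windows across the batch; q_global is (B, D).
    B = q_global.shape[0]
    # np.tile mirrors torch's Tensor.repeat: whole-batch copies, interleaved.
    return np.tile(q_global, (B_ // B, 1))

def expand_query_fixed(q_global, B_):
    B = q_global.shape[0]
    # np.repeat mirrors torch.repeat_interleave: each element contiguous,
    # matching the [1, 1, 1, 2, 2, 2, 3, 3, 3] order after window_partition.
    return np.repeat(q_global, B_ // B, axis=0)

q = np.array([[1], [2], [3]])  # B = 3 samples, D = 1 for illustration
print(expand_query_buggy(q, 9).ravel().tolist())  # [1, 2, 3, 1, 2, 3, 1, 2, 3]
print(expand_query_fixed(q, 9).ravel().tolist())  # [1, 1, 1, 2, 2, 2, 3, 3, 3]
```

With batch size 1 both versions coincide (B_ // B copies of the single element), which is why evaluation at batch size 1 gives stable results while larger batches do not.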