About KNN in SCAN #4

Closed
zhunzhong07 opened this issue Jul 10, 2020 · 4 comments

zhunzhong07 commented Jul 10, 2020

Hi, thanks for sharing your great work! I have a concern about the KNN in SCAN training.

In Eq. 2 of your paper, you compute the loss by maximizing the similarities between each anchor and all of its K nearest neighbors. However, in your code, it seems that you only maximize the similarity between each anchor and one randomly selected neighbor, as in the dataloader snippet below.

# pick one of the K precomputed nearest neighbors uniformly at random
neighbor_index = np.random.choice(self.indices[index], 1)[0]
neighbor = self.dataset.__getitem__(neighbor_index)

I am not sure if my understanding is correct.

Thanks.

@wvangansbeke (Owner)

Hi @zhunzhong07,

Yes, you're correct. We sample uniformly from the K nearest neighbors during training. Therefore, it is highly likely that the anchor sees a different neighbor in the next epoch. So, if you train long enough, it should have exactly the same effect as Eq. 2. After all, it is not practical to include a lot of neighbors for every sample during a forward pass, since this does not scale well with the number of neighbors K.
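For concreteness, here is a minimal illustration (not code from the repository; the names are made up for the example) of why uniform sampling matches the full sum in Eq. 2 on average: over many epochs, each of the K neighbors is drawn with frequency approaching 1/K, i.e. the same uniform weighting as summing over all K neighbors.

```python
import numpy as np

# Illustrative only: one uniform draw per epoch for a single anchor.
rng = np.random.default_rng(0)

K = 20                       # number of mined nearest neighbors per anchor
num_epochs = 10000           # with enough epochs, every neighbor is seen many times
neighbor_ids = np.arange(K)  # stand-in for self.indices[index]

counts = np.zeros(K, dtype=int)
for _ in range(num_epochs):
    sampled = rng.choice(neighbor_ids)  # same idea as the dataloader line above
    counts[sampled] += 1

# Empirical sampling frequency approaches the uniform weight 1/K used in Eq. 2.
print(counts / num_epochs)  # each entry is close to 1/K = 0.05
```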

Hope this helps.

zhunzhong07 (Author) commented Jul 10, 2020

Hi @wvangansbeke,

Thanks for your quick reply. I have another question.

In your code, I find that the neighbor indices are only computed once, after the self-supervised learning step. Why not re-compute the neighbors after each epoch of SCAN? Would this improve the results?
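A rough sketch of the idea, for reference (the helper names are illustrative, not from the repository; faiss is assumed here for the neighbor search):

```python
import numpy as np
import faiss  # assumed here for the KNN search

def recompute_neighbors(features: np.ndarray, k: int) -> np.ndarray:
    """Illustrative helper: rebuild the KNN indices from the current features.

    `features` is assumed to be an (N, D) float32 array of L2-normalized
    embeddings, so inner product equals cosine similarity.
    """
    index = faiss.IndexFlatIP(features.shape[1])
    index.add(features)
    # search k + 1 because each sample's nearest neighbor is itself
    _, indices = index.search(features, k + 1)
    return indices[:, 1:]

# Hypothetical training loop for the idea in the question (names illustrative):
# for epoch in range(num_epochs):
#     train_one_epoch(model, dataloader)
#     features = extract_features(model, dataset)          # assumed helper
#     dataloader.dataset.indices = recompute_neighbors(features, k=20)
```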

@wvangansbeke (Owner)

Yes, a good point. I never tried it exactly like that (although I did try something similar). It actually makes sense. However, I'm not sure that the representations are going to be much better at that point. I just think that it will be difficult to exploit the self-labeling as we currently do. This step basically readjusts the decision boundary between classes and updates the representations based on the prototypes of each class.
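As an illustration of that self-labeling idea (a sketch of the general technique, not the repository code): confident predictions are turned into pseudo-labels and the model is fit to them with cross-entropy, which is what pulls samples toward the class prototypes.

```python
import torch
import torch.nn.functional as F

def selflabel_loss(model, images, threshold=0.99):
    # Sketch only: keep samples whose prediction is confident and fit them to
    # their own predicted class. (The actual method additionally applies this
    # to augmented views; that is omitted here for brevity.)
    with torch.no_grad():
        probs = F.softmax(model(images), dim=1)
        max_prob, pseudo_label = probs.max(dim=1)
        mask = max_prob > threshold  # confident, "prototype-like" samples

    logits = model(images)
    return F.cross_entropy(logits[mask], pseudo_label[mask])
```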

@zhunzhong07 (Author)

OK. Thanks for your reply!
