Motivation of the method #3
Hello, thanks for your interest. We agree that class prototypes and our method are quite similar under clean-data settings. However, we think prototypes are more easily contaminated by label noise, while ours is not. Furthermore, by using the eigenvector, the perturbation is estimated more accurately. Your question looks like a great discussion topic for further research.
Yes, I understand that the first eigenvector might be less contaminated by label noise. However, in a high-noise-rate scenario (e.g., 90% on the CIFAR-10 dataset), the ratio of clean data in each class is quite small. Wouldn't this prevent the eigenvector from representing the real distribution of the class?
That is a good point. In pilot experiments, we found that eigenvectors are more robust than other prototype-based methods, such as class prototypes, anchors generated from a Mahalanobis distribution, or the Minimum Covariance Determinant (MCD) method. In my view, eigendecomposition seems quite robust to noisy representations compared to a prototype, which is based on simple averaging.
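To make the comparison concrete, here is a minimal numpy sketch (not the authors' implementation; the data, dimensions, and noise ratio are all hypothetical) contrasting a class prototype (simple feature mean) with the first eigenvector of the class's gram matrix on features with a few mislabeled samples mixed in:

```python
import numpy as np

rng = np.random.default_rng(0)
# 100 features of dim 8: 90 "clean" samples along one direction, 10 noisy ones.
direction = np.array([1.0] + [0.0] * 7)
clean = direction + 0.05 * rng.standard_normal((90, 8))
noisy = rng.standard_normal((10, 8))  # features of mislabeled samples
feats = np.vstack([clean, noisy])

# Prototype: a simple average over every sample, noisy ones included.
prototype = feats.mean(axis=0)
prototype /= np.linalg.norm(prototype)

# First eigenvector of the gram matrix feats.T @ feats.
_, eigvecs = np.linalg.eigh(feats.T @ feats)
eigvec = eigvecs[:, -1]               # eigh sorts eigenvalues ascending
eigvec *= np.sign(eigvec @ direction)  # fix the sign for comparison

print("prototype alignment  :", abs(prototype @ direction))
print("eigenvector alignment:", abs(eigvec @ direction))
```

Both recover the clean direction here, but the eigenvector is driven by the dominant variance direction rather than by every sample equally, which is the intuition behind its robustness to a noisy minority.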
Thanks, I understand that eigenvectors are more robust than prototype-based methods. However, in my experiments, I find that the FINE sampling method performs much worse than the simplest small-loss methods. Have you met this dilemma in your experiments?
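For reference, the small-loss baseline mentioned above can be sketched as follows (a hypothetical minimal version; the function name and keep ratio are illustrative, not from the FINE codebase):

```python
import numpy as np

def small_loss_select(losses, keep_ratio=0.5):
    """Return indices of the keep_ratio fraction of samples with the smallest loss."""
    losses = np.asarray(losses)
    n_keep = max(1, int(len(losses) * keep_ratio))
    return np.argsort(losses)[:n_keep]

losses = [0.1, 2.3, 0.05, 1.7, 0.2, 3.0]
print(small_loss_select(losses, keep_ratio=0.5))  # indices of the 3 smallest losses
```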
In our experiments, we have not met such situations, but this can depend on the kind of dataset. The benefit of eigenvectors can also be affected by the capacity of the backbone network. So, how about using a warmup stage for the early training with a significant magnitude of weight decay (or other regularization), and then applying the FINE method to the pretrained architecture?
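To illustrate the weight-decay suggestion, here is a minimal numpy sketch of one SGD step with L2 weight decay (hypothetical names and values; in practice one would pass `weight_decay` to the framework's optimizer instead):

```python
import numpy as np

def sgd_weight_decay_step(w, grad, lr=0.1, weight_decay=5e-4):
    """One SGD step with L2 weight decay: w <- w - lr * (grad + wd * w)."""
    return w - lr * (grad + weight_decay * w)

w = np.ones(3)
w = sgd_weight_decay_step(w, grad=np.zeros(3))
print(w)  # weights shrink even when the gradient is zero
```

The decay term keeps the weights small during warmup, which tends to limit how much the network can memorize noisy labels before FINE is applied.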
Hello,
I have read your paper and find it very interesting. However, I have some confusion about your method. If I understood correctly, the first eigenvector represents the latent distribution of a class, which is similar to the function of a prototype. I have also seen methods that utilize the similarity between a sample and the class prototype to select clean samples. I would like to know what the advantage of using the eigenvector over prototypes is.
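For context, the prototype-similarity selection referred to above can be sketched like this (a hypothetical minimal version, not taken from any of the cited methods): rank each sample by cosine similarity to its class prototype and treat the least similar ones as likely noisy.

```python
import numpy as np

def cosine_to_prototype(feats):
    """Cosine similarity of each feature vector to the class mean (prototype)."""
    proto = feats.mean(axis=0)
    proto /= np.linalg.norm(proto)
    normed = feats / np.linalg.norm(feats, axis=1, keepdims=True)
    return normed @ proto

feats = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]])
sims = cosine_to_prototype(feats)
print(sims)  # the outlier [0, 1] gets the lowest similarity
```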
Thanks.