Hi @hunto,
I have been following your work for a long time, and I am very excited that the code for the classification task has been made public.
However, I found a problem: when I use ResNet-34 (teacher) to train ResNet-18 (student) with the B1 baseline setting on the CIFAR-10 dataset, the curve of the average γ does not match your result. It stays close to 1 and does not decrease.


The two images show the γ of the noisy adapter for feature KD and logit KD, respectively.