Question about the loss function #8

Open
blue-xuhu opened this issue Aug 21, 2020 · 3 comments

@blue-xuhu

blue-xuhu commented Aug 21, 2020

Hello,
I read your code and have some questions.
Is the loss function unique?
Is your implementation of the loss function the same as the original authors'?
I am not sure about this, and the paper does not state it clearly.

@Mephisto405
Owner

Yes, the loss function I implemented in this repository is exactly the same as the one in the reference paper. Take a look at formula (2) on page 96. LL4AL paper link
If you mean the 'target loss' for the CIFAR10 classification task, I think the authors use cross-entropy loss, according to lines 3-6 of page 98. Cross-entropy loss is a typical choice for the CIFAR10 classification task.
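For readers following along, here is a minimal sketch of how I understand the two losses discussed above: a per-sample cross-entropy as the 'target loss', and a pairwise margin loss that compares predicted losses against true losses. It is plain Python and not taken from this repository. The function names, the default `margin=1.0`, the pairing of neighbouring samples `(0, 1), (2, 3), ...`, and averaging over pairs are all my assumptions for illustration; please check them against formula (2) in the paper and the actual code.

```python
import math


def cross_entropy_per_sample(logits, labels):
    """Per-sample cross-entropy (no reduction), used here as the 'target loss'.

    logits: list of lists of floats, one row per sample.
    labels: list of int class indices.
    """
    losses = []
    for row, y in zip(logits, labels):
        m = max(row)  # subtract the max for numerical stability
        log_sum_exp = m + math.log(sum(math.exp(v - m) for v in row))
        losses.append(log_sum_exp - row[y])
    return losses


def loss_prediction_loss(pred_losses, target_losses, margin=1.0):
    """Pairwise margin loss on predicted losses (hypothetical helper).

    Assumption: samples are paired as (0, 1), (2, 3), ... and the result
    is averaged over the pairs. For each pair (i, j):
        sign = +1 if target_i > target_j else -1
        loss = max(0, -sign * (pred_i - pred_j) + margin)
    """
    assert len(pred_losses) == len(target_losses)
    assert len(pred_losses) % 2 == 0, "need an even number of samples"
    n_pairs = len(pred_losses) // 2
    total = 0.0
    for k in range(n_pairs):
        i, j = 2 * k, 2 * k + 1
        sign = 1.0 if target_losses[i] > target_losses[j] else -1.0
        total += max(0.0, -sign * (pred_losses[i] - pred_losses[j]) + margin)
    return total / n_pairs


# Predicted ordering agrees with the true ordering by more than the margin.
print(loss_prediction_loss([2.0, 0.5], [3.0, 1.0]))  # 0.0
# Predicted ordering is reversed, so the pair is penalised.
print(loss_prediction_loss([0.5, 2.0], [3.0, 1.0]))  # 2.5
```

Note that the pairwise loss only cares about the ordering of the predicted losses within a pair, not their absolute values, which is why a sign and a margin are enough.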

@blue-xuhu
Author

Thanks for your reply about the loss function.
I ran into another problem during the experiment. In the paper, after the first training cycle on the initial labeled set, the proposed method and the other methods achieve almost the same accuracy. I followed your code and tried many times, but could not reproduce this. The method in this paper uses both the target model and the lossnet, while the other methods use only the target model. So how did they get the same accuracy in the first training cycle?

@Bardielz

Bardielz commented Dec 2, 2020

Thanks for your reply about the loss function.
I ran into another problem during the experiment. In the paper, after the first training cycle on the initial labeled set, the proposed method and the other methods achieve almost the same accuracy. I followed your code and tried many times, but could not reproduce this. The method in this paper uses both the target model and the lossnet, while the other methods use only the target model. So how did they get the same accuracy in the first training cycle?

Hello, did you solve this problem?
