Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

output module LR/SVM #2

Closed
berlino opened this issue Jun 11, 2017 · 3 comments
Closed

output module LR/SVM #2

berlino opened this issue Jun 11, 2017 · 3 comments

Comments

@berlino
Copy link

berlino commented Jun 11, 2017

Thank you for this awesome repo.
This is not actually a code issue, I'm just curious to ask. Do you have any idea why do we need an extra linear model or SVM for the prediction? I mean this module doesn't go through the backpropagation at all.
Or do you find some improvements using this LR or SVM compared with its fully connected output layer?

Thanks

@berlino
Copy link
Author

berlino commented Jun 11, 2017

I read through the paper, still don't get the point.

@galsang
Copy link
Owner

galsang commented Jun 12, 2017

In the paper, 5.1.3 may help.

5.1.3 Classifier
We found that performance increases if we do
not use the output of the LR layer as the final decision,
but instead train linear SVM or logistic regression
with default parameters directly on the input
to the LR layer (i.e., on the kn similarity scores
that are generated by the k-block stack after network
training is completed). Direct training of SVMs/LR
seems to get closer to the global optimum than gradient
descent training of CNNs.

I think this is a kind of empirical(or heuristic) optimizations.
Actually, the performance seems slightly better when applying those additional modules.

@berlino
Copy link
Author

berlino commented Jun 12, 2017

It's sort of weird to me since the fully connected output layer is equivalent to linear model theoretically.
Thanks.

@berlino berlino closed this as completed Jun 12, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants