-
Notifications
You must be signed in to change notification settings - Fork 122
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
output module LR/SVM #2
Comments
I read through the paper, still don't get the point. |
In the paper, 5.1.3 may help.
I think this is a kind of empirical(or heuristic) optimizations. |
It's sort of weird to me since the fully connected output layer is equivalent to linear model theoretically. |
Thank you for this awesome repo.
This is not actually a code issue, I'm just curious to ask. Do you have any idea why do we need an extra linear model or SVM for the prediction? I mean this module doesn't go through the backpropagation at all.
Or do you find some improvements using this LR or SVM compared with its fully connected output layer?
Thanks
The text was updated successfully, but these errors were encountered: