-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Do the last FC layers get out of the Hyperbolic space? #3
Comments
The final MLP uses the similarity scores to do the prediction part so its parameters are in the Euclidean space and are updated via vanille optimization methods. Only the word embeddings and label embeddings (and the bias in the hyperbolic RNN/GRU) are located in the hyperbolic space and are updated with Riemannian optimization methods. |
Does Riemannian Optimizer distinguish Euclidean parameters with Hyperbolic parameters? Lines 41 to 45 in c257d1c
|
For parameters in the hyperbolic manifold, RiemannianAdam is applied, while vanilla Adam is applied to other parameters in the Euclidean space. |
The network architecture begins with a few hyperbolic layers but ends with Euclidean layers as states in:
HyperIM/net/HyperIM.py
Line 32 in c257d1c
The full architecture was:
How can we optimize with Riemann SGD if not all the parameters are on hyperbolic space?
The text was updated successfully, but these errors were encountered: