
Observing very strong Euclidean baseline results for reconstruction ... #35

Open
0xSameer opened this issue Jun 24, 2019 · 7 comments

@0xSameer

Hi,

I was able to replicate the results for the Poincaré and Lorentz manifolds as reported in your publications. However, when recreating the Euclidean baselines, I am seeing much stronger reconstruction scores than those reported. For example, with the following changes to ./train-nouns.sh:

-manifold euclidean
-dims 200
-lr 1.0

After just 200 epochs, we get:

json_stats: {"epoch": 199, ..., "mean_rank": 1.69, "map_rank": 0.90}

And after 1400 epochs, we get:

"mean_rank": 1.19, "map_rank": 0.95

No other changes were made to the code. Are we doing something wrong?
Note that we had to add an entry to the train-nouns.sh script for the Euclidean manifold, and we used the same learning rate as specified for the Poincaré manifold (1.0) rather than the default of 1000.0 set in the code.

Thanks!
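For context, the mean_rank and map_rank numbers above measure reconstruction quality: for every ground-truth edge, the true neighbour's rank among all candidate nodes (sorted by embedding distance) is recorded, and the mean rank and mean average precision are taken over all edges. Below is a minimal sketch of that computation, assuming NumPy and a generic distance function. It is illustrative only, not the repository's actual evaluation code (which, among other details, may exclude other true neighbours from each candidate list):

```python
import numpy as np

def eval_reconstruction(embeddings, neighbours, dist_fn):
    """Illustrative mean-rank / MAP computation for reconstruction.

    embeddings: (n, d) array of node embeddings
    neighbours: dict mapping each node to the set of its true neighbours
    dist_fn:    distance function from one point to all points
    """
    ranks, ap_scores = [], []
    for u, nbrs in neighbours.items():
        dists = dist_fn(embeddings[u], embeddings)  # distance to all nodes
        dists[u] = np.inf                           # exclude the node itself
        order = np.argsort(dists)
        # 1-based rank of every true neighbour in the sorted candidate list.
        nbr_ranks = np.array([int(np.where(order == v)[0][0]) + 1 for v in nbrs])
        ranks.extend(nbr_ranks.tolist())
        # Average precision over this node's neighbour set.
        sorted_r = np.sort(nbr_ranks)
        ap_scores.append(np.mean(np.arange(1, len(sorted_r) + 1) / sorted_r))
    return float(np.mean(ranks)), float(np.mean(ap_scores))

# Toy usage with Euclidean distance:
emb = np.random.randn(5, 2)
nbrs = {0: {1, 2}, 1: {0}, 2: {0}}
euclid = lambda x, Y: np.linalg.norm(Y - x, axis=-1)
print(eval_reconstruction(emb, nbrs, euclid))
```

A mean rank near 1 and a MAP near 1 mean the true neighbours are almost always the closest points in the embedding, which is why the Euclidean numbers reported above look surprisingly strong.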

@0xSameer (Author)

Correction to the hyperparameter description above. We are using:

-dim 200

not "-dims".

@mnick (Contributor)

mnick commented Aug 29, 2019

I wanted to quickly follow up on our separate email conversation: we identified the issue and are finishing additional experiments. I will update this issue in the next few days with the results. Thanks again for filing this!

@HHalva

HHalva commented Jul 24, 2020

I am still seeing this. Is there any update? It would be very important for everyone to know what's going on. Are the results presented in the NIPS'17 paper wrong or misleading?

@martinwhl

Same situation here. Is there any update on this?

@mnick (Contributor)

mnick commented Sep 1, 2020

Thank you for raising this again, and sorry for the delay. In addition to our follow-ups over email, we should have updated this issue on GitHub as well.

Basically: the reason for the stronger Euclidean baseline with the open-sourced code is that the paper used a different setting, in which the Euclidean embeddings were regularized (similar to previous work). When open-sourcing the code we disabled this regularization by default, and it turned out to work better (as pointed out by Sameer). Since it led to a stronger Euclidean baseline in higher dimensions, we decided to keep it that way in the code. Hyperbolic embeddings provide a substantial performance improvement in lower dimensions, which is really the main focus of this work.
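The thread does not spell out the exact form of the regularization, but for Euclidean embeddings it typically means adding an L2 penalty on the embedding norms to the ranking loss. A minimal PyTorch sketch under that assumption follows; regularized_step_loss and reg_lambda are illustrative names, not the repository's actual code:

```python
import torch

def regularized_step_loss(base_loss, embedding, batch_idx, reg_lambda=1.0):
    """Illustrative: base ranking loss plus an L2 penalty on the norms
    of the Euclidean vectors touched in this batch.

    base_loss:  ranking loss for the batch (scalar tensor)
    embedding:  torch.nn.Embedding holding the Euclidean vectors
    batch_idx:  LongTensor of node indices used in this step
    reg_lambda: hypothetical penalty strength; 0 recovers the
                unregularized default described above
    """
    vecs = embedding(batch_idx)               # (batch, dim)
    penalty = vecs.pow(2).sum(dim=-1).mean()  # mean squared norm
    return base_loss + reg_lambda * penalty
```

With reg_lambda = 0 this reduces to the open-sourced default, where nothing constrains the norms: in high dimensions the unconstrained Euclidean embeddings then have enough capacity to fit the reconstruction task very well.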

@martinwhl

Sorry, I'm not sure I fully understand how the Euclidean embeddings were regularized... could you please explain a little more? Many thanks.

@davecerr

@martinwhl I think the idea is that since there is exponentially more "space" near the boundary of the Poincaré ball, the easiest way for the algorithm to minimise the loss is to push all nodes outwards. This is a form of overfitting, since we ideally want nodes that are higher in the original hierarchy to be kept closer to the centre of the ball. I believe this is achieved by regularising the norm of v in equation 6 of the paper: for every parent (v) / child (u) relationship we consider, we are always encouraging parents (nodes higher in the hierarchy) to stay closer to the origin.
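To make that concrete, here is one way such an asymmetric penalty could look, where only the parent side of each (child u, parent v) pair is pulled toward the origin. This is my reading of the comment above, not code from the repository; loss_with_parent_penalty and lam are hypothetical names:

```python
import torch

def loss_with_parent_penalty(dist_loss, parent_vecs, lam=0.1):
    """Illustrative: ranking loss plus a norm penalty on parents only.

    dist_loss:   base ranking loss computed from distances (scalar tensor)
    parent_vecs: embeddings of the parent nodes v in the batch
    lam:         hypothetical penalty strength
    """
    # Only parents are penalized, so nodes higher in the hierarchy are
    # encouraged to stay near the centre while children may sit further out.
    parent_penalty = parent_vecs.pow(2).sum(dim=-1).mean()
    return dist_loss + lam * parent_penalty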
