Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can't reproduce reported results #17

Open
ruoshiliu opened this issue Jul 28, 2020 · 8 comments
Open

Can't reproduce reported results #17

ruoshiliu opened this issue Jul 28, 2020 · 8 comments

Comments

@ruoshiliu
Copy link

Hi, I ran the exact argument you provided in the README.md file
python run.py --data_dir=./data --num_epochs=30 --log_dir=./logs --batch_size=1024 --num_layers=2 --cell_type=hyp_gru
which yields a loss of 0.678 and precision of ~56%
I also ran
python run.py --data_dir=./data --num_epochs=30 --log_dir=./hyp_gru_hyp_decision --batch_size=1024 --num_layers=2 --cell_type=hyp_gru --decision_type hyp
which yields a loss of 0.693 and precision of ~50%
Do you know any potential reason for that?

@ruoshiliu ruoshiliu changed the title Can't obtain reported results Can't reproduce reported results Jul 28, 2020
@ferrine
Copy link
Owner

ferrine commented Jul 28, 2020

hmm, that is strange. We had updates in Geoopt and implementation of Poincare model is different now, this might affect the results. Did you check the old version of geoopt (before Stereographic is merged)?

@ruoshiliu
Copy link
Author

Yes, I installed version 0.1.2 of geeopt which is before the Stereographic update.

@ferrine
Copy link
Owner

ferrine commented Jul 28, 2020

That's a problem smth really seems to be broken. How does the training curve look like?

@ruoshiliu
Copy link
Author

ruoshiliu commented Jul 28, 2020

Please check the following tensorboard results, with orange being train curves and blue being valid curves. I obtained the following results with python run.py --data_dir=./data --num_epochs=30 --log_dir=./logs --batch_size=1024 --num_layers=2 --cell_type=hyp_gru
Screen Shot 2020-07-28 at 13 45 30
Screen Shot 2020-07-28 at 13 45 39
Screen Shot 2020-07-28 at 13 46 01

@SkyFishMoon
Copy link

It occurs also in the new version. But I found that if decode using method in another paper named Hyperbolic GCN, the precision will arise a lot, but still can't get the result in original paper

@ferrine
Copy link
Owner

ferrine commented Aug 30, 2020

From the curves I can't say the model has converged, did you try training more?

@ferrine
Copy link
Owner

ferrine commented Aug 30, 2020

Other suggestion is to increase learning rate a bit.

@SkyFishMoon
Copy link

This is the validation
image
and train result
image
a little rise in precision, but I didn't use the dist2plane as the last layer. I just use a linear + logmap0 + logsoftmax + nllloss as the last layer.
Another problem is I find that some parameters in your model are general torch.Parameter rather than geoopt.ManifoldParameter. Only the bias parameters are geoopt.ManifoldParameter. Does this matter?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants