Skip to content
This repository has been archived by the owner on Dec 5, 2023. It is now read-only.

Training log details #4

Open
dandelin opened this issue Aug 24, 2023 · 0 comments
Open

Training log details #4

dandelin opened this issue Aug 24, 2023 · 0 comments

Comments

@dandelin
Copy link

Hi! Thank you so much for sharing your work :)

Actually, I've been re-implementing MERU and found that the learnable curvature quickly drops to its lower bound of 0.1.
Also, the entailment loss was near zero during the training. (Though zero-shot performances on the classification and retrieval were fine.)

As I'm unsure if this is normal, before scrutinizing your code and re-running the training session with this repo, I would be grateful if you could share the training logs.

Thanks!

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant