Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Having trouble reproducting results in the paper #18

Closed
sqsalt opened this issue Sep 22, 2021 · 1 comment
Closed

Having trouble reproducting results in the paper #18

sqsalt opened this issue Sep 22, 2021 · 1 comment

Comments

@sqsalt
Copy link

sqsalt commented Sep 22, 2021

Thanks for the amazing idea proposed in your paper.
But I have some trouble reproducting results on SIGHAN15 dataset. I trained a TtT model using HybirdSet train set for almost 40 epoches, achiving 98% ACC on HybirdSet dev set,but performs 0.77 F1. Even worse, it decreases sharply to P/R/F1 0.50/0.76/0.61 on SIGHAN15 test set.
Without detailed training settings or open resource model, I have no idea of which part is failed in my model, so here are the questions:

  1. what is the ideal training loss on HybirdSet, containing nll_loss and crf loss
  2. what is the expected ACC/F1 on HybirdSet
  3. is the weight of fc layer(language model layer) randomly initialed or copied from embedding layer of BERT
    Thanks!
@lipiji
Copy link
Owner

lipiji commented Dec 8, 2021

Will release it before 31 Dec.

@lipiji lipiji closed this as completed Dec 8, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants