Unable to reproduce F1-scores for the monolingual case #6

Closed
siddjags opened this issue Apr 12, 2021 · 3 comments

Comments

@siddjags

I am trying to reproduce the results reported in Table 3 of the paper. So far, I have tried "bert-base-multilingual-cased/uncased" as the pre-trained model for the French dataset (9a). However, my results on the test set do not match those reported in the paper. I am not sure whether I am using the correct pre-trained model and Python packages. Several package versions in requirements.txt seem outdated and cause the training/inference script to break.

Could you please provide some guidance on how I should go about this?

Thanks in advance.

@punyajoy
Member

You fine-tuned BERT base uncased, right? How large is the difference in the results?

@siddjags
Author

Yes, I fine-tuned BERT base uncased from Hugging Face. The results on the French dataset (training size = 16) are as follows:
Test F1 = 0.43418500716755204
Test accuracy = 0.5491803278688525
These values differ considerably from the results reported in the paper.

@punyajoy
Member

Is this from a single run, or averaged over multiple runs?
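
For comparing a single run against an average, a minimal sketch of how per-run scores could be summarized is shown here. The helper name `summarize_runs` and the scores in `f1_by_seed` are hypothetical placeholders, not values from this repository or the paper; the idea is only to repeat training with different random seeds and report the mean and spread.

```python
from statistics import mean, stdev


def summarize_runs(scores):
    """Return (mean, sample standard deviation) of per-run scores."""
    if not scores:
        raise ValueError("need at least one run")
    # The sample standard deviation is undefined for one run; report 0.0.
    spread = stdev(scores) if len(scores) > 1 else 0.0
    return mean(scores), spread


# Hypothetical placeholder scores, one per random seed; not real results.
f1_by_seed = {1: 0.40, 2: 0.50, 3: 0.60}

avg, spread = summarize_runs(list(f1_by_seed.values()))
print(f"F1 over {len(f1_by_seed)} runs: {avg:.3f} +/- {spread:.3f}")
```

With very small training sets, scores can vary noticeably from seed to seed, so a single run may sit far from the average.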
