New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Vocab is not accessible, it is in gs://t5-data #14
Comments
Hey @cnut1648 , Sorry for the late response. On the Chemprot task, we found out that we used the same eval approach as in scibert. Therefore, Chemprot result reported in the paper is not comparable to other recent works which use different eval approaches. The new results for Scifive on chemprot should be: Let me know if you still want the code to reproduce this result |
Hi @justinphan3110 thank you for the info! |
@cnut1648 I have just updated the script to use the similar chemprot dataset as the BLURB Leaderboard. You can access it here. Let me know if you still have any question |
Hi @justinphan3110 thanks! I can reproduce the results now! Great work! |
Hello, first of all, I want to say nice work!
When I want to reproduce your results on chemprot, I notice the following auth issue in the code
Turns out that this is caused by not being able to find vocab which is in
'gs://t5-data/vocabs/cc_all.32000/sentencepiece.model'
. But currently onlygs://scifive
is accessible.Could you please release the vocab or share with us how exactly did you obtain the sentencepiece vocab so that we can reproduce the results? Thank you!
The text was updated successfully, but these errors were encountered: