Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LANGMODEL_DIR=<Path to BioLM: it should be trained on SQuAD> #1

Open
gongye19 opened this issue Aug 22, 2022 · 1 comment
Open

LANGMODEL_DIR=<Path to BioLM: it should be trained on SQuAD> #1

gongye19 opened this issue Aug 22, 2022 · 1 comment

Comments

@gongye19
Copy link

Sorry to bother you, but where could i find the pretrained LM on SQuAD, it seems the model in this work was not provided.

@wonjininfo
Copy link
Member

wonjininfo commented Aug 22, 2022

Hi @gongye19 ,
Thank you for your interest in our work.

Yes, we did not release the SQuAD-trained checkpoint at the moment, but you can train it with the same code in the repository.

We will further consider releasing SQuAD-trained checkpoints soon, but this will take some time. So I'd suggest you train the model by yourself.


Script for training on the SQuAD dataset is also available in the example script file. (Please note that the training codes for SQuAD are commented out.)

For example, the following codes set hyper-parameters for training the model on the SQuAD dataset.

# To train LMs on SQuAD, un-comment these lines
#export LANGMODEL_DIR=dmis-lab/biobert-v1.1
#export DATA_DIR=<Path to DATA>/DATA/SQuAD-SeqTagQA/squad-20201030
#export MAX_EPOCH=50
#export LEARN_RATE=5e-5
#export OUTPUT_DIR=<Output path>/20220130-squad_${METHOD}_lr${LEARN_RATE}_iter-${NUM_ITER}

(Please also comment out the # To train LMs on List-question dataset part in the following lines in the same file).

Thank you!
Best,
Wonjin

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants