
Question about QA-fine-tuning #16

Closed
czh17 opened this issue Aug 4, 2022 · 7 comments

@czh17

czh17 commented Aug 4, 2022

Hi Apoorv, nice work! I have a question about QA fine-tuning.
I experimented with the MetaQA dataset using the code on the apoorv-dump branch, with the following training details:

  1. model_size: T5-small
  2. checkpoint: 3330000.pt (KGC results on Wikidata5M: 21.6 Hits@1)
  3. epochs: 60, batch_size: 64
  4. INPUT: 'predict answer: Topic Entity token | question token with NE |' OUTPUT: 'answer token' (built as sketched below)
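Concretely, a minimal sketch of how these input/output strings are built (function and field names are illustrative, not the repo's exact code):

```python
def format_qa_example(question_with_ne: str, topic_entity: str, answer: str):
    """Build a (source, target) pair for T5 QA fine-tuning."""
    source = f"predict answer: {topic_entity} | {question_with_ne} |"
    target = answer
    return source, target

# Example (MetaQA 1-hop style):
src, tgt = format_qa_example(
    question_with_ne="what movies are about [ginger rogers]",
    topic_entity="ginger rogers",
    answer="Top Hat",
)
print(src)  # predict answer: ginger rogers | what movies are about [ginger rogers] |
print(tgt)  # Top Hat
```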

However, the best accuracy of my model on the QA test set was only 40.7%/12.9%/26.6% (1-hop/2-hop/3-hop).
Am I missing some detail in the experiment that makes it less accurate? Please let me know. It would also be great if you could share a checkpoint with high accuracy.

@apoorvumang
Owner

Hi @czh17, thanks for your interest.

Can you give more details on how you trained/pretrained, i.e. the exact commands you ran and the dataset processing you did?

For the results reported in the paper, you need to pretrain on the MetaQA KG, not on Wikidata5M. Subsequently, you need to fine-tune on the QA dataset.

@czh17
Author

czh17 commented Aug 11, 2022

Thank you for your reply.

> For the results reported in the paper, you need to pretrain on the MetaQA KG, not on Wikidata5M. Subsequently, you need to fine-tune on the QA dataset.

Yes, I realized the problem you pointed out, so I re-ran the whole experiment. However, it still did not work very well. The details of the experiment and the exact commands are as follows.

For KGC pretraining on the MetaQA KG:

  • dataset: 'data_kgqa/MetaQA_1hop_half/train_kgc_lines.txt' (only)
  • optimizer: Adafactor
  • learning_rate: 1e-4
  • epochs: 200
  • model_size: small

In this stage, I used main_accelerate.py from the main branch for training. I observed that the loss did not decrease steadily; instead it oscillated from small to large and back again, e.g. epoch loss: 100 -> 500 -> 2000 -> 90 -> 400. I also tried learning rates of 1e-5 and 1e-6, but the problem was not alleviated.
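For reference, my optimizer setup roughly follows the usual transformers Adafactor recipe (a sketch with illustrative names, not the repo's exact code):

```python
from transformers import T5ForConditionalGeneration
from transformers.optimization import Adafactor

model = T5ForConditionalGeneration.from_pretrained("t5-small")

# A fixed learning rate requires relative_step=False; transformers' Adafactor
# cannot combine a manual lr with its default relative_step=True schedule.
optimizer = Adafactor(
    model.parameters(),
    lr=1e-4,
    scale_parameter=False,
    relative_step=False,
    warmup_init=False,
)
```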

For KBQA fine-tuning on MetaQA:

  • dataset: f'data_kgqa/MetaQA_{hops}hop_half/train.txt' (hops in [1, 2, 3]) and 'qa_test.txt'
  • optimizer: Adafactor
  • learning_rate: 1e-4
  • epochs: 60
  • checkpoint: (the KGC checkpoint with the smallest loss).pt, restored as sketched below
  • beam_size: 1
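The checkpoint is restored before fine-tuning roughly as follows (a sketch; the file name and state-dict layout are assumptions, not the repo's exact code):

```python
import torch
from transformers import T5ForConditionalGeneration

# Load the pretrained KGC weights into a fresh T5 before QA fine-tuning.
model = T5ForConditionalGeneration.from_pretrained("t5-small")
state_dict = torch.load("checkpoints/smallest_loss_kgc.pt", map_location="cpu")
model.load_state_dict(state_dict)
```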

In this stage, I also used main_accelerate.py from the main branch for training. For inference, each QA pair in qa_test.txt is converted to the form 'predict answer: Topic Entity token | question token with NE |\t answer token'. Meanwhile, I rewrote the eval function based on eval_accelerate.py from the apoorv-dump branch; its criterion is that an answer is judged correct if the token generated by the model is in the answer list.
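The rewritten eval logic is roughly the following (a sketch assuming a plain Hugging Face tokenizer/model rather than the repo's exact classes):

```python
import torch
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small").eval()

def is_correct(question_input: str, gold_answers: set) -> bool:
    # Decode with beam size 1; a prediction counts as a hit
    # if it appears in the gold answer set.
    inputs = tokenizer(question_input, return_tensors="pt")
    with torch.no_grad():
        out = model.generate(**inputs, num_beams=1, max_length=64)
    prediction = tokenizer.decode(out[0], skip_special_tokens=True).strip()
    return prediction in gold_answers
```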

Please let me know if there are any mistakes or details that I should have noticed in the above training process. Thanks again for your reply.

@apoorvumang
Owner

Hmm, it seems weird that the loss fluctuates like that. Can you please post the exact commands you executed?

@apoorvumang
Owner

Also, I would suggest you take a look at #11 as well, for details on how you can train the model in one go (concatenating the QA and KGC lines), roughly as sketched below.
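A rough sketch of that concatenation (paths are illustrative; see #11 for the actual recipe):

```python
# Merge the QA and KGC training lines into a single training file.
qa_path = "data_kgqa/MetaQA_1hop_half/train.txt"
kgc_path = "data_kgqa/MetaQA_1hop_half/train_kgc_lines.txt"

with open("data_kgqa/MetaQA_1hop_half/train_combined.txt", "w") as out:
    for path in (qa_path, kgc_path):
        with open(path) as f:
            out.writelines(f)
```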

I will try to post the pretrained checkpoints soon as well.

@czh17
Author

czh17 commented Aug 15, 2022

> Hmm, it seems weird that the loss fluctuates like that. Can you please post the exact commands you executed?

Yes, this loss fluctuation is very confusing to me. Here is the command I executed:

python main_accelerate.py --save_prefix MetaQA_kgc_200_epoch --model_size base --dataset data_kgqa/MetaQA_1hop_half --split train_kgc_lines --batch_size 64 --save_steps 5000 --loss_steps 500 --learning_rate 0.0001

For this experiment, I changed line 139 of main_accelerate.py to T5ForConditionalGeneration.from_pretrained('t5-base').
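i.e. the model construction at that line now reads (variable name illustrative):

```python
from transformers import T5ForConditionalGeneration

# Line 139: load the base-size T5 instead of the default size.
model = T5ForConditionalGeneration.from_pretrained('t5-base')
```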

@apoorvumang
Owner

Let me look into it and get back to you; sorry for the delay.

@czh17
Author

czh17 commented Sep 29, 2022

Would you mind sharing the code for the KBQA fine-tuning? It is very important for my research work. Thanks again.

@czh17 czh17 closed this as completed Oct 28, 2022