Hi,
Thanks for open-sourcing the code for the paper. I'm very interested in the zero/few-shot ability shown in your work. While trying to evaluate the model under the zero-shot MLM+ITM setting in `run_gqa_prompt_zero_few.py`, I found that the ITM head (i.e. `model.cls_ans.seq_relationship`) is newly initialized, has a different shape from the original pretrained ITM head, and is not actually used in the evaluation. Am I misusing the code, or does the ITM head not participate in the evaluation?
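To make the question concrete, here is a minimal sketch of the kind of check that surfaces such a mismatch. It is not code from this repository: the helper `compare_head_shapes` is hypothetical, and the parameter names and shapes in the example are made up purely for illustration.

```python
from typing import Dict, List, Tuple

Shape = Tuple[int, ...]


def compare_head_shapes(
    checkpoint_shapes: Dict[str, Shape],
    model_shapes: Dict[str, Shape],
    prefix: str,
) -> List[str]:
    """List model parameters under `prefix` that the checkpoint cannot fill.

    A parameter is reported if it is absent from the checkpoint (so it keeps
    its fresh initialization) or if its shape differs from the checkpoint's.
    """
    report = []
    for name, shape in sorted(model_shapes.items()):
        if not name.startswith(prefix):
            continue
        ckpt_shape = checkpoint_shapes.get(name)
        if ckpt_shape is None:
            report.append(f"{name}: not in checkpoint")
        elif ckpt_shape != shape:
            report.append(f"{name}: checkpoint {ckpt_shape} vs model {shape}")
    return report


# Illustrative shapes only; these are NOT the real shapes from the repository.
checkpoint_shapes = {
    "cls_ans.seq_relationship.weight": (2, 768),
    "cls_ans.predictions.bias": (30522,),
}
model_shapes = {
    "cls_ans.seq_relationship.weight": (1, 768),
    "cls_ans.seq_relationship.bias": (1,),
    "cls_ans.predictions.bias": (30522,),
}

report = compare_head_shapes(
    checkpoint_shapes, model_shapes, prefix="cls_ans.seq_relationship"
)
for line in report:
    print(line)
```

With PyTorch, the two shape dictionaries could be built from the loaded checkpoint and from `model.state_dict()`, e.g. `{k: tuple(v.shape) for k, v in model.state_dict().items()}`.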
Thanks a lot for your time and attention.