The MetaQA dataset #125

Open
LinxiCai opened this issue Nov 10, 2022 · 2 comments

@LinxiCai

When I read train.txt, valid.txt and test.txt in data/MetaQA, I found that the triples in test.txt are also included in train.txt. Could you explain why this is the case?
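
For reference, the overlap can be checked with a short script like the one below (this sketch assumes the KG files contain tab-separated head/relation/tail triples, one per line):

```python
# Count how many test.txt triples also appear in train.txt.
# Assumes tab-separated "head<TAB>relation<TAB>tail" lines, one triple per line.
def load_triples(path):
    with open(path, encoding="utf-8") as f:
        return {tuple(line.rstrip("\n").split("\t")) for line in f if line.strip()}

train = load_triples("data/MetaQA/train.txt")
test = load_triples("data/MetaQA/test.txt")

overlap = test & train
print(f"{len(overlap)} of {len(test)} test triples also appear in train.txt")
```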

@apoorvumang
Collaborator

Hi LinxiCai, thanks for your interest.

We studied MetaQA for the QA task, not the KG completion task. We want to pretrain on the whole KG (or 50% of the KG, depending on the setting) and then finetune for QA. The test.txt and valid.txt triples exist only for compatibility with KGE implementations, which require separate validation and test triples. So we simply copied triples from train.txt to test.txt to maintain compatibility.
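
In other words, the extra split files are just placeholders. A sketch of the idea (not necessarily the exact script we used) is simply to re-emit the training triples under the file names a KGE toolkit expects:

```python
import shutil

# Illustrative placeholder splits: reuse the training triples so a KGE toolkit
# that expects separate train/valid/test files can still run. Nothing is held out.
for name in ("valid.txt", "test.txt"):
    shutil.copyfile("data/MetaQA/train.txt", f"data/MetaQA/{name}")
```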

@LinxiCai
Author

OK, thanks a lot! I understand. By the way, I have another question: when I train embeddings for the MetaQA triples, will there be overfitting if I don't use dropout or batch normalization? Or could you share the training arguments you used to obtain your MetaQA embeddings?
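
For context, here is my current understanding of where dropout and batch normalization would sit in a ComplEx-style scorer (a rough sketch; the embedding dimension, dropout rate and layer placement are my guesses, not your actual settings):

```python
import torch
import torch.nn as nn

# Illustrative ComplEx-style scorer showing where dropout / batch norm are usually applied.
# Hyperparameters and layer placement are assumptions, not the repository's actual code.
class ComplExScorer(nn.Module):
    def __init__(self, num_entities, num_relations, dim=200, dropout=0.3):
        super().__init__()
        self.ent = nn.Embedding(num_entities, 2 * dim)   # real and imaginary parts
        self.rel = nn.Embedding(num_relations, 2 * dim)
        self.bn = nn.BatchNorm1d(2 * dim)                # batch norm over embedding features
        self.drop = nn.Dropout(dropout)                  # dropout as regularization

    def forward(self, head_idx, rel_idx):
        h = self.drop(self.bn(self.ent(head_idx)))
        r = self.drop(self.rel(rel_idx))
        h_re, h_im = h.chunk(2, dim=-1)
        r_re, r_im = r.chunk(2, dim=-1)
        e_re, e_im = self.ent.weight.chunk(2, dim=-1)
        # ComplEx score against all candidate tails: Re(<h, r, conj(t)>)
        re_part = h_re * r_re - h_im * r_im
        im_part = h_re * r_im + h_im * r_re
        return re_part @ e_re.t() + im_part @ e_im.t()
```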
