This repository has been archived by the owner on Feb 3, 2023. It is now read-only.

Rationalise Scripts and Run Final Experiments #15

Closed
5 of 7 tasks
j6mes opened this issue Dec 5, 2017 · 2 comments

@j6mes
Member

j6mes commented Dec 5, 2017

To run

  • MLP: Train on FNC, Evaluate on FNC, Evaluate on FEVER 3 way
  • MLP: Train on FEVER with sampled negative pages, Test
  • MLP: Train on FEVER with IR negative pages, Test
  • DR: Final score for recall/precision/MRR
  • DR: Score using Oracle RTE component
  • RTE: Pre-trained model, evaluate on FEVER
  • RTE: Train on FEVER bodies, evaluate on FEVER
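
For the DR recall/precision/MRR item in the list above, a minimal sketch of how the per-claim scores could be computed and averaged. The function names, the `ranked`/`gold` inputs and the default cut-off of 5 are assumptions made for illustration, not the project's actual scripts.

```python
from typing import Dict, List, Set


def retrieval_metrics(ranked: List[str], gold: Set[str], k: int = 5) -> Dict[str, float]:
    """Score one claim: recall@k, precision@k and reciprocal rank.

    `ranked` is the retriever's page list, best first (assumed to contain
    no duplicates); `gold` is the set of pages annotated as evidence.
    Both names are hypothetical.
    """
    top_k = ranked[:k]
    hits = {page for page in top_k if page in gold}
    recall = len(hits) / len(gold) if gold else 0.0
    precision = len(hits) / len(top_k) if top_k else 0.0
    # Reciprocal rank of the first gold page within the top k (0 if none).
    rr = 0.0
    for rank, page in enumerate(top_k, start=1):
        if page in gold:
            rr = 1.0 / rank
            break
    return {"recall": recall, "precision": precision, "rr": rr}


def mean_metrics(per_claim: List[Dict[str, float]]) -> Dict[str, float]:
    """Average the per-claim scores; the mean of `rr` is the MRR."""
    n = len(per_claim)
    if n == 0:
        return {"recall": 0.0, "precision": 0.0, "mrr": 0.0}
    return {
        "recall": sum(m["recall"] for m in per_claim) / n,
        "precision": sum(m["precision"] for m in per_claim) / n,
        "mrr": sum(m["rr"] for m in per_claim) / n,
    }
```

Claims with no gold pages score 0 recall here; whether such claims should be skipped instead is a design choice this sketch does not settle.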

Extra:

  • BiDAF: Precision/Recall of pretrained model
  • BiDAF: FEVER Accuracy using pretrained model on DrQA Pages
  • RTE: Train on BiDAF retrieved model: evaluate P/R of BiDAF. Evaluate FEVER score
@andreasvlachos
Member

andreasvlachos commented Dec 5, 2017 via email

@j6mes
Member Author

j6mes commented Dec 6, 2017

The DR has no parameters, so there's nothing to learn.
Taking the top 5 articles at the moment. Will also try taking all articles above a threshold.
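
The two selection strategies mentioned here (a fixed top 5 versus everything above a threshold) could look roughly like this. It is a sketch only: the `(page, score)` input format, the function names and the assumption that a higher score means a better match are all hypothetical.

```python
from typing import List, Tuple

ScoredPage = Tuple[str, float]  # (page title, retrieval score); hypothetical format


def select_top_k(scored: List[ScoredPage], k: int = 5) -> List[str]:
    """Keep the k highest-scoring pages, best first."""
    ranked = sorted(scored, key=lambda item: item[1], reverse=True)
    return [page for page, _ in ranked[:k]]


def select_above_threshold(scored: List[ScoredPage], threshold: float) -> List[str]:
    """Keep every page whose score is at least `threshold`, best first."""
    ranked = sorted(scored, key=lambda item: item[1], reverse=True)
    return [page for page, score in ranked if score >= threshold]
```

The top-k variant always returns the same number of pages per claim, while the threshold variant can return none or many, which trades precision against recall differently for each claim.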

The only metric I've computed so far is recall, but testing with an oracle RTE is a good idea and easy for me to do too.
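
One way the oracle RTE test could be scored, as a sketch under stated assumptions: the oracle is taken to predict the correct label whenever the retrieved pages cover at least one complete gold evidence group, and claims that need no evidence always count as correct. The `examples` format and the function name are hypothetical.

```python
from typing import List, Set, Tuple

# One example: (retrieved pages, gold evidence groups). Each group is a set
# of pages that together suffice to verify the claim. Hypothetical format.
Example = Tuple[List[str], List[Set[str]]]


def oracle_rte_accuracy(examples: List[Example]) -> float:
    """Upper bound on accuracy if the RTE step never errs given full evidence."""
    if not examples:
        return 0.0
    correct = 0
    for retrieved, gold_groups in examples:
        retrieved_set = set(retrieved)
        if not gold_groups:
            # Assumption: a claim with no required evidence is always correct.
            correct += 1
        elif any(group <= retrieved_set for group in gold_groups):
            # At least one complete evidence group was retrieved.
            correct += 1
    return correct / len(examples)
```

Under these assumptions the score isolates retrieval errors, since every miss is a claim whose evidence was never retrieved.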

j6mes closed this as completed Dec 12, 2017