Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Collection of some problems in the first 600 lines of the training dataset #4

Open
graue70 opened this issue Sep 3, 2021 · 0 comments

Comments

@graue70
Copy link

graue70 commented Sep 3, 2021

These are some problems with the training dataset that I found by skimming the first 600 lines:

  • Is it true Jeff Bridges occupation Lane Chandler and photographer ? (question makes no sense)
  • Judi Densch (typo in question)
  • What is the boiling point of pressure copper as 4703.0? (question and paraphrase completely incorrect -> 'At which pressure does copper have a boiling point of 4703.0?')
  • Who Sleepwalking succeeded in playing Sleepwalking? (paraphrase same as question, makes no sense)
  • Could you summarize Korea's history of this topic? (question and Wikidata query make no sense)
  • Which is {landscape of} of {Virgin of the rocks}, which has {birth city} is {Tzippori} ? (question and paraphrase contain template strings, also additional quotes (\"))
  • How many dimensions have a Captain America? (question makes no sense)
  • What is the {neighborhood} for {shares border with} of {Los Angeles} (no question)
  • What sister city was born in of Zakhar Oskotsky? (question and paraphrase make no sense -> 'What are sister cities of the birth place of Zakhar Oskotsky?')
  • What is the musical score by Missa Solemnis that has mother Maria Magdalena van Beethoven? (question and paraphrase make no sense -> 'Which child of Maria Magdalena van Beethoven wrote the score Missa Solemnis?')
  • When did Robert De Nirolive in Marbletown? (typo in paraphrase)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant