Skip to content

monk1337/Awesome-Question-Answering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 

Repository files navigation

Awesome-Question-Answering

Awesome Question Answering

  1. MedMCQA: A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering https://proceedings.mlr.press/v174/pal22a.html

  2. Medical Exam Question Answering with Large-scale Reading Comprehension https://arxiv.org/pdf/1802.10279.pdf

  3. PubMedQA: A Dataset for Biomedical Research Question Answering https://www.aclweb.org/anthology/D19-1259.pdf

  4. emrQA: A Large Corpus for Question Answering on Electronic Medical Records https://www.aclweb.org/anthology/D18-1258.pdf

  5. QASC: A Dataset for Question Answering via Sentence Composition https://arxiv.org/pdf/1910.11473.pdf

  6. MMM: Multi-stage Multi-task Learning for Multi-choice Reading Comprehension https://arxiv.org/pdf/1910.00458.pdf

  7. SOCIAL IQA: Commonsense Reasoning about Social Interactions https://arxiv.org/pdf/1904.09728.pdf

  8. Improving Question Answering with External Knowledge https://arxiv.org/pdf/1902.00993.pdf

  9. Question Answering as Global Reasoning over Semantic Abstractions https://arxiv.org/pdf/1906.03672.pdf

  10. Automatic Question Answering for Medical MCQs: Can It Go Further than Information Retrieval? https://www.aclweb.org/anthology/R19-1049.pdf

  11. Overview of the MEDIQA 2019 Shared Task on Textual Inference, Question Entailment and Question Answering https://www.aclweb.org/anthology/W19-5039.pdf

  12. HEAD-QA: A Healthcare Dataset for Complex Reasoning https://www.aclweb.org/anthology/P19-1092.pdf

  13. Improving Retrieval-Based Question Answering with Deep Inference Models https://arxiv.org/pdf/1812.02971.pdf

  14. Explain Yourself! Leveraging Language Models for Commonsense Reasoning https://arxiv.org/pdf/1906.02361.pdf

  15. GenNet : Reading Comprehension with Multiple Choice Questions using Generation and Selection model https://arxiv.org/abs/2003.04360

  16. Rethinking the Value of Transformer Components https://www.aclweb.org/anthology/2020.coling-main.529.pdf

  17. A Corpus for Evidence Based Medicine Summarisation https://www.aclweb.org/anthology/U10-1012.pdf

  18. An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-015-0564-6

  19. Question-Driven Summarization of Answers to Consumer Health Questions https://arxiv.org/pdf/2005.09067.pdf

  20. A QUESTION-ENTAILMENT APPROACH TO QUESTION ANSWERING https://arxiv.org/pdf/1901.08079.pdf

  21. A dataset of clinically generated visual questions and answers about radiology images https://www.nature.com/articles/sdata2018251

  22. Overview of the Medical Question Answering Task at TREC 2017 LiveQA https://lhncbc.nlm.nih.gov/system/files/pub9773.pdf

  23. A Survey of Datasets for Biomedical Question Answering Systems https://thesai.org/Downloads/Volume8No7/Paper_67-A_Survey_of_Datasets_for_Biomedical_Question.pdf

  24. Lessons from Natural Language Inference in the Clinical Domain https://arxiv.org/pdf/1808.06752.pdf

  25. Beyond SQuAD: How to Apply a Transformer QA Model to Your Data https://qa.fastforwardlabs.com/domain%20adaptation/transfer%20learning/specialized%20datasets/qa/medical%20qa/2020/07/22/QA-for-Specialized-Data.html

  26. Applying deep matching networks to Chinese medical question answering: a study and a dataset https://bmcmedinformdecismak.biomedcentral.com/articles/10.1186/s12911-019-0761-8

  27. The AI2 Reasoning Challenge (ARC) dataset http://ai2-website.s3.amazonaws.com/publications/AI2ReasoningChallenge2018.pdf

  28. Interpretation of Natural Language Rules in Conversational Machine Reading https://arxiv.org/pdf/1809.01494.pdf

  29. MCTest: A Challenge Dataset for the Open-Domain Machine Comprehension of Text https://www.aclweb.org/anthology/D13-1020.pdf

  30. The Complexity of Math Problems – Linguistic, or Computational? https://uclnlp.github.io/ai4exams/_papers/I13-1009.pdf

  31. University Entrance Examinations as a Benchmark Resource for NLP-based Problem Solving https://uclnlp.github.io/ai4exams/_papers/I13-1192.pdf

  32. Overview of Todai Robot Project and Evaluation Framework of its NLP-based Problem Solving https://uclnlp.github.io/ai4exams/_papers/todai_overview.pdf

  33. Machine Comprehension with Discourse Relations https://uclnlp.github.io/ai4exams/_papers/P15-1121.pdf

  34. Machine Comprehension with Syntax, Frames, and Semantics https://uclnlp.github.io/ai4exams/_papers/P15-2115.pdf

  35. Learning Answer-Entailing Structures for Machine Comprehension https://uclnlp.github.io/ai4exams/_papers/P15-1024.pdf

  36. A Strong Lexical Matching Method for the Machine Comprehension Test https://uclnlp.github.io/ai4exams/_papers/D15-1197.pdf

  37. Teaching Machines to Read and Comprehend https://arxiv.org/pdf/1506.03340.pdf

  38. TOWARDS AI-COMPLETE QUESTION ANSWERING : A SET OF PREREQUISITE TOY TASKS https://arxiv.org/pdf/1502.05698.pdf

  39. THE GOLDILOCKS PRINCIPLE: READING CHILDREN’S BOOKS WITH EXPLICIT MEMORY REPRESENTATIONS https://research.fb.com/wp-content/uploads/2016/11/the_goldilocks_principle_reading_children_s_books_with_explicit_memory_representations.pdf?

  40. Machine Comprehension Based on Learning to Rank https://uclnlp.github.io/ai4exams/_papers/1605.03284v2.pdf

  41. Attention-Based Convolutional Neural Network for Machine Comprehension https://uclnlp.github.io/ai4exams/_papers/1602.04341v1.pdf

  42. Dynamic Entity Representation with Max-pooling Improves Machine Reading https://uclnlp.github.io/ai4exams/_papers/N16-1099.pdf

  43. A Parallel-Hierarchical Model for Machine Comprehension on Sparse Data https://uclnlp.github.io/ai4exams/_papers/1603.08884.pdf

  44. A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task https://uclnlp.github.io/ai4exams/_papers/1606.02858v1.pdf

  45. SQuAD: 100,000+ Questions for Machine Comprehension of Text https://uclnlp.github.io/ai4exams/_papers/1606.05250v1.pdf

  46. CliCR: A Dataset of Clinical Case Reports for Machine Reading Comprehension https://arxiv.org/pdf/1803.09720.pdf

  47. CODAH: An Adversarially Authored Question-Answer Dataset for Common Sense https://arxiv.org/pdf/1904.04365.pdf

  48. CoQA: A Conversational Question Answering Challenge https://arxiv.org/pdf/1808.07042.pdf

  49. HOTPOTQA: A Dataset for Diverse, Explainable Multi-hop Question Answering https://www.aclweb.org/anthology/D18-1259.pdf

  50. MS MARCO: A Human Generated MAchine Reading COmprehension Dataset https://arxiv.org/pdf/1611.09268.pdf

  51. Looking Beyond the Surface: A Challenge Set for Reading Comprehension over Multiple Sentences https://www.aclweb.org/anthology/N18-1023.pdf

  52. Natural Questions: a Benchmark for Question Answering Research https://persagen.com/files/misc/kwiatkowski2019natural.pdf

  53. NEWSQA: A MACHINE COMPREHENSION DATASET https://arxiv.org/pdf/1611.09830.pdf

  54. Constructing Datasets for Multi-hop Reading Comprehension Across Documents https://arxiv.org/pdf/1710.06481.pdf

  55. QuAC : Question Answering in Context https://arxiv.org/pdf/1808.07036.pdf

  56. RACE: Large-scale ReAding Comprehension Dataset From Examinations https://arxiv.org/pdf/1704.04683.pdf

  57. LSDSem 2017 Shared Task: The Story Cloze Test http://aclweb.org/anthology/W17-0906.pdf

  58. SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference https://arxiv.org/abs/1808.05326

  59. RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes https://www.aclweb.org/anthology/D18-1166.pdf

  60. The NarrativeQA Reading Comprehension Challenge https://arxiv.org/pdf/1712.07040.pdf

  61. DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs https://arxiv.org/pdf/1903.00161.pdf

  62. DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension https://arxiv.org/pdf/1804.07927.pdf

  63. COSMOS QA: Machine Reading Comprehension with Contextual Commonsense Reasoning https://arxiv.org/pdf/1909.00277.pdf

  64. RECLOR: A READING COMPREHENSION DATASET REQUIRING LOGICAL REASONING https://openreview.net/pdf?id=HJgJtT4tvB

  65. DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications https://www.aclweb.org/anthology/W18-2605.pdf

  66. QUASAR: DATASETS FOR QUESTION ANSWERING BY SEARCH AND READING https://arxiv.org/pdf/1707.03904.pdf

  67. SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine https://arxiv.org/pdf/1704.05179.pdf

  68. 9th Challenge on Question Answering over Linked Data (QALD-9) http://ceur-ws.org/Vol-2241/paper-06.pdf

  69. Good for figures and other ideas https://qa.fastforwardlabs.com/domain%20adaptation/transfer%20learning/specialized%20datasets/qa/medical%20qa/2020/07/22/QA-for-Specialized-Data.html

  70. BioASQ: A Challenge on Large-Scale Biomedical Semantic Indexing and Question Answering http://bioasq.org/resources/papers/Tsatsaronis_et_al_AAAIIRKD2012_CR.pdf

  71. COVID-QA: A Question Answering Dataset for COVID-19 https://www.aclweb.org/anthology/2020.nlpcovid19-acl.18.pdf

  72. Semantic Parsing on Freebase from Question-Answer Pairs https://cs.stanford.edu/~pliang/papers/freebase-emnlp2013.pdf

  73. Large-scale Simple Question Answering with Memory Network https://arxiv.org/pdf/1506.02075.pdf

  74. Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus https://www.aclweb.org/anthology/P16-1056.pdf

  75. On Generating Characteristic-rich Question Sets for QA Evaluation https://sites.cs.ucsb.edu/~ysu/papers/emnlp16_graphquestions.pdf

  76. TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension https://www.aclweb.org/anthology/P17-1147.pdf

  77. LC-QuAD: A Corpus for Complex Question Answering over Knowledge Graphs http://jens-lehmann.org/files/2017/iswc_lcquad.pdf

  78. Know What You Don’t Know: Unanswerable Questions for SQuAD https://arxiv.org/pdf/1806.03822.pdf

  79. The Web as a Knowledge-base for Answering Complex Questions https://arxiv.org/abs/1803.06643

  80. FreebaseQA: A New Factoid QA Data Set Matching Trivia-Style Question-Answer Pairs with Freebase https://www.aclweb.org/anthology/N19-1028.pdf

  81. ComQA: A Community-sourced Dataset for Complex Factoid Question Answering with Paraphrase Clusters https://arxiv.org/pdf/1809.09528.pdf

  82. Natural Questions: A Benchmark for Question Answering Research https://www.aclweb.org/anthology/Q19-1026.pdf

  83. MEASURING COMPOSITIONAL GENERALIZATION: A COMPREHENSIVE METHOD ON REALISTIC DATA https://arxiv.org/pdf/1912.09713.pdf

  84. Large-scale Semantic Parsing via Schema Matching and Lexicon Extension https://www.aclweb.org/anthology/P13-1042.pdf

  85. WIKIQA: A Challenge Dataset for Open-Domain Question Answering https://pdfs.semanticscholar.org/8685/671ad8b1b1b5fe2b108c7002662b582ba277.pdf?_ga=2.255219788.515480105.1609930430-1235984517.1600607025&_gac=1.114743541.1607092635.Cj0KCQiA2af-BRDzARIsAIVQUOdNiV5qT_0YS1w4gKgpaTrSKxbqsipzwnWWzkdgQU7V98y7gdbJcUQaAnquEALw_wcB

  86. Good Question! Statistical Ranking for Question Generation https://www.aclweb.org/anthology/N10-1086.pdf

  87. Generating Natural Language Question-Answer Pairs from a Knowledge Graph Using a RNN Based Question Generation Model https://www.aclweb.org/anthology/E17-1036.pdf

  88. Learning to Ask: Neural Question Generation for Reading Comprehension https://arxiv.org/pdf/1705.00106.pdf

  89. Neural Question Generation from Text: A Preliminary Study https://arxiv.org/pdf/1704.01792.pdf

  90. Machine Comprehension by Text-to-Text Neural Question Generation https://arxiv.org/pdf/1705.02012.pdf

  91. Paragraph-level Neural Question Generation with Maxout Pointer and Gated Self-attention Networks https://www.aclweb.org/anthology/D18-1424.pdf

  92. Capturing Greater Context for Question Generation https://arxiv.org/pdf/1910.10274.pdf

  93. Asking Questions the Human Way: Scalable Question-Answer Generation from Text Corpus https://arxiv.org/pdf/2002.00748.pdf

  94. Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps https://arxiv.org/pdf/2011.01060.pdf

  95. WIKIREADING: A Novel Large-scale Language Understanding Task over Wikipedia https://www.aclweb.org/anthology/P16-1145.pdf

  96. MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering https://arxiv.org/pdf/2007.15207.pdf

  97. KQA Pro: A Large-Scale Dataset with Interpretable Programs and Accurate SPARQLs for Complex Question Answering over Knowledge Base https://arxiv.org/pdf/2007.03875.pdf

  98. AMBIGQA: Answering Ambiguous Open-domain Questions https://arxiv.org/pdf/2004.10645.pdf

  99. TYDI QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages https://arxiv.org/pdf/2003.05002.pdf

  100. MLQA: Evaluating Cross-lingual Extractive Question Answering https://arxiv.org/pdf/1910.07475.pdf

  101. BREAK It Down: A Question Understanding Benchmark https://arxiv.org/pdf/2001.11770v1.pdf

  102. DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering https://www.aclweb.org/anthology/2020.acl-main.411.pdf

  103. RuBQ: A Russian Dataset for Question Answering over Wikidata https://arxiv.org/pdf/2005.10659.pdf

  104. Audio Visual Scene-Aware Dialog https://arxiv.org/pdf/1901.09107.pdf

  105. XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning https://www.aclweb.org/anthology/2020.emnlp-main.185.pdf

  106. DramaQA: Character-Centered Video Story Understanding with Hierarchical QA https://arxiv.org/pdf/2005.03356.pdf

  107. MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization https://www.aclweb.org/anthology/2020.acl-main.330.pdf

  108. Event-QA: A Dataset for Event-Centric Question Answering over Knowledge Graphs https://arxiv.org/pdf/2004.11861.pdf

  109. SelQA: A New Benchmark for Selection-based Question Answering https://arxiv.org/pdf/1606.08513.pdf

  110. Search-based Neural Structured Learning for Sequential Question Answering https://people.cs.umass.edu/~miyyer/pubs/2017_acl_dynsp.pdf

  111. Compositional Semantic Parsing on Semi-Structured Tables https://arxiv.org/pdf/1508.00305.pdf

  112. Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph aaai.org/ocs/index.php/AAAI/AAAI18/paper/viewFile/17181/15750

  113. HybridQA: A Dataset of Multi-Hop Question Answering over Tabular and Textual Data https://www.aclweb.org/anthology/2020.findings-emnlp.91.pdf

  114. Variational Reasoning for Question Answering with Knowledge Graph https://arxiv.org/pdf/1709.04071.pdf

  115. MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms https://arxiv.org/pdf/1905.13319.pdf

  116. A Span-Extraction Dataset for Chinese Machine Reading Comprehension https://www.aclweb.org/anthology/D19-1600.pdf

  117. DRCD: a Chinese Machine Reading Comprehension Dataset https://arxiv.org/ftp/arxiv/papers/1806/1806.00920.pdf

  118. KorQuAD1.0: Korean QA Dataset for Machine Reading Comprehension https://arxiv.org/pdf/1909.07005.pdf

  119. SberQuAD – Russian Reading Comprehension Dataset: Description and Analysis https://arxiv.org/pdf/1912.09723.pdf

  120. FQuAD: French Question Answering Dataset https://www.aclweb.org/anthology/2020.findings-emnlp.107.pdf

  121. On the Cross-lingual Transferability of Monolingual Representations https://arxiv.org/pdf/1910.11856.pdf

  122. Neural Arabic Question Answering https://arxiv.org/pdf/1906.05394.pdf

  123. LC-QuAD 2.0: A Large Dataset for Complex Question Answering over Wikidata and DBpedia http://jens-lehmann.org/files/2019/iswc_lcquad2.pdf

  124. Deep Bidirectional Transformers for Italian Question Answering http://ceur-ws.org/Vol-2481/paper25.pdf

  125. ELI5: Long Form Question Answering https://arxiv.org/pdf/1907.09190.pdf

  126. QUOREF: A Reading Comprehension Dataset with Questions Requiring Coreferential Reasoning https://www.aclweb.org/anthology/D19-1606.pdf

  127. CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning http://vision.stanford.edu/pdf/johnson2017cvpr.pdf

  128. A Corpus for Reasoning About Natural Language Grounded in Photographs https://arxiv.org/pdf/1811.00491.pdf

  129. FVQA: Fact-based Visual Question Answering https://arxiv.org/pdf/1606.05433.pdf

  130. APPLYING DEEP LEARNING TO ANSWER SELECTION: A STUDY AND AN OPEN TASK https://arxiv.org/pdf/1508.01585.pdf

  131. PIQA: Reasoning about Physical Commonsense in Natural Language https://arxiv.org/pdf/1911.11641.pdf

  132. Large-Scale QA-SRL Parsing https://arxiv.org/pdf/1805.05377v1.pdf

  133. Zero-Shot Relation Extraction via Reading Comprehension https://arxiv.org/pdf/1706.04115.pdf

  134. Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning https://arxiv.org/pdf/1703.06585.pdf

  135. ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering https://arxiv.org/pdf/1906.02467.pdf

  136. Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge https://arxiv.org/pdf/1803.05457.pdf

  137. Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems https://arxiv.org/pdf/1705.04146.pdf

  138. BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions https://arxiv.org/pdf/1905.10044.pdf

  139. CODAH: An Adversarially-Authored Question Answering Dataset for Common Sense https://arxiv.org/pdf/1904.04365.pdf

  140. COMMONSENSEQA: A Question Answering Challenge Targeting Commonsense Knowledge https://arxiv.org/pdf/1811.00937.pdf

  141. DVQA: Understanding Data Visualizations via Question Answering https://arxiv.org/pdf/1801.08163.pdf

  142. What’s in an Explanation? Characterizing Knowledge and Inference Requirements for Elementary Science Exams https://www.aclweb.org/anthology/C16-1278.pdf

  143. GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering https://arxiv.org/pdf/1902.09506.pdf

  144. DIALOGUE LEARNING WITH HUMAN-IN-THE-LOOP https://arxiv.org/pdf/1611.09823.pdf

  145. Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering https://www.aclweb.org/anthology/D18-1260.pdf

  146. Tracking State Changes in Procedural Text: A Challenge Dataset and Models for Process Paragraph Comprehension https://arxiv.org/pdf/1805.06975.pdf

  147. QUAREL: A Dataset and Models for Answering Questions about Qualitative Relationships https://arxiv.org/pdf/1811.08048.pdf

  148. QUARTZ: An Open-Domain Dataset of Qualitative Relationship Questions https://www.aclweb.org/anthology/D19-1608.pdf

  149. ReCoRD: Bridging the Gap between Human and Machine Commonsense Reading Comprehension https://arxiv.org/pdf/1810.12885.pdf

  150. Crowdsourcing Multiple Choice Science Questions https://arxiv.org/pdf/1707.06209.pdf

  151. SemEval-2016 Task 3: Community Question Answering https://www.aclweb.org/anthology/S16-1083.pdf

  152. Swag: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference https://www.aclweb.org/anthology/D18-1009.pdf

  153. Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence https://openaccess.thecvf.com/content_CVPR_2019/papers/Zadeh_Social-IQ_A_Question_Answering_Benchmark_for_Artificial_Social_Intelligence_CVPR_2019_paper.pdf

  154. Are You Smarter Than A Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension http://ai2-website.s3.amazonaws.com/publications/CVPR17_TQA.pdf

  155. Towards VQA Models That Can Read https://arxiv.org/pdf/1904.08920.pdf

  156. Dialog-based Language Learning https://arxiv.org/pdf/1604.06045.pdf

  157. DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset https://www.aclweb.org/anthology/I17-1099.pdf

  158. Large-scale Simple Question Answering with Memory Networks https://arxiv.org/pdf/1506.02075.pdf

  159. Key-Value Memory Networks for Directly Reading Documents https://arxiv.org/pdf/1606.03126.pdf

  160. What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA https://www.aclweb.org/anthology/D07-1003.pdf

  161. From Recognition to Cognition: Visual Commonsense Reasoning https://arxiv.org/pdf/1811.10830.pdf

  162. VQA: Visual Question Answering https://arxiv.org/pdf/1505.00468.pdf

  163. Who did What: A Large-Scale Person-Centered Cloze Dataset https://arxiv.org/pdf/1608.05457.pdf

  164. HOVER: A Dataset for Many-Hop Fact Extraction And Claim Verification https://www.aclweb.org/anthology/2020.findings-emnlp.309.pdf

  165. IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding https://arxiv.org/pdf/2009.05387.pdf

  166. LAReQA: Language-agnostic answer retrieval from a multilingual pool https://arxiv.org/pdf/2004.05484.pdf

  167. SUBJQA: A Dataset for Subjectivity and Review Comprehension https://www.aclweb.org/anthology/2020.emnlp-main.442.pdf

  168. EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering https://www.aclweb.org/anthology/2020.emnlp-main.438.pdf

  169. Open-Retrieval Conversational Question Answering https://arxiv.org/pdf/2005.11364.pdf

  170. Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases https://arxiv.org/pdf/2011.07743.pdf

  171. RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark https://arxiv.org/pdf/2010.15925.pdf

  172. OPEN QUESTION ANSWERING OVER TABLES AND TEXT https://arxiv.org/pdf/2010.10439.pdf

  173. LiveQA: A Question Answering Dataset over Sports Live https://arxiv.org/pdf/2010.00526.pdf

  174. Inquisitive Question Generation for High Level Text Comprehension https://arxiv.org/pdf/2010.01657.pdf

  175. Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension https://arxiv.org/pdf/1904.09679.pdf

  176. LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical Reasoning https://arxiv.org/pdf/2007.08124.pdf

  177. QED: A Framework and Dataset for Explanations in Question Answering https://arxiv.org/pdf/2009.06354.pdf

  178. ConvAI3: Generating Clarifying Questions for Open-Domain Dialogue Systems (ClariQ) https://arxiv.org/pdf/2009.11352.pdf

  179. Visual Genome https://visualgenome.org/static/paper/Visual_Genome.pdf

  180. VisualCOMET: Reasoning about the Dynamic Context of a Still Image https://arxiv.org/pdf/2004.10796.pdf

  181. ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET https://arxiv.org/pdf/1808.02280.pdf

  182. Enhancing Lexical-Based Approach with External Knowledge for Vietnamese Multiple-Choice Machine Reading Comprehension https://arxiv.org/pdf/2001.05687.pdf

  183. FIGUREQA: AN ANNOTATED FIGURE DATASET FOR VISUAL REASONING https://arxiv.org/pdf/1710.07300.pdf

  184. TVQA: Localized, Compositional Video Question Answering https://arxiv.org/pdf/1809.01696.pdf

  185. MovieQA: Understanding Stories in Movies through Question-Answering https://arxiv.org/pdf/1512.02902.pdf

  186. TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering https://arxiv.org/pdf/1704.04497.pdf

  187. SQuINTing at VQA Models: Introspecting VQA Models with Sub-Questions https://www.microsoft.com/en-us/research/uploads/prod/2020/06/SQuINT_CVPR.pdf

  188. Open Domain Web Keyphrase Extraction Beyond Language Modeling https://arxiv.org/pdf/1911.02671.pdf

  189. DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension https://arxiv.org/pdf/1902.00164.pdf

  190. STARC: Structured Annotations for Reading Comprehension https://arxiv.org/pdf/2004.14797.pdf

  191. ANTIQUE: A Non-Factoid Question Answering Benchmark https://arxiv.org/pdf/1905.08957.pdf

  192. Quizbowl: The Case for Incremental Question Answering https://arxiv.org/pdf/1904.04792.pdf

  193. Evaluating Theory of Mind in Question Answering https://arxiv.org/pdf/1808.09352.pdf

  194. SemEval-2018 Task 5: Counting Events and Participants in the Long Tail https://www.aclweb.org/anthology/S18-1009.pdf

  195. Code-Mixed Question Answering Challenge: Crowd-sourcing Data and Techniques https://www.aclweb.org/anthology/W18-3204.pdf

  196. Modeling Ambiguity, Subjectivity, and Diverging Viewpoints in Opinion Question Answering Systems http://cseweb.ucsd.edu/~jmcauley/pdfs/icdm16c.pdf

  197. WikiPassageQA: A Benchmark Collection for Research on Non-factoid Answer Passage Retrieval https://arxiv.org/pdf/1805.03797.pdf

  198. Asking Clarifying Questions in Open-Domain Information-Seeking Conversations https://ciir-publications.cs.umass.edu/pub/web/getpdf.php?id=1339

  199. POIReviewQA: A Semantically Enriched POI Retrieval and estion Answering Dataset https://arxiv.org/pdf/1810.02802.pdf

  200. Farewell Freebase: Migrating the SimpleQuestions Dataset to DBpedia https://www.aclweb.org/anthology/C18-1178.pdf

  201. Harvesting Paragraph-Level Question-Answer Pairs from Wikipedia https://arxiv.org/pdf/1805.05942.pdf

  202. QALM: a Benchmark for Question Answering over Linked Merchant Websites Data http://ceur-ws.org/Vol-1272/paper_113.pdf

  203. Beyond Accuracy: Behavioral Testing of NLP Models with CheckList https://www.aclweb.org/anthology/2020.acl-main.442.pdf

  204. Medical Exam Question Answering with Large-Scale Reading file:///Users/monk/Downloads/16582-76893-1-PB%20(1).pdf

  205. Biomedical Question Answering with SDNet https://web.stanford.edu/class/cs224n/reports/custom/15743952.pdf

  206. On the Role of Question Summarization and Information Source Restriction in Consumer Health Question Answering https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6568117/

  207. An ontology for clinical questions about the contents of patient notes sciencedirect.com/science/article/pii/S1532046411001961

  208. Verification of the Expected Answer Type for Biomedical Question Answering https://hal.archives-ouvertes.fr/hal-01759306/document

  209. Adapting and evaluating a deep learning language model for clinical why-question answering http://shorturl.at/epvJS

  210. JEC-QA: A Legal-Domain Question Answering Dataset https://arxiv.org/pdf/1911.12011.pdf

  211. Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions https://arxiv.org/pdf/1809.03707.pdf

  212. TWEETQA: A Social Media Focused Question Answering Dataset https://www.aclweb.org/anthology/P19-1496.pdf

  213. What do Models Learn from Question Answering Datasets? https://www.aclweb.org/anthology/2020.emnlp-main.190.pdf

  214. AmazonQA: A Review-Based Question Answering Task https://arxiv.org/pdf/1908.04364.pdf

  215. Just Ask: Learning to Answer Questions from Millions of Narrated Videos https://arxiv.org/pdf/2012.00451.pdf

  216. ProtoQA: A Question Answering Dataset for Prototypical Common-Sense Reasoning https://www.aclweb.org/anthology/2020.emnlp-main.85.pdf

  217. CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering https://arxiv.org/pdf/2010.16021.pdf

  218. Retrieving and Reading : A Comprehensive Survey on Open-domain Question Answering https://arxiv.org/pdf/2101.00774.pdf

  219. How Can We Know When Language Models Know? https://arxiv.org/pdf/2012.00955.pdf

  220. Understanding Dataset Design Choices for Multi-hop Reasoning https://arxiv.org/pdf/1904.12106.pdf

  221. Cross-Lingual Transfer Learning for Question Answerin https://arxiv.org/pdf/1907.06042.pdf

  222. LEAF-QA: Locate, Encode & Attend for Figure Question Answering https://arxiv.org/pdf/1907.12861.pdf

  223. The TechQA Dataset https://arxiv.org/pdf/1911.02984.pdf

  224. MANYMODALQA: Modality Disambiguation and QA over Diverse Inputs https://arxiv.org/pdf/2001.08034.pdf

  225. Look at the First Sentence: Position Bias in Question Answering https://arxiv.org/pdf/2004.14602.pdf

  226. DoQA - Accessing Domain-Specific FAQs via Conversational QA https://arxiv.org/pdf/2005.01328.pdf

  227. MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models https://arxiv.org/pdf/2005.02507.pdf

  228. PolicyQA: A Reading Comprehension Dataset for Privacy Policies https://www.aclweb.org/anthology/2020.findings-emnlp.66.pdf

  229. GoEmotions: A Dataset of Fine-Grained Emotions https://www.aclweb.org/anthology/2020.acl-main.372.pdf

  230. Revisiting the Open-Domain Question Answering Pipeline https://arxiv.org/pdf/2009.00914.pdf

  231. When in Doubt, Ask: Generating Answerable and Unanswerable Questions, Unsupervised https://arxiv.org/pdf/2010.01611.pdf

  232. A question-answering system for aircraft pilots’ documentation https://arxiv.org/pdf/2011.13284.pdf

  233. FORECASTQA: A Question Answering Challenge for Event Forecasting https://arxiv.org/pdf/2005.00792.pdf

  234. A Study on Efficiency, Accuracy and Document Structure for Answer Sentence Selection https://arxiv.org/pdf/2003.02349.pdf

  235. Ask Me Anything: Dynamic Memory Networks for Natural Language Processing https://arxiv.org/pdf/1506.07285.pdf

  236. AN EFFICIENT PASSAGE RANKING TECHNIQUE FOR A QA SYSTEM https://pdfs.semanticscholar.org/ec9b/120c4022cebcf8b02e76e56eb00406466ba7.pdf?_ga=2.134633397.515480105.1609930430-1235984517.1600607025&_gac=1.225407464.1607092635.Cj0KCQiA2af-BRDzARIsAIVQUOdNiV5qT_0YS1w4gKgpaTrSKxbqsipzwnWWzkdgQU7V98y7gdbJcUQaAnquEALw_wcB

  237. QA-It: Classifying Non-Referential It for Question Answer Pairs https://pdfs.semanticscholar.org/2edb/6575fda3a18db72b34a99702886826cd7093.pdf?_ga=2.134633397.515480105.1609930430-1235984517.1600607025&_gac=1.225407464.1607092635.Cj0KCQiA2af-BRDzARIsAIVQUOdNiV5qT_0YS1w4gKgpaTrSKxbqsipzwnWWzkdgQU7V98y7gdbJcUQaAnquEALw_wcB

  238. POIReviewQA: A Semantically Enriched POI Retrieval and Question Answering Dataset https://dl.acm.org/doi/pdf/10.1145/3281354.3281359

  239. A Systematic Classification of Knowledge, Reasoning, and Context within the ARC Dataset https://www.aclweb.org/anthology/W18-2607.pdf

  240. How You Ask Matters: The Effect of Paraphrastic Questions to BERT Performance on a Clinical SQuAD Dataset https://www.aclweb.org/anthology/2020.clinicalnlp-1.13.pdf

  241. Blindfold Baselines for Embodied QA https://arxiv.org/pdf/1811.05013.pdf

  242. DeepStory: Video Story QA by Deep Embedded Memory Networks https://arxiv.org/pdf/1707.00836.pdf

  243. ScholarlyRead: A New Dataset for Scientific Article Reading Comprehension https://pdfs.semanticscholar.org/ce06/1efbc055c8d1fe491b19161e935254a424c5.pdf?_ga=2.53360748.515480105.1609930430-1235984517.1600607025&_gac=1.180171990.1607092635.Cj0KCQiA2af-BRDzARIsAIVQUOdNiV5qT_0YS1w4gKgpaTrSKxbqsipzwnWWzkdgQU7V98y7gdbJcUQaAnquEALw_wcB

  244. ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion https://arxiv.org/pdf/2006.12146.pdf

  245. Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset https://arxiv.org/pdf/2005.00574.pdf

  246. XOR QA: Cross-lingual Open-Retrieval Question Answering https://arxiv.org/pdf/2010.11856.pdf

  247. FriendsQA: Open-Domain Question Answering on TV Show Transcripts https://pdfs.semanticscholar.org/dbee/2139c68932de618ad1d01844fdb0e90ada5b.pdf?_ga=2.28728184.515480105.1609930430-1235984517.1600607025&_gac=1.215503205.1607092635.Cj0KCQiA2af-BRDzARIsAIVQUOdNiV5qT_0YS1w4gKgpaTrSKxbqsipzwnWWzkdgQU7V98y7gdbJcUQaAnquEALw_wcB

  248. Frustratingly Easy Natural Question Answering https://arxiv.org/pdf/1909.05286.pdf

  249. ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering https://arxiv.org/pdf/1511.05960.pdf

  250. FlowDelta: Modeling Flow Information Gain in Reasoning for Conversational Machine Comprehension https://arxiv.org/pdf/1908.05117.pdf

  251. AnswerFact: Fact Checking in Product Question Answering∗ https://pdfs.semanticscholar.org/4c61/df1b4b9a164fec1a34587b4fffae029cd18c.pdf?_ga=2.58593390.515480105.1609930430-1235984517.1600607025&_gac=1.180401621.1607092635.Cj0KCQiA2af-BRDzARIsAIVQUOdNiV5qT_0YS1w4gKgpaTrSKxbqsipzwnWWzkdgQU7V98y7gdbJcUQaAnquEALw_wcB

  252. Matching Questions and Answers in Dialogues from Online Forums https://arxiv.org/pdf/2005.09276.pdf

  253. Question Answering with Long Multiple-Span Answers https://pdfs.semanticscholar.org/4e76/7b191683dca0ea6d33216508159886144da4.pdf?_ga=2.58593390.515480105.1609930430-1235984517.1600607025&_gac=1.180401621.1607092635.Cj0KCQiA2af-BRDzARIsAIVQUOdNiV5qT_0YS1w4gKgpaTrSKxbqsipzwnWWzkdgQU7V98y7gdbJcUQaAnquEALw_wcB

  254. Measuring Emotions in the COVID-19 Real World Worry Dataset https://www.aclweb.org/anthology/2020.nlpcovid19-acl.11.pdf

  255. What Are People Asking About COVID-19? A Question Classification Dataset https://www.aclweb.org/anthology/2020.nlpcovid19-acl.8.pdf

  256. MultiWOZ 2.2 : A Dialogue Dataset with Additional Annotation Corrections and State Tracking Baselines https://www.aclweb.org/anthology/2020.nlp4convai-1.13.pdf

  257. Developing a How-to Tip Machine Comprehension Dataset and its Evaluation in Machine Comprehension by BERT https://www.aclweb.org/anthology/2020.fever-1.4.pdf

  258. BIOMRC: A Dataset for Biomedical Machine Reading Comprehension https://www.aclweb.org/anthology/2020.bionlp-1.15.pdf

  259. CIMA: A Large Open Access Dialogue Dataset for Tutoring https://www.aclweb.org/anthology/2020.bea-1.5.pdf

  260. Building a Japanese Typo Dataset from Wikipedia’s Revision History https://www.aclweb.org/anthology/2020.acl-srw.31.pdf

  261. SCIREX: A Challenge Dataset for Document-Level Information Extraction https://www.aclweb.org/anthology/2020.acl-main.670.pdf

  262. ClarQ: A large-scale and diverse dataset for Clarification Question Generation https://www.aclweb.org/anthology/2020.acl-main.651.pdf

  263. SCDE: Sentence Cloze Dataset with High Quality Distractors From Examinations https://www.aclweb.org/anthology/2020.acl-main.502.pdf

  264. Storytelling with Dialogue: A Critical Role Dungeons and Dragons Dataset https://www.aclweb.org/anthology/2020.acl-main.459.pdf

  265. Fatality Killed the Cat or: BabelPic, a Multimodal Dataset for Non-Concrete Concepts https://www.aclweb.org/anthology/2020.acl-main.425.pdf

  266. CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotations of Modality https://www.aclweb.org/anthology/2020.acl-main.343.pdf

  267. MIND: A Large-scale Dataset for News Recommendation https://www.aclweb.org/anthology/2020.acl-main.331.pdf

  268. Will-They-Won’t-They: A Very Large Dataset for Stance Detection on Twitter https://www.aclweb.org/anthology/2020.acl-main.157.pdf

  269. MuTual: A Dataset for Multi-Turn Dialogue Reasoning https://www.aclweb.org/anthology/2020.acl-main.130.pdf

  270. Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking Datasets https://www.aclweb.org/anthology/W19-5006.pdf

  271. A distantly supervised dataset for automated data extraction from diagnostic studies https://www.aclweb.org/anthology/W19-5012.pdf

  272. A High-Quality Multilingual Dataset for Structured Documentation Translation https://www.aclweb.org/anthology/W19-5212.pdf

  273. MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations https://www.aclweb.org/anthology/P19-1050.pdf

  274. DocRED: A Large-Scale Document-Level Relation Extraction Dataset https://www.aclweb.org/anthology/P19-1074.pdf

  275. ChID: A Large-scale Chinese IDiom Dataset for Cloze Test https://www.aclweb.org/anthology/P19-1075.pdf

  276. Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model https://www.aclweb.org/anthology/P19-1102.pdf

  277. The PhotoBook Dataset: Building Common Ground through Visually-Grounded Dialogue https://www.aclweb.org/anthology/P19-1184.pdf

  278. TALKSUMM: A Dataset and Scalable Annotation Method for Scientific Paper Summarization Based on Conference Talks https://www.aclweb.org/anthology/P19-1204.pdf

  279. BIGPATENT: A Large-Scale Dataset for Abstractive and Coherent Summarization https://www.aclweb.org/anthology/P19-1212.pdf

  280. XQA: A Cross-lingual Open-domain Question Answering Dataset https://www.aclweb.org/anthology/P19-1227.pdf

  281. Dataset Creation for Ranking Constructive News Comments https://www.aclweb.org/anthology/P19-1250.pdf

  282. Aiming beyond the Obvious: Identifying Non-Obvious Cases in Semantic Similarity Datasets https://www.aclweb.org/anthology/P19-1268.pdf

  283. CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech https://www.aclweb.org/anthology/P19-1271.pdf

  284. Large Dataset and Language Model Fun-Tuning for Humor Recognition https://www.aclweb.org/anthology/P19-1394.pdf

  285. Selection Bias Explorations and Debias Methods for Natural Language Sentence Matching Datasets https://www.aclweb.org/anthology/P19-1435.pdf

  286. NNE: A Dataset for Nested Named Entity Recognition in English Newswire https://www.aclweb.org/anthology/P19-1510.pdf

  287. Towards Empathetic Open-domain Conversation Models: a New Benchmark and Dataset https://www.aclweb.org/anthology/P19-1534.pdf

  288. Active Reading Comprehension: A dataset for learning the Question-Answer Relationship strategy https://www.aclweb.org/anthology/P19-2014.pdf

  289. L-HSAB: A Levantine Twitter Dataset for Hate Speech and Abusive Language https://www.aclweb.org/anthology/W19-3512.pdf

  290. A Dataset for Noun Compositionality Detection for a Slavic Language https://www.aclweb.org/anthology/W19-3708.pdf

  291. A Dataset for Semantic Role Labelling of Hindi-English Code-Mixed Tweets https://www.aclweb.org/anthology/W19-4020.pdf

  292. A Repository of Conversational Datasets https://www.aclweb.org/anthology/W19-4101.pdf

  293. Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning https://www.aclweb.org/anthology/P18-1238.pdf

  294. Learning Translations via Images with a Massively Multilingual Image Dataset https://www.aclweb.org/anthology/P18-1239.pdf

  295. SNAG: Spoken Narratives and Gaze Dataset https://www.aclweb.org/anthology/P18-2022.pdf

  296. Automatic Article Commenting: the Task and Dataset https://www.aclweb.org/anthology/P18-2025.pdf

  297. Predicting accuracy on large datasets from smaller pilot data https://www.aclweb.org/anthology/P18-2072.pdf

  298. MeSH-based dataset for measuring the relevance of text retrieval https://www.aclweb.org/anthology/W18-2320.pdf

  299. Systematic Error Analysis of the Stanford Question Answering Dataset https://www.aclweb.org/anthology/W18-2602.pdf

  300. Japanese Sentence Compression with a Large Training Dataset https://www.aclweb.org/anthology/P17-2044.pdf

  301. STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset https://www.aclweb.org/anthology/P17-2066.pdf

  302. “Liar, Liar Pants on Fire”: A New Benchmark Dataset for Fake News Detection https://www.aclweb.org/anthology/P17-2067.pdf

  303. The LAMBADA dataset: Word prediction requiring a broad discourse context∗ https://www.aclweb.org/anthology/P16-1144.pdf

  304. IBC-C: A Dataset for Armed Conflict Event Analysis https://www.aclweb.org/anthology/P16-2061.pdf

  305. Controlled and Balanced Dataset for Japanese Lexical Simplification https://www.aclweb.org/anthology/P16-3001.pdf

  306. A Dataset for Joint Noun–Noun Compound Bracketing and Interpretation https://www.aclweb.org/anthology/P16-3011.pdf

  307. RiSAWOZ: A Large-Scale Multi-Domain Wizard-of-Oz Dataset with Rich Semantic Annotations for Task-Oriented Dialogue Modeling https://www.aclweb.org/anthology/2020.emnlp-main.67.pdf

  308. IIRC: A Dataset of Incomplete Information Reading Comprehension Questions https://www.aclweb.org/anthology/2020.emnlp-main.86.pdf

  309. TORQUE: A Reading Comprehension Dataset of Temporal Ordering Questions https://www.aclweb.org/anthology/2020.emnlp-main.88.pdf

  310. ToTTo: A Controlled Table-To-Text Generation Dataset https://www.aclweb.org/anthology/2020.emnlp-main.89.pdf

  311. Hashtags, Emotions, and Comments: A Large-Scale Dataset to Understand Fine-Grained Social Emotions to Online Topics https://www.aclweb.org/anthology/2020.emnlp-main.106.pdf

  312. MAVEN: A Massive General Domain Event Detection Dataset https://www.aclweb.org/anthology/2020.emnlp-main.129.pdf

  313. CMU-MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French https://www.aclweb.org/anthology/2020.emnlp-main.141.pdf

  314. CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models https://www.aclweb.org/anthology/2020.emnlp-main.154.pdf

  315. A Method for Building a Commonsense Inference Dataset based on Basic Events https://www.aclweb.org/anthology/2020.emnlp-main.192.pdf

  316. Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets https://www.aclweb.org/anthology/2020.emnlp-main.199.pdf

  317. Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation https://www.aclweb.org/anthology/2020.emnlp-main.207.pdf

  318. TED-CDB: A Large-Scale Chinese Discourse Relation Dataset on TED Talks https://www.aclweb.org/anthology/2020.emnlp-main.223.pdf

  319. A Visually-grounded First-person Dialogue Dataset with Verbal and Non-verbal Responses https://www.aclweb.org/anthology/2020.emnlp-main.267.pdf

  320. X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset https://www.aclweb.org/anthology/2020.emnlp-main.321.pdf

  321. CLIRMatrix: A massively large collection of bilingual and multilingual datasets for Cross-Lingual Information Retrieval https://www.aclweb.org/anthology/2020.emnlp-main.340.pdf

  322. Introducing a New Dataset for Event Detection in Cybersecurity Texts https://www.aclweb.org/anthology/2020.emnlp-main.433.pdf

  323. XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation https://www.aclweb.org/anthology/2020.emnlp-main.484.pdf

  324. Interpretable Multi-dataset Evaluation for Named Entity Recognition https://www.aclweb.org/anthology/2020.emnlp-main.489.pdf

  325. A Dataset for Tracking Entities in Open Domain Procedural Text https://www.aclweb.org/anthology/2020.emnlp-main.520.pdf

  326. STORIUM: A Dataset and Evaluation Platform for Machine-in-the-Loop Story Generation https://www.aclweb.org/anthology/2020.emnlp-main.525.pdf

  327. MOCHA: A Dataset for Training and Evaluating Generative Reading Comprehension Metrics https://www.aclweb.org/anthology/2020.emnlp-main.528.pdf

  328. DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset https://www.aclweb.org/anthology/2020.emnlp-main.562.pdf

  329. Textual Data Augmentation for Efficient Active Learning on Tiny Datasets https://www.aclweb.org/anthology/2020.emnlp-main.600.pdf

  330. PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge https://www.aclweb.org/anthology/2020.emnlp-main.611.pdf

  331. Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles https://www.aclweb.org/anthology/2020.emnlp-main.648.pdf

  332. Intrinsic Evaluation of Summarization Datasets https://www.aclweb.org/anthology/2020.emnlp-main.649.pdf

  333. doc2dial: A Goal-Oriented Document-Grounded Dialogue Dataset https://www.aclweb.org/anthology/2020.emnlp-main.652.pdf

  334. Information Seeking in the Spirit of Learning: A Dataset for Conversational Curiosity https://www.aclweb.org/anthology/2020.emnlp-main.655.pdf

  335. CANCEREMO : A Dataset for Fine-Grained Emotion Detection https://www.aclweb.org/anthology/2020.emnlp-main.715.pdf

  336. Zero-Shot Stance Detection: A Dataset and Model using Generalized Topic Representations https://www.aclweb.org/anthology/2020.emnlp-main.717.pdf

  337. MedDialog: Large-scale Medical Dialogue Datasets https://www.aclweb.org/anthology/2020.emnlp-main.743.pdf

  338. Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics https://www.aclweb.org/anthology/2020.emnlp-main.746.pdf

  339. NeuralQA: A Usable Library for Question Answering (Contextual Query Expansion + BERT) on Large Datasets https://www.aclweb.org/anthology/2020.emnlp-demos.3.pdf

  340. In Data We Trust: A Critical Analysis of Hate Speech Detection Datasets https://www.aclweb.org/anthology/2020.alw-1.18.pdf

  341. On Cross-Dataset Generalization in Automatic Detection of Online Abuse https://www.aclweb.org/anthology/2020.alw-1.20.pdf

  342. MeDAL: Medical Abbreviation Disambiguation Dataset for Natural Language Understanding Pretraining https://www.aclweb.org/anthology/2020.clinicalnlp-1.15.pdf

  343. Multilingual Argument Mining: Datasets and Analysis https://www.aclweb.org/anthology/2020.findings-emnlp.29.pdf

  344. The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification https://www.aclweb.org/anthology/2020.findings-emnlp.32.pdf

  345. KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding https://www.aclweb.org/anthology/2020.findings-emnlp.39.pdf

  346. LiMiT: The Literal Motion in Text Dataset https://www.aclweb.org/anthology/2020.findings-emnlp.88.pdf

  347. MedICaT: A Dataset of Medical Images, Captions, and Textual References https://www.aclweb.org/anthology/2020.findings-emnlp.191.pdf

  348. Reinforcement Learning with Imbalanced Dataset for Data-to-Text Medical Report Generation https://www.aclweb.org/anthology/2020.findings-emnlp.202.pdf

  349. Learning to Model and Ignore Dataset Bias with Mixed Capacity Ensembles https://www.aclweb.org/anthology/2020.findings-emnlp.272.pdf

  350. CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems https://www.aclweb.org/anthology/2020.findings-emnlp.329.pdf

  351. Modeling Preconditions in Text with a Crowd-sourced Dataset https://www.aclweb.org/anthology/2020.findings-emnlp.340.pdf

  352. WikiLingua: A New Benchmark Dataset for Cross-Lingual Abstractive Summarization https://www.aclweb.org/anthology/2020.findings-emnlp.360.pdf

  353. STANDER: An Expert-Annotated Dataset for News Stance Detection and Evidence Retrieval https://www.aclweb.org/anthology/2020.findings-emnlp.365.pdf

  354. Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder https://www.aclweb.org/anthology/2020.findings-emnlp.373.pdf

  355. #Turki$hTweets: A Benchmark Dataset for Turkish Text Correction https://www.aclweb.org/anthology/2020.findings-emnlp.374.pdf

  356. Contract Discovery: Dataset and a Few-Shot Semantic Retrieval Challenge with Competitive Baselines https://www.aclweb.org/anthology/2020.findings-emnlp.380.pdf

  357. Automatic Term Name Generation for Gene Ontology: Task and Dataset https://www.aclweb.org/anthology/2020.findings-emnlp.422.pdf

  358. Finding Friends and Flipping Frenemies: Automatic Paraphrase Dataset Augmentation Using Graph Theory https://www.aclweb.org/anthology/2020.findings-emnlp.426.pdf

  359. Weibo-COV: A Large-Scale COVID-19 Social Media Dataset from Weibo https://www.aclweb.org/anthology/2020.nlpcovid19-2.34.pdf

  360. Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset https://www.aclweb.org/anthology/2020.sdp-1.5.pdf

  361. Do We Need to Create Big Datasets to Learn a Task? https://www.aclweb.org/anthology/2020.sustainlp-1.23.pdf

  362. Determining Question-Answer Plausibility in Crowdsourced Datasets Using Multi-Task Learning https://www.aclweb.org/anthology/2020.wnut-1.4.pdf

  363. Civil Unrest on Twitter (CUT): A Dataset of Tweets to Support Research on Civil Unrest https://www.aclweb.org/anthology/2020.wnut-1.28.pdf

  364. Towards Medical Machine Reading Comprehension with Structural Knowledge and Plain Text https://www.aclweb.org/anthology/2020.emnlp-main.111.pdf

  365. Querying Across Genres for Medical Claims in News https://www.aclweb.org/anthology/2020.emnlp-main.139.pdf

  366. Biomedical Event Extraction with Hierarchical Knowledge Graphs https://www.aclweb.org/anthology/2020.findings-emnlp.114.pdf

  367. Characterizing the Value of Information in Medical Notes https://www.aclweb.org/anthology/2020.findings-emnlp.187.pdf

  368. Generating Accurate Electronic Health Assessment from Medical Graph https://www.aclweb.org/anthology/2020.findings-emnlp.336.pdf

  369. KnowledgeNet: A Benchmark Dataset for Knowledge Base Population https://www.aclweb.org/anthology/D19-1069.pdf

  370. GGPONC: A Corpus of German Medical Text with Rich Metadata Based on Clinical Practice Guidelines https://www.aclweb.org/anthology/2020.louhi-1.5.pdf

  371. (Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Annotated Stylistic Language Dataset with Multiple Personas https://www.aclweb.org/anthology/D19-1179.pdf

  372. CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases https://www.aclweb.org/anthology/D19-1204.pdf

  373. UR-FUNNY: A Multimodal Language Dataset for Understanding Humor https://www.aclweb.org/anthology/D19-1211.pdf

  374. BiPaR: A Bilingual Parallel Dataset for Multilingual and Cross-lingual Reading Comprehension on Novels https://www.aclweb.org/anthology/D19-1249.pdf

  375. PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification https://www.aclweb.org/anthology/D19-1382.pdf

  376. Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach https://www.aclweb.org/anthology/D19-1404.pdf

  377. MultiFC: A Real-World Multi-Domain Dataset for Evidence-Based Fact Checking of Claims https://www.aclweb.org/anthology/D19-1475.pdf

  378. A Benchmark Dataset for Learning to Intervene in Online Hate Speech https://www.aclweb.org/anthology/D19-1482.pdf

  379. YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension https://www.aclweb.org/anthology/D19-1517.pdf

  380. JuICe: A Large Scale Distantly Supervised Dataset for Open Domain Context-based Code Generation https://www.aclweb.org/anthology/D19-1546.pdf

  381. A Dataset of General-Purpose Rebuttal https://www.aclweb.org/anthology/D19-1561.pdf

  382. Automatic Argument Quality Assessment - New Datasets and Methods https://www.aclweb.org/anthology/D19-1564.pdf

  383. Do Nuclear Submarines Have Nuclear Captains? A Challenge Dataset for Commonsense Reasoning over Adjectives and Objects https://www.aclweb.org/anthology/D19-1625.pdf

  384. WIQA: A dataset for "What if..." reasoning over procedural text https://www.aclweb.org/anthology/D19-1629.pdf

  385. The FLORES Evaluation Datasets for Low-Resource Machine Translation: Nepali–English and Sinhala–English https://www.aclweb.org/anthology/D19-1632.pdf

  386. A Challenge Dataset and Effective Models for Aspect-Based Sentiment Analysis https://www.aclweb.org/anthology/D19-1654.pdf

  387. DENS: A Dataset for Multi-class Emotion Analysis https://www.aclweb.org/anthology/D19-1656.pdf

  388. RUN through the Streets: A New Dataset and Baseline Models for Realistic Urban Navigation https://www.aclweb.org/anthology/D19-1681.pdf

  389. Rumor Detection on Social Media: Datasets, Methods and Opportunities https://www.aclweb.org/anthology/D19-5008.pdf

  390. WAT2019: English-Hindi Translation on Hindi Visual Genome Dataset https://www.aclweb.org/anthology/D19-5224.pdf

  391. SAMSum Corpus: A Human-annotated Dialogue Dataset for Abstractive Summarization https://www.aclweb.org/anthology/D19-5409.pdf

  392. ORB: An Open Reading Benchmark for Comprehensive Evaluation of Machine Reading Comprehension https://www.aclweb.org/anthology/D19-5820.pdf

  393. A Dataset of Crowdsourced Word Sequences: Collections and Answer Aggregation for Ground Truth Creation https://www.aclweb.org/anthology/D19-5904.pdf

  394. Unlearn Dataset Bias in Natural Language Inference by Fitting the Residual https://www.aclweb.org/anthology/D19-6115.pdf

  395. Dreaddit: A Reddit Dataset for Stress Analysis in Social Media https://www.aclweb.org/anthology/D19-6213.pdf

  396. Incorporating Domain Knowledge into Medical NLI using Knowledge Graphs https://www.aclweb.org/anthology/D19-1631.pdf

  397. MedCATTrainer: A Biomedical Free Text Annotation Interface with Active Learning and Research Use Case Specific Customisation https://www.aclweb.org/anthology/D19-3024.pdf

  398. Developing a Curated Topic Model for COVID-19 Medical Research Literature https://www.aclweb.org/anthology/2020.nlpcovid19-2.30.pdf

  399. Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets https://www.aclweb.org/anthology/D19-1107.pdf

  400. Towards Extracting Medical Family History from Natural Language Interactions: A New Dataset and Baselines https://www.aclweb.org/anthology/D19-1122.pdf

  401. Biomedical Event Extraction as Multi-turn Question Answering https://www.aclweb.org/anthology/2020.louhi-1.10.pdf

  402. PreCo: A Large-scale Dataset in Preschool Vocabulary for Coreference Resolution https://www.aclweb.org/anthology/D18-1016.pdf

  403. A Dataset for Document Grounded Conversations https://www.aclweb.org/anthology/D18-1076.pdf

  404. A Dataset for Telling the Stories of Social Media Videos https://www.aclweb.org/anthology/D18-1117.pdf

  405. A dataset and baselines for sequential open-domain question answering https://www.aclweb.org/anthology/D18-1134.pdf

  406. CARD-660: Cambridge Rare Word Dataset – a Reliable Benchmark for Infrequent Word Representation Models https://www.aclweb.org/anthology/D18-1169.pdf

  407. Large-scale Cloze Test Dataset Created by Teachers https://www.aclweb.org/anthology/D18-1257.pdf

  408. Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task https://www.aclweb.org/anthology/D18-1425.pdf

  409. FewRel: A Large-Scale Supervised Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation https://www.aclweb.org/anthology/D18-1514.pdf

  410. MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling https://www.aclweb.org/anthology/D18-1547.pdf

  411. KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos https://www.aclweb.org/anthology/D18-2016.pdf

  412. APLenty: annotation tool for creating high-quality datasets using active and proactive learning https://www.aclweb.org/anthology/D18-2019.pdf

  413. Hate Speech Dataset from a White Supremacy Forum https://www.aclweb.org/anthology/W18-5102.pdf

  414. Creating a WhatsApp Dataset to Study Pre-teen Cyberbullying https://www.aclweb.org/anthology/W18-5107.pdf

  415. Datasets of Slovene and Croatian Moderated News Comments https://www.aclweb.org/anthology/W18-5116.pdf

  416. Semantic role labeling tools for biomedical question answering: a study of selected tools on the BioASQ datasets https://www.aclweb.org/anthology/W18-5302.pdf

  417. An Adaption of BIOASQ Question Answering dataset for Machine Reading systems by Manual Annotations of Answer Spans https://www.aclweb.org/anthology/W18-5309.pdf

  418. UNCC QA: A Biomedical Question Answering System https://www.aclweb.org/anthology/W18-5308.pdf