diff --git a/datasets/circa/README.md b/datasets/circa/README.md index 436847d57bd..24db742b58a 100644 --- a/datasets/circa/README.md +++ b/datasets/circa/README.md @@ -20,7 +20,7 @@ task_ids: - text-classification-other-question-answer-pair-classification --- -# Dataset Card Creation Guide +# Dataset Card for CIRCA ## Table of Contents - [Dataset Description](#dataset-description) diff --git a/datasets/multi_nli/README.md b/datasets/multi_nli/README.md index c5ff8b42715..21ccf4cb71f 100644 --- a/datasets/multi_nli/README.md +++ b/datasets/multi_nli/README.md @@ -23,7 +23,7 @@ task_ids: - semantic-similarity-scoring --- -# Dataset Card for "multi_nli" +# Dataset Card for Multi-Genre Natural Language Inference (MultiNLI) ## Table of Contents - [Dataset Description](#dataset-description) @@ -127,17 +127,23 @@ They constructed MultiNLI so as to make it possible to explicitly evaluate model ### Source Data +#### Initial Data Collection and Normalization + They created each sentence pair by selecting a premise sentence from a preexisting text source and asked a human annotator to compose a novel sentence to pair with it as a hypothesis. +#### Who are the source language producers? + +[More Information Needed] + ### Annotations #### Annotation process -[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards) +[More Information Needed] #### Who are the annotators? -[More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards) +[More Information Needed] ### Personal and Sensitive Information diff --git a/datasets/multi_nli_mismatch/README.md b/datasets/multi_nli_mismatch/README.md index 6485b8275fb..b0b74462164 100644 --- a/datasets/multi_nli_mismatch/README.md +++ b/datasets/multi_nli_mismatch/README.md @@ -1,7 +1,29 @@ --- +annotations_creators: +- crowdsourced +language_creators: +- crowdsourced +- found +languages: +- en +licenses: +- cc-by-3.0 +- cc-by-sa-3.0-at +- mit +- other-Open Portion of the American National Corpus +multilinguality: +- monolingual +size_categories: +- 100K