From a52788af4d5f5ff624f831b305d1a20fc1cb54a2 Mon Sep 17 00:00:00 2001
From: Simon Willison
Date: Thu, 1 Feb 2024 11:50:02 -0800
Subject: [PATCH] Fix broken data sheet link in README

Closes #106
---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 7230fc7e..b17ae80e 100644
--- a/README.md
+++ b/README.md
@@ -12,7 +12,7 @@ It was created as a training corpus for [OLMo](https://allenai.org/olmo), a lang
 
 Dolma is available for download on the HuggingFace 🤗 Hub: [`huggingface.co/datasets/allenai/dolma`](https://huggingface.co/datasets/allenai/dolma). To access Dolma, users must agree to the terms of [AI2 ImpACT License for Medium Risk Artifacts](https://allenai.org/licenses/impact-mr) on the [HuggingFace 🤗 Hub](https://huggingface.co/datasets/allenai/dolma). Once agreed you can follow the instructions [here](https://huggingface.co/datasets/allenai/dolma#download) to download the dataset.
 
-You can also read more about Dolma in [our announcement](https://blog.allenai.org/dolma-3-trillion-tokens-open-llm-corpus-9a0ff4b8da64), as well as by consulting its [data sheet](docs/assets/dolma-datasheet-v0.1.pdf).
+You can also read more about Dolma in [our announcement](https://blog.allenai.org/dolma-3-trillion-tokens-open-llm-corpus-9a0ff4b8da64), as well as by consulting its [data sheet](docs/assets/dolma-v0_1-20230819.pdf).
 
 ## Dolma Toolkit
 