From 4abbd58f9e6eb0d1208b94e437cc8a1759b34a83 Mon Sep 17 00:00:00 2001 From: Christopher Akiki Date: Wed, 10 Sep 2025 14:27:59 +0200 Subject: [PATCH] Update README.md Add more information about the data files --- README.md | 18 ++++++++---------- 1 file changed, 8 insertions(+), 10 deletions(-) diff --git a/README.md b/README.md index 8e0d79f..c6b386a 100644 --- a/README.md +++ b/README.md @@ -2,23 +2,21 @@ You can download the full dataset behind [paperswithcode.com](https://paperswithcode.com) here: -Download links for the data dumps are: +Download links for the last available public snapshot of the data dumps are: -- [All papers with abstracts](https://huggingface.co/datasets/pwc-archive/papers-with-abstracts) -- [Links between papers and code](https://huggingface.co/datasets/pwc-archive/links-between-paper-and-code) -- [Evaluation tables](https://huggingface.co/datasets/pwc-archive/evaluation-tables) -- [Methods](https://huggingface.co/datasets/pwc-archive/methods) -- [Datasets](https://huggingface.co/datasets/pwc-archive/datasets) +- [All papers with abstracts](https://huggingface.co/datasets/pwc-archive/papers-with-abstracts) (Retrieved July 29th, 2025) +- [Links between papers and code](https://huggingface.co/datasets/pwc-archive/links-between-paper-and-code) (Retrieved July 28th, 2025) +- [Evaluation tables](https://huggingface.co/datasets/pwc-archive/evaluation-tables) (Retrieved July 28th, 2025) +- [Methods](https://huggingface.co/datasets/pwc-archive/methods) (Retrieved July 28th, 2025) +- [Datasets](https://huggingface.co/datasets/pwc-archive/datasets) (Retrieved July 28th, 2025) The last JSON is in the [sota-extractor](https://github.com/paperswithcode/sota-extractor) format and the code from there can be used to load in the JSON into a set of Python classes. -At the moment, data is regenerated daily. +At the moment, data is no longer being regenerated daily. Part of the data is coming from the sources listed in the [sota-extractor README](https://github.com/paperswithcode/sota-extractor). ## Licence -All data is licenced under [CC-BY-SA](https://creativecommons.org/licenses/by-sa/4.0/). - - +All data is licenced under [CC-BY-SA](https://creativecommons.org/licenses/by-sa/4.0/).