From 652ee5ef7315ddd3c4a2c89b6d35de636a19f116 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Quentin=20Gallou=C3=A9dec?= <45557362+qgallouedec@users.noreply.github.com> Date: Thu, 6 Nov 2025 15:16:57 -0700 Subject: [PATCH] Change arxiv to hg papers --- CONTRIBUTING.md | 2 +- README.md | 2 +- docs/source/dataset_card.mdx | 2 +- templates/README_guide.md | 2 +- 4 files changed, 4 insertions(+), 4 deletions(-) diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md index f1f022b6fd7..3ae44bd4efc 100644 --- a/CONTRIBUTING.md +++ b/CONTRIBUTING.md @@ -120,7 +120,7 @@ If you are a **dataset author**... you know what to do, it is your dataset after If you are a **user of a dataset**, the main source of information should be the dataset paper if it is available: we recommend pulling information from there into the relevant paragraphs of the template. We also eagerly welcome discussions on the [Considerations for Using the Data](https://github.com/huggingface/datasets/blob/main/templates/README_guide.md#considerations-for-using-the-data) based on existing scholarship or personal experience that would benefit the whole community. -Finally, if you want more information on the how and why of dataset cards, we strongly recommend reading the foundational works [Datasheets for Datasets](https://arxiv.org/abs/1803.09010) and [Data Statements for NLP](https://www.aclweb.org/anthology/Q18-1041/). +Finally, if you want more information on the how and why of dataset cards, we strongly recommend reading the foundational works [Datasheets for Datasets](https://huggingface.co/papers/1803.09010) and [Data Statements for NLP](https://www.aclweb.org/anthology/Q18-1041/). Thank you for your contribution! diff --git a/README.md b/README.md index d4162b9e761..0b70a39d098 100644 --- a/README.md +++ b/README.md @@ -136,7 +136,7 @@ If you're a dataset owner and wish to update any part of it (description, citati ## BibTeX -If you want to cite our 🤗 Datasets library, you can use our [paper](https://arxiv.org/abs/2109.02846): +If you want to cite our 🤗 Datasets library, you can use our [paper](https://huggingface.co/papers/2109.02846): ```bibtex @inproceedings{lhoest-etal-2021-datasets, diff --git a/docs/source/dataset_card.mdx b/docs/source/dataset_card.mdx index f1067697fb2..3cd77e1fc00 100644 --- a/docs/source/dataset_card.mdx +++ b/docs/source/dataset_card.mdx @@ -1,7 +1,7 @@ # Create a dataset card Each dataset should have a dataset card to promote responsible usage and inform users of any potential biases within the dataset. -This idea was inspired by the Model Cards proposed by [Mitchell, 2018](https://arxiv.org/abs/1810.03993). +This idea was inspired by the Model Cards proposed by [Mitchell, 2018](https://huggingface.co/papers/1810.03993). Dataset cards help users understand a dataset's contents, the context for using the dataset, how it was created, and any other considerations a user should be aware of. Creating a dataset card is easy and can be done in just a few steps: diff --git a/templates/README_guide.md b/templates/README_guide.md index 8be42708543..d8e7173c84f 100644 --- a/templates/README_guide.md +++ b/templates/README_guide.md @@ -163,7 +163,7 @@ Also describe in this section if the proposed dataset contains a low-resource or Provide descriptions of specific biases that are likely to be reflected in the data, and state whether any steps were taken to reduce their impact. -For Wikipedia text, see for example [Dinan et al 2020 on biases in Wikipedia (esp. Table 1)](https://arxiv.org/abs/2005.00614), or [Blodgett et al 2020](https://www.aclweb.org/anthology/2020.acl-main.485/) for a more general discussion of the topic. +For Wikipedia text, see for example [Dinan et al 2020 on biases in Wikipedia (esp. Table 1)](https://huggingface.co/papers/2005.00614), or [Blodgett et al 2020](https://www.aclweb.org/anthology/2020.acl-main.485/) for a more general discussion of the topic. If analyses have been run quantifying these biases, please add brief summaries and links to the studies here.