diff --git a/docs/hub/_redirects.yml b/docs/hub/_redirects.yml
index b0225497a..583c0d6c9 100644
--- a/docs/hub/_redirects.yml
+++ b/docs/hub/_redirects.yml
@@ -20,3 +20,4 @@ security-two-fa: security-2fa
 repositories-recommendations: storage-limits
 xet: xet/index
 storage-backends: xet/index
+datasets-viewer: data-studio
diff --git a/docs/hub/_toctree.yml b/docs/hub/_toctree.yml
index bc90b7bc8..a8f89d34c 100644
--- a/docs/hub/_toctree.yml
+++ b/docs/hub/_toctree.yml
@@ -269,7 +269,7 @@
 title: Spark
 - local: datasets-webdataset
 title: WebDataset
- - local: datasets-viewer
+ - local: data-studio
 title: Data Studio
 sections:
 - local: datasets-viewer-configure
diff --git a/docs/hub/billing.md b/docs/hub/billing.md
index 3c85c0cc9..8523ac2ba 100644
--- a/docs/hub/billing.md
+++ b/docs/hub/billing.md
@@ -31,7 +31,7 @@ The PRO subscription unlocks essential features for serious users, including:
 - Higher tier for ZeroGPU Spaces usage
 - Ability to create ZeroGPU Spaces and use Dev Mode
 - Ability to publish Social Posts and Community Blogs
-- Leverage the [Data Studio](./datasets-viewer) on private datasets
+- Leverage the [Data Studio](./data-studio) on private datasets
 - Run and schedule serverless [CPU/ GPU Jobs](https://huggingface.co/docs/huggingface_hub/en/guides/jobs)

 View the full list of benefits at https://huggingface.co/pro then subscribe over at https://huggingface.co/subscribe/pro
diff --git a/docs/hub/datasets-adding.md b/docs/hub/datasets-adding.md
index 7e345a688..a1b3b68e0 100644
--- a/docs/hub/datasets-adding.md
+++ b/docs/hub/datasets-adding.md
@@ -101,7 +101,7 @@ For most types of datasets, **Parquet** is the recommended format due to its eff

 ### Data Studio

-The [Data Studio](./datasets-viewer) is useful to know how the data actually looks like before you download it.
+The [Data Studio](./data-studio) is useful to know how the data actually looks like before you download it.
 It is enabled by default for all public datasets.
 It is also available for private datasets owned by a [PRO user](https://huggingface.co/pricing) or a [Team or Enterprise organization](https://huggingface.co/enterprise).
 After uploading your dataset, make sure the Dataset Viewer correctly shows your data, or [Configure the Dataset Viewer](./datasets-viewer-configure).
diff --git a/docs/hub/datasets-audio.md b/docs/hub/datasets-audio.md
index af96a6edb..29a67e9f0 100644
--- a/docs/hub/datasets-audio.md
+++ b/docs/hub/datasets-audio.md
@@ -193,8 +193,8 @@ train-0000/
 └── 999.json
 ```

-Note that for user convenience and to enable the [Dataset Viewer](./datasets-viewer), every dataset hosted in the Hub is automatically converted to Parquet format up to 5GB.
-Read more about it in the [Parquet format](./datasets-viewer#access-the-parquet-files) documentation.
+Note that for user convenience and to enable the [Dataset Viewer](./data-studio), every dataset hosted in the Hub is automatically converted to Parquet format up to 5GB.
+Read more about it in the [Parquet format](./data-studio#access-the-parquet-files) documentation.

 ### Parquet format

diff --git a/docs/hub/datasets-image.md b/docs/hub/datasets-image.md
index 6b7456072..3784caaa2 100644
--- a/docs/hub/datasets-image.md
+++ b/docs/hub/datasets-image.md
@@ -191,8 +191,8 @@ train-0000/
 └── 999.json
 ```

-Note that for user convenience and to enable the [Dataset Viewer](./datasets-viewer), every dataset hosted in the Hub is automatically converted to Parquet format up to 5GB.
-Read more about it in the [Parquet format](./datasets-viewer#access-the-parquet-files) documentation.
+Note that for user convenience and to enable the [Dataset Viewer](./data-studio), every dataset hosted in the Hub is automatically converted to Parquet format up to 5GB.
+Read more about it in the [Parquet format](./data-studio#access-the-parquet-files) documentation.

 ### Parquet format

diff --git a/docs/hub/datasets-overview.md b/docs/hub/datasets-overview.md
index 7d2f8a800..2ad8c1448 100644
--- a/docs/hub/datasets-overview.md
+++ b/docs/hub/datasets-overview.md
@@ -2,7 +2,7 @@

 ## Datasets on the Hub

-The Hugging Face Hub hosts a [large number of community-curated datasets](https://huggingface.co/datasets) for a diverse range of tasks such as translation, automatic speech recognition, and image classification. Alongside the information contained in the [dataset card](./datasets-cards), many datasets, such as [GLUE](https://huggingface.co/datasets/nyu-mll/glue), include a [Dataset Viewer](./datasets-viewer) to showcase the data.
+The Hugging Face Hub hosts a [large number of community-curated datasets](https://huggingface.co/datasets) for a diverse range of tasks such as translation, automatic speech recognition, and image classification. Alongside the information contained in the [dataset card](./datasets-cards), many datasets, such as [GLUE](https://huggingface.co/datasets/nyu-mll/glue), include a [Dataset Viewer](./data-studio) to showcase the data.

 Each dataset is a [Git repository](./repositories) that contains the data required to generate splits for training, evaluation, and testing. For information on how a dataset repository is structured, refer to the [Data files Configuration page](./datasets-data-files-configuration). Following the supported repo structure will ensure that the dataset page on the Hub will have a Viewer.

diff --git a/docs/hub/datasets-video.md b/docs/hub/datasets-video.md
index 2914f7a2a..bd54f2133 100644
--- a/docs/hub/datasets-video.md
+++ b/docs/hub/datasets-video.md
@@ -190,4 +190,4 @@ train-0000/
 └── 999.json
 ```

-Note that for user convenience and to enable the [Dataset Viewer](./datasets-viewer), every dataset hosted in the Hub is automatically converted to Parquet format up to 5GB. Since videos can be quite large, the URLs to the videos are stored in the converted Parquet data without the video bytes themselves. Read more about it in the [Parquet format](./datasets-viewer#access-the-parquet-files) documentation.
+Note that for user convenience and to enable the [Dataset Viewer](./data-studio), every dataset hosted in the Hub is automatically converted to Parquet format up to 5GB. Since videos can be quite large, the URLs to the videos are stored in the converted Parquet data without the video bytes themselves. Read more about it in the [Parquet format](./data-studio#access-the-parquet-files) documentation.
diff --git a/docs/hub/datasets-viewer.md b/docs/hub/datasets-viewer.md
index 415ff5501..49a493cdb 100644
--- a/docs/hub/datasets-viewer.md
+++ b/docs/hub/datasets-viewer.md
@@ -27,7 +27,7 @@ You can search for a word in the dataset by typing it in the search bar at the t

 ## Run SQL queries on the dataset

-You can run SQL queries on the dataset in the browser using the SQL Console. This feature also leverages our [auto-conversion to Parquet](datasets-viewer#access-the-parquet-files).
+You can run SQL queries on the dataset in the browser using the SQL Console. This feature also leverages our [auto-conversion to Parquet](data-studio#access-the-parquet-files).
diff --git a/docs/hub/model-cards.md b/docs/hub/model-cards.md
index d8c737229..7f8ea166d 100644
--- a/docs/hub/model-cards.md
+++ b/docs/hub/model-cards.md
@@ -277,7 +277,7 @@ Yes, you can add custom tags to your model by adding them to the `tags` field in

 ### How can I indicate that my model is not suitable for all audiences

-You can add a `not-for-all-audience` tag to your model card metadata. When this tag is present, a message will be displayed on the model page indicating that the model is not for all audiences. Users can click through this message to view the model card.
+You can add a `not-for-all-audiences` tag to your model card metadata. When this tag is present, a message will be displayed on the model page indicating that the model is not for all audiences. Users can click through this message to view the model card.

 ### Can I write LaTeX in my model card?
