Offline dataset viewer #6139

Closed
yuvalkirstain opened this issue Aug 10, 2023 · 7 comments
Labels
dataset-viewer Related to the dataset viewer on huggingface.co enhancement New feature or request

Comments

@yuvalkirstain

Feature request

The dataset viewer feature is very nice. It enables the user to easily view the dataset. However, when working for private companies, we cannot always upload the dataset to the Hub. Is there a way to create a dataset viewer offline, i.e., to run code that opens some kind of HTML page or similar that makes it easy to view the dataset?

Motivation

I want to easily view my dataset even when it is hosted locally.

Your contribution

N.A.

yuvalkirstain added the "enhancement" (New feature or request) label on Aug 10, 2023
@severo
Collaborator

severo commented Aug 10, 2023

Hi, thanks for the suggestion. It's not possible at the moment. The viewer is part of the Hub codebase and only works on public datasets. Also, it relies on Datasets Server, which prepares the data and provides an API to access the rows, size, etc.
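For readers who have not used that API, here is a minimal sketch of how a client might ask Datasets Server for rows and size. The base URL, the `/rows` and `/size` endpoint names, and their query parameters are assumptions based on the publicly documented service, not details stated in this thread; the helper names are hypothetical. It only applies to datasets hosted on the Hub, so it does not solve the offline case.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: public base URL of the Datasets Server API (not given in this thread).
BASE_URL = "https://datasets-server.huggingface.co"


def rows_url(dataset: str, config: str, split: str, offset: int = 0, length: int = 100) -> str:
    """Build the URL for one page of rows (assumed /rows endpoint)."""
    query = urlencode(
        {"dataset": dataset, "config": config, "split": split, "offset": offset, "length": length}
    )
    return f"{BASE_URL}/rows?{query}"


def size_url(dataset: str) -> str:
    """Build the URL for the dataset size summary (assumed /size endpoint)."""
    return f"{BASE_URL}/size?{urlencode({'dataset': dataset})}"


def fetch_json(url: str) -> dict:
    """Perform the HTTP GET and decode the JSON body. Requires network access."""
    with urlopen(url) as resp:
        return json.load(resp)
```

Calling `fetch_json(rows_url("user/some-public-dataset", "default", "train", length=5))` would request the first five rows of a Hub dataset; the dataset name here is a placeholder.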

If you're interested in hosting your data as a private dataset on the Hub, you might want to look at huggingface/dataset-viewer#39.

@ssuwelack

Hi, we are building an offline dataset viewer: https://github.com/Renumics/spotlight
It supports many HF datasets, but currently you have to use it via Pandas:
df = ds.to_pandas()
spotlight.show(df)

Would love to hear from you if that works for your use case. If not, feel free to open an issue on the repo: https://github.com/Renumics/spotlight/issues
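A slightly fuller sketch of the snippet above, for a dataset that lives only on local disk. It assumes the dataset was previously written with `save_to_disk`, that the `datasets` and `renumics-spotlight` packages are installed, and that the data fits in memory as a DataFrame. The helper names `pick_split` and `view_offline` are hypothetical and not part of either library.

```python
from typing import Any


def pick_split(loaded: Any, split: str = "train") -> Any:
    """Return one split if `loaded` is a dict of splits, otherwise return it unchanged."""
    # datasets.DatasetDict subclasses dict, so this check covers it.
    if isinstance(loaded, dict):
        return loaded[split]
    return loaded


def view_offline(path: str, split: str = "train") -> None:
    """Load a dataset from a local directory and open it in Spotlight."""
    # Imported lazily so pick_split stays usable without these packages installed.
    from datasets import load_from_disk
    from renumics import spotlight

    ds = pick_split(load_from_disk(path), split)
    df = ds.to_pandas()  # Spotlight is used via pandas, as described above
    spotlight.show(df)   # starts the local viewer
```

For example, `view_offline("path/to/saved_dataset")` would open the `train` split; the path is a placeholder for wherever the dataset was saved.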

@yuvalkirstain
Author

@ssuwelack thank you! I will definitely try it out.

severo added the "dataset-viewer" (Related to the dataset viewer on huggingface.co) label on Sep 25, 2023
@severo
Collaborator

severo commented Sep 29, 2023

Closing for now, as developing and maintaining an offline viewer is not planned.

severo closed this as completed on Sep 29, 2023
@severo
Collaborator

severo commented Feb 13, 2024

@yuvalkirstain the dataset viewer is now available on private datasets for PRO users and Enterprise Hub orgs. Would it fit your needs?

@laubonghaudoi

Hi @ssuwelack, I tried loading an HF dataset with your viewer but got the error reported in Renumics/spotlight#461. I hope the team can help me with this. Thanks!
