-
Notifications
You must be signed in to change notification settings - Fork 2.9k
Pull requests: huggingface/datasets
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add test for
as_iterable_dataset()
method in DatasetBuilder
#7629
opened Jun 19, 2025 by
ArjunJagdale
Loading…
Add
as_iterable_dataset()
method to DatasetBuilder for streaming from cached Arrow files
#7628
opened Jun 19, 2025 by
ArjunJagdale
Loading…
feat(map): reuse unchanged columns when input_columns specified to reduce disk usage (#6013)
#7626
opened Jun 19, 2025 by
ArjunJagdale
Loading…
feat: Add h5folder dataset loader for HDF5 support
#7625
opened Jun 19, 2025 by
ArjunJagdale
Loading…
Guard against duplicate builder_kwargs/config_kwargs in load_dataset_…
#7622
opened Jun 17, 2025 by
Shohail-Ismail
Loading…
fix: raise error when folder-based datasets are loaded without data_dir or data_files
#7618
opened Jun 16, 2025 by
ArjunJagdale
Loading…
7 of 9 tasks
Enhance error handling and input validation across multiple modules
#7602
opened Jun 8, 2025 by
mohiuddin-khan-shiam
Loading…
(refactor) remove redundant logic in _check_valid_index_key
#7490
opened Mar 30, 2025 by
suzyahyah
Loading…
Allow dataset row indexing with np.int types (#7423)
#7438
opened Mar 6, 2025 by
DavidRConnell
Loading…
[Audio Features - DO NOT MERGE] PoC for adding an offset+sliced reading to audio file.
#7312
opened Dec 8, 2024 by
TParcollet
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.