Skip to content

Conversation

@michel-aractingi
Copy link
Collaborator

@michel-aractingi michel-aractingi commented Sep 20, 2025

What this does

Bump hf datasets version to 4.0.0 at minimum.

HF Datasets v4 introduces some nice features for streaming and some breaking changes with respect to previous changes.

LeRobotDatasets v2.1 was not made compatible with the with the v4 release and we chose to cap the dependency. However, the recent merge of #1412, LeRobotDataset v3 is compatible with the recent versions of hf datasets thus allowing us to bump the version. Now, we can be more up to date with the latest hf datasets features without having to modify any logic in our code.

Tests

Checked that the tests for datasets pass for all v4+ releases (4.0.0, 4.1.0, 4.1.1)

Copilot AI review requested due to automatic review settings September 20, 2025 20:51
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

Upgrade the Hugging Face datasets dependency to a new major version.

  • Update dependency constraint for datasets to 4.0.0
  • Remove the old TODO comment

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

@michel-aractingi michel-aractingi self-assigned this Sep 20, 2025
@michel-aractingi michel-aractingi added dataset Issues regarding data inputs, processing, or datasets dependencies Concerns about external packages, libraries, or versioning labels Sep 20, 2025
@CarolinePascal CarolinePascal self-requested a review September 21, 2025 09:51
Copy link
Collaborator

@CarolinePascal CarolinePascal left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@michel-aractingi michel-aractingi merged commit ce3670a into main Sep 22, 2025
18 checks passed
@michel-aractingi michel-aractingi deleted the chore/bump-datasets-v4 branch September 22, 2025 08:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dataset Issues regarding data inputs, processing, or datasets dependencies Concerns about external packages, libraries, or versioning

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants