
🎉 feat(storage) add split args for init_from_disk, add train_test_split for all backends #309

Merged
bstaber merged 8 commits into main from maestro_needs
Feb 17, 2026
Conversation

@casenave casenave commented Feb 4, 2026

Checklist

  • Typing enforced
  • Documentation updated
  • Changelog updated
  • Tests and Example updates
  • Coverage should be 100%

Related issues

Closes #261

@casenave casenave requested a review from a team as a code owner February 4, 2026 10:33
codecov Bot commented Feb 4, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.

@casenave casenave marked this pull request as draft February 4, 2026 10:54
@casenave casenave marked this pull request as ready for review February 4, 2026 20:35
@casenave casenave marked this pull request as draft February 8, 2026 07:54
@casenave casenave marked this pull request as ready for review February 8, 2026 08:32
@casenave casenave commented (Member, Author)

@bstaber @xroynard ready to review!

)

# Create new dataset instances with filtered IDs
train_dataset = ZarrDataset.__new__(ZarrDataset)
Contributor commented:

Do you know why you need to call `__new__` (skipping `__init__`)?

@casenave casenave replied (Member, Author)

It allows creating the dataset instance with the correct indices while avoiding an unnecessary call to the constructor.
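
For readers unfamiliar with the pattern discussed above, here is a minimal sketch of building split views with `__new__` so that `__init__` does not run again. `TinyDataset`, its attributes, and the `split` and `_view` helpers are hypothetical stand-ins for illustration; they are not the PR's `ZarrDataset` or its actual `train_test_split` implementation.

```python
class TinyDataset:
    """Hypothetical stand-in for a disk-backed dataset (not the PR's ZarrDataset)."""

    init_calls = 0  # counts how many times the constructor body runs

    def __init__(self, path):
        TinyDataset.init_calls += 1
        self.path = path
        # Assumption: in a real backend this would be a costly scan of storage.
        self.ids = list(range(10))

    def _view(self, ids):
        # __new__ allocates an instance without running __init__,
        # so the costly setup above is not repeated.
        view = TinyDataset.__new__(TinyDataset)
        # Every attribute __init__ would normally set must be copied by hand.
        view.path = self.path
        view.ids = list(ids)
        return view

    def split(self, train_ids, test_ids):
        # Return two datasets that share the source path but hold filtered IDs.
        return self._view(train_ids), self._view(test_ids)


full = TinyDataset("data/")
train, test = full.split(full.ids[:8], full.ids[8:])
```

The trade-off is that any attribute later added to `__init__` must also be copied in `_view`, otherwise the split instances will be missing it. A `classmethod` alternative constructor is a common way to keep that logic in one place.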

@bstaber bstaber merged commit a48780b into main Feb 17, 2026
21 checks passed
@bstaber bstaber deleted the maestro_needs branch February 17, 2026 09:37

Development

Successfully merging this pull request may close these issues.

partially reconstruct sample from HF dataset

3 participants