Two new options: CutSet.from_shar(seed="trng") and DynamicCutSampler(quadratic_duration=...) #1199

Merged: 5 commits, Nov 6, 2023

pzelasko commented Nov 1, 2023

  • seed="trng" (true random number generator) is useful when dataloading from Shar if, for any reason, the training dataloader has to be recreated after every validation run. Shar typically depends on keeping track of epochs internally to reshuffle the shards; with TRNG it doesn't have to, at the cost of losing exact reproducibility.

  • Exposed quadratic_duration in DynamicCutSampler. This is useful when the CutSet is sorted by duration: the dynamic cut sampler can then act as optimal pseudo-bucketing, and quadratic_duration can help tune max_duration to an optimal value.
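
To illustrate both options, here is a minimal stdlib-only sketch; it is not the Lhotse implementation. It assumes the quadratic correction has the form `duration + duration**2 / quadratic_duration`, and that `seed="trng"` simply means "draw a fresh seed from the OS entropy source". The helper names `effective_duration`, `make_batches`, and `shuffle_shards` are hypothetical and exist only for this example.

```python
import random
import secrets


def effective_duration(duration, quadratic_duration=None):
    """Duration plus a quadratic penalty (assumed form: d + d**2 / q)."""
    if quadratic_duration is None:
        return duration
    return duration + duration ** 2 / quadratic_duration


def make_batches(durations, max_duration, quadratic_duration=None):
    """Greedily pack consecutive items until the effective-duration budget is used up."""
    batches, current, total = [], [], 0.0
    for d in durations:
        cost = effective_duration(d, quadratic_duration)
        # Close the current batch if adding this item would exceed the budget.
        if current and total + cost > max_duration:
            batches.append(current)
            current, total = [], 0.0
        current.append(d)
        total += cost
    if current:
        batches.append(current)
    return batches


def shuffle_shards(shards, seed):
    """Shuffle shard names; seed="trng" draws a fresh, non-reproducible seed."""
    if seed == "trng":
        seed = secrets.randbits(64)
    shuffled = list(shards)
    random.Random(seed).shuffle(shuffled)
    return shuffled


# Durations sorted ascending, as in the pseudo-bucketing use case.
durations = [2.0, 2.0, 4.0, 4.0, 15.0, 15.0, 30.0, 30.0]
linear = make_batches(durations, max_duration=60.0)
quadratic = make_batches(durations, max_duration=60.0, quadratic_duration=15.0)
```

With the same max_duration, the quadratic variant packs long items into smaller batches while short items are barely affected, which is why it can make a single max_duration value work across the whole duration range. With seed="trng", every call to `shuffle_shards` may yield a different shard order without any epoch bookkeeping, so a recreated dataloader does not replay the same order.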

pzelasko added this to the v1.18 milestone Nov 1, 2023
pzelasko merged commit 8a4b3b5 into master Nov 6, 2023
8 of 10 checks passed
pzelasko deleted the feature/shar-trng branch November 6, 2023 01:59