Fix/max_samples_per_ts #2987

brunnedu · 2026-01-10T10:26:45Z

Checklist before merging this PR:

Mentioned all issues that this PR fixes or addresses.
Summarized the updates of this PR under Summary.
Added an entry under Unreleased in the Changelog.

Summary

This PR fixes a bug in ShiftedTorchTrainingDataset (and its subclasses SequentialTorchTrainingDataset and HorizonBasedTorchTrainingDataset) where the max_samples_per_ts parameter was not properly acting as an upper bound on the number of samples extracted per time series.

Example from issue:

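# Import paths assumed; adjust to your Darts version if they differ.
from darts.utils.data import ShiftedTorchTrainingDataset
from darts.utils.timeseries_generation import linear_timeseries

series = linear_timeseries(length=1000)
dataset = ShiftedTorchTrainingDataset(
    series,
    input_chunk_length=11,
    output_chunk_length=13,
    max_samples_per_ts=5000,
)
# Before: len(dataset) == 5000 (incorrect: more than the series can provide)
# After:  len(dataset) == 987  (correct: 1000 - (13 + 1) + 1)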
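The arithmetic behind the expected value of 987 can be reproduced without Darts. The helper below is a minimal sketch, not the library's code: `extractable_samples` is a hypothetical name, and it assumes a default `shift` of 1 (which is what the `13 + 1` in the comment above suggests) and that each sample spans `max(input_chunk_length, shift + output_chunk_length)` time steps.

```python
def extractable_samples(
    series_length: int,
    input_chunk_length: int,
    output_chunk_length: int,
    shift: int = 1,   # assumed default, matching the "13 + 1" in the example
    stride: int = 1,
) -> int:
    """Count the windows that fit in one series (illustrative, not the Darts API)."""
    # A sample has to cover the input chunk and the shifted output chunk.
    sample_length = max(input_chunk_length, shift + output_chunk_length)
    if series_length < sample_length:
        return 0
    # Slide the window over the series in steps of `stride`.
    return (series_length - sample_length) // stride + 1


print(extractable_samples(1000, 11, 13))  # 987
```

Under these assumptions, any `max_samples_per_ts` above 987 asks for more samples than this series can provide, which is the case the PR addresses.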

Changes made:

Fixed calculation logic in ShiftedTorchTrainingDataset.__init__() to cap max_samples_per_ts at the maximum extractable samples over all series.
Added unit test (test_max_samples_per_ts_upper_bound) that verifies:
- Behavior with max_samples_per_ts=None
- Behavior when max_samples_per_ts > actual_max (the bug case)
- Behavior when max_samples_per_ts < actual_max
- Behavior with stride > 1
- Behavior with multiple series of different lengths
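The capping rule and the listed test cases can be sketched in plain Python. This is an illustration under stated assumptions, not the code from the PR: `samples_in_series` and `capped_max_samples` are hypothetical names, and `sample_length` stands for the number of time steps one sample spans (14 in the example from the issue).

```python
from typing import Optional, Sequence


def samples_in_series(length: int, sample_length: int, stride: int = 1) -> int:
    """Number of windows of `sample_length` that fit in a series of `length`."""
    if length < sample_length:
        return 0
    return (length - sample_length) // stride + 1


def capped_max_samples(
    series_lengths: Sequence[int],
    sample_length: int,
    max_samples_per_ts: Optional[int] = None,
    stride: int = 1,
) -> int:
    """Cap the requested value at the most samples any single series can yield."""
    actual_max = max(samples_in_series(n, sample_length, stride) for n in series_lengths)
    if max_samples_per_ts is None:
        return actual_max
    return min(max_samples_per_ts, actual_max)


# Cases mirroring the list above (sample_length=14 as in the issue example).
print(capped_max_samples([1000], 14, None))           # 987: no limit requested
print(capped_max_samples([1000], 14, 5000))           # 987: the bug case, now capped
print(capped_max_samples([1000], 14, 100))            # 100: a smaller request is kept
print(capped_max_samples([1000], 14, 5000, stride=2)) # 494: stride > 1
print(capped_max_samples([1000, 500], 14, 5000))      # 987: longest series sets the cap
```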

Thanks to @daidahao for identifying and reporting this issue!

codecov · 2026-01-10T10:41:51Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 95.56%. Comparing base (72edd10) to head (4306d66).
⚠️ Report is 1 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #2987      +/-   ##
==========================================
- Coverage   95.63%   95.56%   -0.07%     
==========================================
  Files         153      153              
  Lines       16433    16435       +2     
==========================================
- Hits        15715    15706       -9     
- Misses        718      729      +11

daidahao · 2026-01-10T18:22:28Z

@brunnedu
Thank you for the quick fix!

daidahao · 2026-01-14T16:42:24Z

@brunnedu Hi, do you know when this could be merged? I also have a TimesFM PR that needs a review.

brunnedu · 2026-01-15T08:31:06Z

Hi @daidahao, as @dennisbader, our codeowner, is currently away on leave, it will likely be another 2 weeks or so before we can merge these PRs. Regarding the TimesFM PR, I’d like @dennisbader to give it a final look once he’s back, as his familiarity with foundation models will be really valuable there. Thanks for your contributions!

CHANGELOG.md

jakubchlapek

small comment on the changelog, but LGTM :) thanks

Co-authored-by: Jakub Chłapek <147340544+jakubchlapek@users.noreply.github.com>

dennisbader

Beautiful PR, thanks a lot @brunnedu 🚀

Just as a note to @daidahao: for multi-series of different lengths this will still upsample shorter series to have the same number of samples for each series (according to max samples of the longest series).

We can complete this PR once #2995 has been merged :)
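The note above about shorter series being upsampled can be made concrete with a small sketch. It assumes the dataset exposes `max_samples_per_ts` slots for every series and that slots beyond what a short series can supply wrap around via a modulo; the wrap-around rule and the name `locate_sample` are assumptions for illustration, not confirmed details of the Darts implementation.

```python
from typing import Sequence, Tuple


def locate_sample(
    idx: int,
    series_lengths: Sequence[int],
    sample_length: int,
    max_samples_per_ts: int,
) -> Tuple[int, int]:
    """Map a flat dataset index to (series index, sample index within that series)."""
    series_idx = idx // max_samples_per_ts
    slot = idx % max_samples_per_ts
    # Samples this particular series can actually provide (stride of 1 assumed).
    available = series_lengths[series_idx] - sample_length + 1
    # Assumed wrap-around: slots past `available` reuse earlier samples.
    return series_idx, slot % available


lengths = [1000, 500]   # longest series yields 987 samples, shorter one 487
cap = 987               # capped max_samples_per_ts from the longest series
print(cap * len(lengths))                        # 1974 slots in total
print(locate_sample(0, lengths, 14, cap))        # (0, 0)
print(locate_sample(987, lengths, 14, cap))      # (1, 0)
print(locate_sample(987 + 487, lengths, 14, cap))  # (1, 0): the short series repeats
```

In this sketch every series contributes the same number of slots, so the shorter series is sampled roughly twice as often per time step as the longer one.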

fix max_samples_per_ts not acting as upper bound; add test; update ch…

f09033e

…angelog

brunnedu requested a review from dennisbader as a code owner January 10, 2026 10:26

brunnedu changed the title ~~fix max_samples_per_ts not acting as upper bound; add test; update ch…~~ Fix/max_samples_per_ts Jan 10, 2026

jakubchlapek reviewed Jan 21, 2026

CHANGELOG.md (outdated)

jakubchlapek reviewed Jan 21, 2026

Update CHANGELOG.md

d856529

Co-authored-by: Jakub Chłapek <147340544+jakubchlapek@users.noreply.github.com>

dennisbader approved these changes Jan 23, 2026

dennisbader added 2 commits January 23, 2026 07:30

Merge branch 'master' into fix/max-samples-per-ts

167bcf4

Merge branch 'master' into fix/max-samples-per-ts

4306d66

dennisbader merged commit 807d22c into unit8co:master Jan 23, 2026
9 checks passed


5 participants
