Feat/sample weights #2404

dennisbader · 2024-06-05T11:59:36Z

Checklist before merging this PR:

Mentioned all issues that this PR fixes or addresses.
Summarized the updates of this PR under Summary.
Added an entry under Unreleased in the Changelog.

Fixes #1175, fixes #2107.

(this is a continuation of #2362)

Summary

adds support for training regression models with sample weights.
fit() now has two new parameters sample_weight and val_sample_weight which allow to pass sample weights for training and validation (val_sample_weights only for models that support validation sets)

sample_weight
            Optionally, some sample weights to apply to the target `series` labels.
            They are applied per observation, per label (each step in `output_chunk_length`), and per component.
            If a string, then the weights are generated using built-in weighting functions. The available options are
            `"linear_decay"` or `"exponential_decay"`. The weights are only computed the longest series in `series`,
            and then applied globally to all `series` to have a common time weighting.
            If a `TimeSeries` or `Sequence[TimeSeries]`, then those weights are used. The number of series must
            match the number of target `series` and each series must contain at least all time steps from the
            corresponding target `series`. If the weight series only have a single component / column, then the weights
            are applied globally to all components in `series`. Otherwise, for component-specific weights, the number
            of components must match those of `series`.
val_sample_weight
            Same as for `sample_weight` but for the evaluation dataset.

VascoSch92 · 2024-06-05T12:06:49Z

darts/models/forecasting/regression_model.py

+                f"Possible values are: equal, linear_decay, exponential_decay.",
+            )
+        elif isinstance(sample_weight, TimeSeries):
+            # The error is caught later, should we still verify it here?


why this is an error if in the docstring you are saying that If a TimeSeries is passed, then those weights are used.?

This is a placeholder, we are trying to think if it's worth it to check that the time index of the passed series matches the index of the training series for example.

codecov · 2024-06-05T12:27:31Z

Codecov Report

Attention: Patch coverage is 96.29630% with 7 lines in your changes missing coverage. Please review.

Project coverage is 93.78%. Comparing base (05f6ddf) to head (ae06f47).
Report is 1 commits behind head on master.

❗ Current head ae06f47 differs from pull request most recent head 0b9b9cc

Please upload reports for the commit 0b9b9cc to get more accurate results.

Files	Patch %	Lines
darts/utils/multioutput.py	77.77%	4 Missing ⚠️
darts/utils/data/tabularization.py	97.05%	3 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #2404      +/-   ##
==========================================
+ Coverage   93.77%   93.78%   +0.01%     
==========================================
  Files         138      138              
  Lines       14384    14492     +108     
==========================================
+ Hits        13488    13592     +104     
- Misses        896      900       +4

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

madtoinou

Great work!

One minor comment about the index for the generated weights; maybe we should consider the two most extremes index values rather than the length of the longest series? WDYT?

darts/models/forecasting/regression_model.py

darts/utils/data/tabularization.py

Anton Ragot and others added 24 commits December 19, 2023 17:43

Adding building blocks of weight samples

35ac768

Adding exponential decay logic

5668c2e

Linter

4ed8058

Linter flake

7d46079

Linter flake 2

b669439

Linter isort

2bdb71e

Adding Timeseries support

2a33bbb

Adding first test for equal weights

f33a429

Adding first round of tests

48126e6

merge

a0a61fb

working session with le M

75a41be

Adding other tests

a916a1f

Resolve linter issues

a2a9d9b

Resolve flake

605172e

Merge branch 'master' into master

dee5df9

Merge branch 'master' into master

b17ec26

Resolving conflicts

c8a287b

Conflicts again

8b4d006

Removing conflict mistake

5639bff

fixing some tests

c3c61d8

fixing catboost tests

2ffc577

Merge branch 'master' into master

c6395e8

Merge branch 'master' into feat/sample_weights

fb0be36

fix tests from new val set logic

849099d

dennisbader requested a review from madtoinou as a code owner June 5, 2024 11:59

VascoSch92 reviewed Jun 5, 2024

View reviewed changes

dennisbader added 3 commits June 5, 2024 15:36

some cleaning up of unused functions

c272064

correct sample weight options in docs

8d49c58

make simple sample weights work with fit

172514b

dennisbader added 12 commits June 6, 2024 10:25

integrate sample weights into lagged data creation

41191bd

added support for multi horizon per time step weights

c5754fc

add lgbm catboost to tests

08c5c97

remove unused tests

e41c7e4

add tabularization tests

8d95774

remove unused test

a4ca131

update docs

f3cd56a

update regression model tests

daab756

support val set weights

67ae30e

use correct static covariates shape in lagged data creation

be92763

update docs

974e376

update changelog

ae06f47

This was referenced Jun 7, 2024

Feat/ Adding sample weight #2362

Closed

Feat/sample weight torch #2410

Merged

madtoinou approved these changes Jun 17, 2024

View reviewed changes

darts/models/forecasting/regression_model.py Outdated Show resolved Hide resolved

darts/utils/data/tabularization.py Show resolved Hide resolved

dennisbader added 3 commits June 17, 2024 13:22

update docstrings

4897a18

update changelog

06cdb39

Merge branch 'master' into feat/sample_weights

0b9b9cc

dennisbader merged commit 6835c36 into master Jun 17, 2024
9 checks passed

dennisbader deleted the feat/sample_weights branch June 17, 2024 12:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/sample weights #2404

Feat/sample weights #2404

dennisbader commented Jun 5, 2024 •

edited

Loading

VascoSch92 Jun 5, 2024

madtoinou Jun 5, 2024

codecov bot commented Jun 5, 2024 •

edited

Loading

madtoinou left a comment

Feat/sample weights #2404

Feat/sample weights #2404

Conversation

dennisbader commented Jun 5, 2024 • edited Loading

VascoSch92 Jun 5, 2024

Choose a reason for hiding this comment

madtoinou Jun 5, 2024

Choose a reason for hiding this comment

codecov bot commented Jun 5, 2024 • edited Loading

Codecov Report

madtoinou left a comment

Choose a reason for hiding this comment

dennisbader commented Jun 5, 2024 •

edited

Loading

codecov bot commented Jun 5, 2024 •

edited

Loading