[ENH] Expose seasonality parameters of ProphetPiecewiseLinearTrendForecaster #5834

sbuse · 2024-01-25T15:56:43Z

The PR exposes the seasonal parameters of the ProphetPiecewiseLinearTrendForecaster.

Reference Issues/PRs

The PR is the result of the discussion in this (#5592).

What does this implement/fix? Explain your changes.

This is a simple change to allow the user to define what the seasonality parameters should be. @tpvasconcelos suggested they should be

daily_seasonality=False,
weekly_seasonality=False,
yearly_seasonality=False

but as the discussion (#5592) showed it is not clear if this is the best setting.

Does your contribution introduce a new dependency? If yes, which one?

No there are no new dependencies.

What should a reviewer concentrate their feedback on?

Did you add any tests for the change?

I added no new test since it is such a simple change.

Any other comments?

PR checklist

For all contributions

I've added myself to the list of contributors with any new badges I've earned :-)
How to: add yourself to the all-contributors file in the sktime root directory (not the CONTRIBUTORS.md). Common badges: code - fixing a bug, or adding code logic. doc - writing or improving documentation or docstrings. bug - reporting or diagnosing a bug (get this plus code if you also fixed the bug in the PR).maintenance - CI, test framework, release.
See here for full badge reference
Optionally, I've added myself and possibly others to the CODEOWNERS file - do this if you want to become the owner or maintainer of an estimator you added.
See here for further details on the algorithm maintainer role.
The PR title starts with either [ENH], [MNT], [DOC], or [BUG]. [BUG] - bugfix, [MNT] - CI, test framework, [ENH] - adding or improving code, [DOC] - writing or improving documentation or docstrings.

For new estimators

I've added the estimator to the API reference - in docs/source/api_reference/taskname.rst, follow the pattern.
I've added one or more illustrative usage examples to the docstring, in a pydocstyle compliant Examples section.
If the estimator relies on a soft dependency, I've set the python_dependencies tag and ensured
dependency isolation, see the estimator dependencies guide.

…ub.com/sbuse/sktime into expose-seasonality-piecewise-detrender

fkiraly · 2024-01-25T18:39:06Z

quick question, does this change the default behaviour? I would hope not?

sbuse · 2024-01-25T20:01:56Z

The default behavior does not change. It just makes the parameters accessible.

sktime/forecasting/trend/_pwl_trend_forecaster.py

fkiraly

In-principle ok, though I would suggest to improve the docstring (non-blocking)

sbuse · 2024-01-26T11:26:00Z

Thanks @fkiraly for asking for a more precise description. Reading into the exact meaning of the parameters made me realise this change will just create a lot of confusion and constraining the model just to do a piece wise linear fit seems a lot clearer.

If you agree, I will create a new branch and constrain the model just to do the piece wise linear fit.

tpvasconcelos · 2024-01-26T14:23:27Z

If you agree, I will create a new branch and constrain the model just to do the piece wise linear fit.

I think that this is a better idea and would make this trend forecaster's behaviour more intuitive 👍

fkiraly · 2024-01-26T14:40:10Z

hmmmmm - could you explain the two options?

I will also reopen the PR even if we do not merge it, until the discussion is complete, as closed PRs have much lower visibility for developers. If discussionn gets longer, we should open an issue.

fkiraly

should not be merged until discussion is complete

tpvasconcelos · 2024-01-27T10:15:03Z

@fkiraly fair enough (good policy!)

For the record, I will quickly re-iterate and extend on some of the points I mentioned in the original PR.

Why I'm not a fan of the current solution

I don't think that a separate ProphetPiecewiseLinearTrendForecaster class should have been introduced. Few reasons for this:

Lots of code repetition between this and the original Prophet class implementation
In the future, users will want more control over how this works internally and will ask for more parameters to be exposed (like the seasonality parameters @sbuse is suggesting in this PR).
- To give another example, users might also ask to expose holiday-related parameters if, for instance, the effect size of some special days/periods in the series has a significant effect on the fitted trend component and the user doesn't want this since he/she is already modelling special days/periods some other way.
Implementing this class, leaves a door open for someone to create yet more separate ProphetLogisticTrendForecaster, ProphetDeseasonalizer, ProphetRegressors, ProphetHolidays classes, and so on...
All of the extra class implementations listed in 3. suffer from the same problems described in 1. and 2.

My preferred solution

I think that the solution to this problem that would be simplest, cleanest, and clearest-to-the-end-user, is to simply expose an extra parameter to the existing Prophet class that allows users to extract the structural component(s) they want to get out of the Prophet model.

For instance

# this:
forecaster = Prophet(extract_components="trend")
# would be the same as this:
forecaster = ProphetPiecewiseLinearTrendForecaster()

and

# this:
forecaster = Prophet(yearly_seasonality=False, extract_components="trend")
# would be the same as this:
forecaster = ProphetPiecewiseLinearTrendForecaster(yearly_seasonality=False)

and

# this:
forecaster = Prophet(yearly_seasonality=False, extract_components=["daily", "weekly"])
# would be the same as this
forecaster = ProphetDeseasonalizer(yearly_seasonality=False)

The implementation is very straightforward and clear:

Add an extra extract_components parameter to the Prophet class
Change this line in _fbprophet.py to extract the special component instead of "yhat" (default)

sktime/sktime/forecasting/base/adapters/_fbprophet.py

Line 197 in c540a77

y_pred = out.loc[:, "yhat"]

i.e.,

-  y_pred = out.loc[:, "yhat"] 
+  component = self.extract_component or "yhat"
+  y_pred = out.loc[:, component]

This ☝️ could be extended to accept multiple components (e.g., extract_components=["daily", "weekly"]) but needs a bit more speccing since it depends on whether the components are additive or multiplicative which can to be inferred from the fitted self._forecaster. The point is: it can be done!

sbuse · 2024-01-27T12:39:10Z

I would like to answer why we should not expose the seasonal components and rather fix them all to false and disable any seasonal modeling. When I suggested implementing a piecewise linear detrender it was due to the fact there is no such thing in the arsenal of sktime nor sci-kit learn. What I was looking for was a function that just models the trend and I would then compose the result with another forecaster or deseasonalizer in a pipeline.

What the current implementation does, is to fit a complex model to the data and it is not clear if there are seasonal components added or not. I see two problems with this.

This is confusing for the user. Why do I need to specify a seasonal behavior of my detrender when I want to compose it with another deseasonalizer later in my pipeline?
The seasonal modeling could affect the result of the detrending (we saw this happening in the experiment I posted). I think the detrender should just model the trend and the residual will be handled by the next step in the pipeline.

To me, it was frustrating not finding a way to do this type of detrending and that is why I suggested it. How to best add this to the code base, I don't know but having something with a clear intended usage is appealing to me.

fkiraly · 2024-01-27T16:19:44Z

Hm, I think this is a very interesting discussion. Let me see if I understand things correctly, please let me know if not.

@sbuse's original motivation was to have a piecewise linear interpolator or forecaster - this did not exist, and the quickest way he thought was getting it from Prophet with most features turned off, since it is a component of prophet but not available easily on its one anywhere. @sbuse is not actually interested in Prophet as a whole, just in getting a component for use in a large pipeline.
@tpvasconcelos is concerned about multiplication of classes, but acknowledges that at the moment it is not easy to get components from the current Prophet interface in sktime. He proposes to add a feature to the existing Prophet that allows to obtain components, including a piecewise linear trend but also others.

Am I understanding well?

If yes, I do think both viewpoints are valid and not contradictory.
That is:

it makes sense to have a class that just does "piecewise linear trend forecast" - prophet or not
it would be nice to have trends obtainable from prophet

Some semi-ordered thoughts:

from a policy standpoint, sktime encourages contributing classes as long as it is well-described what they do. So even if classes were multiplicative, if someone thinks this is exactly what they need/want (and they commit to contribute/maintain), then it is fine to add it. This is slightly different from how sklearn manages contributions (there, the bar is quite high to add anything).
Personally, I do think piecewise linear trend as its own component makes sense. It is a bit unfortunate that there is no separate implementation, but using a constrained prophet instance is better than not having it.
It also makes sense to have a decomposition estimator for the prophet model. However, @tpvasconcelos, I wonder, would that not be more sth like a transformer? For example, look at variational mode decomposition, VmdTransformer.
I also agree that it makes sense to have seasonality defaults as False in the piecewise linear trend estimator, since users will expect it to not have any additional components by default. However, the estimator has been released, so if we want to change that, we need to go through a deprecation cycle. The easiest way would be to introduce it as a parameter, and change the default, the earliest point for such a change at current is 0.28.0, if a warning message and the parameter is added in 0.27.0 or earlier.

fkiraly · 2024-01-27T16:24:11Z

apologies for the ping, @hliebert, I meant @tpvasconcelos. The reason is very mundane, GitHub web API has an auto-complete dropdown menu where possible pings are ordered by - not sure - likelihood estimated by an AI or similar. I misclicked.

I wonder though whether the model in question knows something. I would find it scary if you actually end up finding this discussion highly relevant for you, @hliebert.

tpvasconcelos · 2024-01-27T16:34:57Z

@fkiraly good summary and I agree with your points!

i.e.,

It makes sense to keep this piecewise linear detrender in place since users don't care and don't need to know that the internal implementation is coming from Prophet. They simply want to use a piecewise linear detrender and it makes sense to make it available as a standalone class 👍
That said, I also agree with you and @sbuse that the seasonal components should be turned off by default since this is not obvious and not the expected default behaviour.
Shame for the deprecation cycle but it makes sense

tpvasconcelos · 2024-01-27T16:55:20Z

It also makes sense to have a decomposition estimator for the prophet model. However, @tpvasconcelos, I wonder, would that not be more sth like a transformer? For example, look at variational mode decomposition, VmdTransformer.

@fkiraly I think it would be used in a transformer however I'm not sure what the best way to implement this is. I usually use the Detrender transformer whenever I want to remove a component from a ts, even if that component is not a trend.

A bit unrelated to this conversation, but I think that it's a shame that the Detrender class was named this way 😄 because it really is much more generalised than that. Its implementation just fits any forecaster to the data and returns the in-sample residuals (using either an additive or multiplicative model). Sure it works as a "de-tender" if the forecaster is a trend-based forecaster but it works just as well with all other forecasters.

So, back to your question, here's a toy example of how I could use the Prophet components with the Detrender transformer:

forecaster = TransformedTargetForecaster(
    [
        # Remove the trend component w/ a linear Lasso detrender
        ("detrender": Detrender(TrendForecaster(Lasso()))),

        # Remove daily, weekly, and yearly seasonal components using Prophet
        ("deseasonalizer-dwy": Detrender(Prophet(trend="flat", extract_components=["daily", "weekly", "yearly"]))),

        # Forecast the residuals
        ("forecaster", StatsForecastAutoARIMA()),
    ]
)

I hope you see now why I wish Detrender was named something more generic like Remover (naming is hard!)

Does this make sense to you?

fkiraly · 2024-01-27T17:06:00Z

Shame for the deprecation cycle but it makes sense

Well, we try to not accidentally impact users' downstream code without giving advance warning. Not everyone has the capacity to run the full staging/testing/deploy/monitor mlops cycle, and with a sufficiently large user base there is always someone who (a) relies heavily on any given component and (b) ends up getting their pipeline killed if the change were breaking and unannounced...

On the other hand, two months are not as long as one might think.

fkiraly · 2024-01-27T17:09:12Z

I usually use the Detrender transformer whenever I want to remove a component from a ts, even if that component is not a trend.

Yes, "subtractor" would be more accurate, at the cost of being orders of magnitude more confusing...

We did have this conversation at the very start, btw - Residualator? TakeResiduals?
Remover sounds sth like out of a mafia movie.
A slightly better option would be RemoveNoun, but if Noun = Trend, then we're back at Detrender...

But agreed that naming is hard.

tpvasconcelos · 2024-01-27T17:24:48Z

Subtractor doesn't work when model="multiplicative" and my teammates would kill me if I ever named a class Residualator... DetrenderEtAl it is!

fkiraly · 2024-01-27T20:25:28Z

DetrenderButAlsoResidualsComputatorInGeneral

sbuse · 2024-01-27T21:04:50Z

Thanks @tpvasconcelos for the example. Now I got how you want to extract and use the other parts of the model. I wonder though if there are no tools for a Fourier-sum in the scikit learn toolbox.

Some semi-ordered thoughts:

Personally, I do think piecewise linear trend as its own component makes sense. It is a bit unfortunate that there is no separate implementation, but using a constrained prophet instance is better than not having it.

We could also try to wrap another implementation of a piecewise linear regression that is not from the prophet model. Personally, I would like that a lot more even though prophet has shown to work quite well. A quick search revealed this code base (https://github.com/chasmani/piecewise-regression)

fkiraly · 2024-01-27T21:42:13Z

A quick search revealed this code base (https://github.com/chasmani/piecewise-regression)

Hm, that seems not to be scikit-learn compliant.

If you can find a scikit-learn compliant piecewise linear regressor, you can plug it into TrendForcaster to get a piecewise linear forecaster (which you could then also use in a Detrender...)

sbuse · 2024-01-29T15:22:06Z

I also agree that it makes sense to have seasonality defaults as False in the piecewise linear trend estimator, since users will expect it to not have any additional components by default. However, the estimator has been released, so if we want to change that, we need to go through a deprecation cycle. The easiest way would be to introduce it as a parameter, and change the default, the earliest point for such a change at current is 0.28.0, if a warning message and the parameter is added in 0.27.0 or earlier.

@fkiraly I wonder how we should proceed to set the seasonality defaults to False. Adding the parameters, then changing the default and then hiding them again sounds cumbersome but if we have to do it we could use this PR to add the parameters.

fkiraly · 2024-01-29T23:10:53Z

Adding the parameters, then changing the default and then hiding them again sounds cumbersome but if we have needed we could use this PR to add the parameters.

Imo that's the simplest compliant pathway to the end state where they are internally different and not exposed.

I would aim for a differet end state, exposed but default as False, that's one step shorter.

Yes, this PR could be the start - we could even add a warning right away that the default will change to False in 0.28.0. The typical trick is to set a default of None, and raise the warning only if the value is None (because other value means it is set by user).

fkiraly

Thanks - almost done, but I think the warning trigger condition is not right.

The warning should be triggered in every case where the user has code that changes logic witih 0.28.0. I've added a recipe here on how to ensure that: #5875

The case we need to cover is where the user does not set the seasonality parameter explicitly. Under the current condition, no user would see the warning with the version that contains this PR.

fkiraly

Thx

fkiraly

The conition is now right, but now the requirement "values of self params should always mirror values of init params" is violated.

I've created an example specifically for htis case and would appreciate feedback on how easy it is to understand!
https://www.sktime.net/en/latest/developer_guide/deprecation.html#id1

Of course I can also make the change (it is small), but (a) it's probably interesting to do and (b) it would be great if we can "test" the new example in the developer guide.

sbuse · 2024-02-14T13:15:38Z

@fkiraly Thanks for the example and the doc extension. It makes the deprecation very clear and i will change the script accordingly.

Could you elaborate why the trick with self._parameter is necessary? What do you gain compared to overriding self.parameter? Maybe you could also put a short explanation in the template description.

fkiraly · 2024-02-14T13:51:40Z

Could you elaborate why the trick with self._parameter is necessary?

That's coming from an overriding sklearn interface expectation, namely that __init__ params are (a) immediately written to self, and (b) never changed from their original value. If we would not do the "trick", then at some point self.parameter has a different value than what was passed to __init__.

Good idea to add that to the docs. Would you like to add an explanatory sentence at the end of the first example, or elsewhere were it might be useful (and not to distracting)? The best place would be the point at which the reader starts to ask the question, but late enough so they have had time to digest the example.

That is, the best place might be the point at which you started to wonder.

sbuse added 3 commits January 25, 2024 16:20

exposing seasonality params

8aa2034

Merge branch 'expose-seasonality-piecewise-detrender' of https://gith…

d4945e2

…ub.com/sbuse/sktime into expose-seasonality-piecewise-detrender

Merge branch 'sktime:main' into expose-seasonality-piecewise-detrender

b1a8a00

sbuse requested review from achieveordie, benHeid, fkiraly and yarnabrina as code owners January 25, 2024 15:56

fkiraly added module:forecasting forecasting module: forecasting, incl probabilistic and hierarchical forecasting enhancement Adding new functionality labels Jan 25, 2024

fkiraly reviewed Jan 26, 2024

View reviewed changes

sktime/forecasting/trend/_pwl_trend_forecaster.py Outdated Show resolved Hide resolved

fkiraly approved these changes Jan 26, 2024

View reviewed changes

sbuse closed this Jan 26, 2024

fkiraly reopened this Jan 26, 2024

fkiraly requested changes Jan 26, 2024

View reviewed changes

sbuse added 2 commits January 30, 2024 17:35

Merge branch 'sktime:main' into expose-seasonality-piecewise-detrender

8600c1f

add warning if seasonality is different than default

98f3a69

fkiraly requested changes Feb 2, 2024

View reviewed changes

sbuse added 2 commits February 5, 2024 16:12

changing warning condition

9d3fac4

setting stacklevel in warning

02bc8b7

fkiraly approved these changes Feb 11, 2024

View reviewed changes

fkiraly requested changes Feb 11, 2024

View reviewed changes

adding proper parameter handling

31d639c

fkiraly previously approved these changes Feb 17, 2024

View reviewed changes

add more todos for safety

02120a8

fkiraly dismissed their stale review via 02120a8 February 18, 2024 16:39

fkiraly and others added 2 commits February 18, 2024 17:39

Merge branch 'main' into pr/5834

98612fc

[AUTOMATED] update CONTRIBUTORS.md

40888c6

fkiraly merged commit 9bb3766 into sktime:main Feb 18, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] Expose seasonality parameters of ProphetPiecewiseLinearTrendForecaster #5834

[ENH] Expose seasonality parameters of ProphetPiecewiseLinearTrendForecaster #5834

sbuse commented Jan 25, 2024 •

edited

fkiraly commented Jan 25, 2024

sbuse commented Jan 25, 2024

fkiraly left a comment

sbuse commented Jan 26, 2024

tpvasconcelos commented Jan 26, 2024

fkiraly commented Jan 26, 2024

fkiraly left a comment

tpvasconcelos commented Jan 27, 2024

sbuse commented Jan 27, 2024

fkiraly commented Jan 27, 2024 •

edited

fkiraly commented Jan 27, 2024 •

edited

tpvasconcelos commented Jan 27, 2024

tpvasconcelos commented Jan 27, 2024

fkiraly commented Jan 27, 2024

fkiraly commented Jan 27, 2024

tpvasconcelos commented Jan 27, 2024

fkiraly commented Jan 27, 2024

sbuse commented Jan 27, 2024

fkiraly commented Jan 27, 2024

sbuse commented Jan 29, 2024 •

edited

fkiraly commented Jan 29, 2024

fkiraly left a comment

fkiraly left a comment

fkiraly left a comment

sbuse commented Feb 14, 2024

fkiraly commented Feb 14, 2024 •

edited

[ENH] Expose seasonality parameters of ProphetPiecewiseLinearTrendForecaster #5834

[ENH] Expose seasonality parameters of ProphetPiecewiseLinearTrendForecaster #5834

Conversation

sbuse commented Jan 25, 2024 • edited

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Does your contribution introduce a new dependency? If yes, which one?

What should a reviewer concentrate their feedback on?

Did you add any tests for the change?

Any other comments?

PR checklist

For all contributions

For new estimators

fkiraly commented Jan 25, 2024

sbuse commented Jan 25, 2024

fkiraly left a comment

Choose a reason for hiding this comment

sbuse commented Jan 26, 2024

tpvasconcelos commented Jan 26, 2024

fkiraly commented Jan 26, 2024

fkiraly left a comment

Choose a reason for hiding this comment

tpvasconcelos commented Jan 27, 2024

Why I'm not a fan of the current solution

My preferred solution

sbuse commented Jan 27, 2024

fkiraly commented Jan 27, 2024 • edited

fkiraly commented Jan 27, 2024 • edited

tpvasconcelos commented Jan 27, 2024

tpvasconcelos commented Jan 27, 2024

fkiraly commented Jan 27, 2024

fkiraly commented Jan 27, 2024

tpvasconcelos commented Jan 27, 2024

fkiraly commented Jan 27, 2024

sbuse commented Jan 27, 2024

fkiraly commented Jan 27, 2024

sbuse commented Jan 29, 2024 • edited

fkiraly commented Jan 29, 2024

fkiraly left a comment

Choose a reason for hiding this comment

fkiraly left a comment

Choose a reason for hiding this comment

fkiraly left a comment

Choose a reason for hiding this comment

sbuse commented Feb 14, 2024

fkiraly commented Feb 14, 2024 • edited

sbuse commented Jan 25, 2024 •

edited

fkiraly commented Jan 27, 2024 •

edited

fkiraly commented Jan 27, 2024 •

edited

sbuse commented Jan 29, 2024 •

edited

fkiraly commented Feb 14, 2024 •

edited