ENH: New Seasonal TimeSeries Model:TBATS/BATS/MTBATS #2892

Leoyzen · 2016-04-14T04:28:50Z

A new timeseries forecast/decompose model based on Rob.J.Hyndman's paper.
http://robjhyndman.com/papers/complex-seasonality/

It is the same as the tbats model of R's forecast package.

Feature:

Long Seasonality forecast (such as minutes data)
Complex seasonality based on trigonometric transform(such as multiple seasonality/non-interger periods)
BoxCox transform
arma error
exog support

work:

coveralls · 2016-04-14T04:56:23Z

Coverage decreased (-0.02%) to 85.359% when pulling 35ed40c on Leoyzen:tbats into 970e99e on statsmodels:master.

josef-pkt · 2016-05-03T01:14:47Z

statsmodels/tsa/statespace/tbats.py

+
+import numpy as np
+from numpy.linalg import LinAlgError
+import statsmodels.api as sm


AFAICS, this adds a circular import and causes the TravisCI test error.
inside the code we only import directly from the modules (or subpackages)

(I'm not sure what this circular import does and why it fails for some tests but not all.)

I'm looking into it

coveralls · 2016-07-19T06:05:06Z

Coverage decreased (-1.1%) to 88.052% when pulling b5d09f8 on Leoyzen:tbats into 9ccf83f on statsmodels:master.

N-Wouda · 2016-07-19T07:25:11Z

statsmodels/tsa/statespace/tbats.py

+
+        if self.boxcox:
+            endog, lmbda = self.transform_boxcox(self.data.endog)
+            if np.isnan(lmbda) or not (0 <= lmbda <= 1):


As of the next commit you can specify bounds=(0, 1) as an optional argument to transform_boxcox. This will ensure optimisation takes place over the specified interval.

coveralls · 2016-07-19T07:39:06Z

Coverage decreased (-1.1%) to 88.052% when pulling decd0b8 on Leoyzen:tbats into 9ccf83f on statsmodels:master.

coveralls · 2016-07-20T00:51:06Z

Coverage decreased (-1.1%) to 88.067% when pulling 935de44 on Leoyzen:tbats into 9ccf83f on statsmodels:master.

ChadFulton · 2016-07-20T03:37:12Z

I finally had a chance to look at Hyndman's papers and I'm starting to look this over.

ChadFulton · 2016-07-20T03:37:52Z

statsmodels/tsa/statespace/innovation.py

+            elif key == 'transition':
+                self.ssm['transition', :-1, :-1] = value
+            elif key == 'selection':
+                self.ssm['transition', :-1, -1:] = value


I'm guessing this should be 'selection', and the same below in __getitem__?

No, follow the construction of your blog post which you gave to me it should be like this.

I don't see why trying to set the selection should set things in the transition matrix. Can you remind me which blog post? It could have been a typo on my part.

Given this, it looks like you never set the actual selection matrix, which I think would render the model useless? Or am I missing something?

http://nbviewer.jupyter.org/gist/ChadFulton/f0b8fc2e7d47a0103f97
and the origin google groups discuss is here
confuse with the example of AR(1) State Space Model

Oh I see, yes, you're correct; and now I see the setting of the actual selection matrix in the __init__ call, so that should be fine.

Leoyzen · 2016-07-20T04:15:10Z

@ChadFulton Maybe you can help me to look at the start params initialization at arma error and regression part, actually I am not familar with this.

coveralls · 2016-07-20T05:00:43Z

Coverage decreased (-1.1%) to 88.065% when pulling f234733 on Leoyzen:tbats into 9ccf83f on statsmodels:master.

ChadFulton · 2016-07-20T06:16:45Z

statsmodels/tsa/statespace/tbats.py

+    ----------
+    periods:   iterable of float or None or array, optional
+        The period of the seasonal component. Default is None. Can be multiple
+    k:  int or None, optional, should be same length of periods


This should be renamed into a descriptive name.

ChadFulton · 2016-07-20T17:54:41Z

@ChadFulton Maybe you can help me to look at the start params initialization at arma error and regression part, actually I am not familar with this.

Sure - you're asking about how to compute start_params for the ARMA and regression coefficients?

ChadFulton · 2016-07-20T17:56:13Z

I can't quite tell what the status of this model is. It looks pretty complete, but there are no unit tests. Are things generally working, or is there something holding it up?

I know you were running into a problem with "variance initialization of the model", but I'm not sure what that means - is that still an issue?

ChadFulton · 2016-07-20T19:20:17Z

It looks to me like this model pretty much works. Hard to test against the R version of TBATS because they allow larger parameter ranges for alpha and beta. I don't quite understand, but running:

library(forecast)
dta <- read.csv('datasets/macrodata/macrodata.csv')
endog <- log(dta$realgdp) * 400
res <- tbats(endog)

yields the following:

BATS(1, {0,0}, 0.994, -)

Call: tbats(y = endog)

Parameters
  Alpha: 1.23139
  Beta: -0.01928973
  Damping Parameter: 0.994394

Seed States:
            [,1]
[1,] 3134.685159
[2,]    5.292013

Sigma: 3.641562
AIC: 1613.3

alpha > 1 which is not a huge problem if TBATS is allowing the larger parameter range, but I thought beta was always supposed to be positive (e.g. see Hyndman's book, page 45).

I'm not an expert in these models though, so I may have misunderstood the applicable parameter ranges. @Leoyzen @davidljung do you know what's going on with R here?

Leoyzen · 2016-07-21T02:22:29Z

@ChadFulton According to the file checkAdmissibility.R I don't see any limitation to the alpha/beta/gamma, so I just use the information from the book.

Your question is also my problem , it became more different result to TBATS of R.

The model is almost done here , but some issue with arma and variance initial coefs(within initialize_state function). I'm not familar with this, so please help.

The optimize should be done otherwise it takes long time and large nums of iteration to the fit, and I found that it's hard to optimization to converge with lbfgs.

My next goal is to get more docs and right code style of statsmodels.

ChadFulton · 2016-07-21T02:28:52Z

@ChadFulton According to the file checkAdmissibility.R I don't see any limitation to the alpha/beta/gamma, so I just use the information from the book.

Your question is also my problem , it became more different result to TBATS of R.

Actually I think I have found the reference here - take a look at Table 10.1 in Hyndman's book. It looks like he's using the third set of constraints, which relax the positivity constraint if there is damping.

github-advanced-security

CodeQL found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

Leoyzen · 2024-02-28T15:31:45Z

After 8 years, I've making a progress now.
Identifying code by statespace.ExponentialSmoothing, I made some change in order to implement SSOE(or Innovation State Space) within current framework better.

The key change is from the fomula below.

Considering SSOE (or Innovation State space) formula:

$$\displaylines{y_t = w \alpha_{t-1} + \varepsilon_t \\\ \alpha_{t} = F \alpha_{t-1}+ g \varepsilon_t}$$

and the normal state space formula:

$$\displaylines{y_t = Z \alpha_t \\\ \alpha_t = T \alpha_{t-1} + R \eta_{t-1}}$$

according to the timing differences, we can rewrite the normal formual to

$$\displaylines{y_t = ZT \alpha_{t-1} + ZR\eta_t \\\ \alpha_{t} = T \alpha_{t-1} + TR \eta_{t-1} }$$

then we make sure $ZR=1$ and $TR=g$ .

$ZR=1$ can be ensure by adding a hidden state in transition matrix like ExponentialSmoothing does.

and $TR=g$ can be resolved by scipy.linalg.solve.

    def update(self, params, transformed=True, includes_fixed=False, complex_step=False):
       ......
       # exclude ma from caculation.It's ok.
        end = -q if q != 0 else None
        # we assume $TR = g $
        # so we have to resolve R from $R = T^ * g$
        self["selection", 1:end, 0] = la.solve(
            self["transition", 1:end, 1:end], self._internal_selection[:end, 0]
        )
        # we need ensure $ ZR = 1 $. so the first value of selection matrix should be
        # caculate from others.
        self["selection", 0, 0] = 1 - self["design", :, 1:].dot(
            self["selection", 1:, :]
        )

Then the code from can be removed.

def _initialize_constant_statespace(self, initial_state):
        .....
        # Apply the prediction step to get to what we need for our Kalman
        # filter implementation
        # This is not needed anymore because we compute within update step.
        # constant = np.dot(self.ssm["transition"], constant)

        self.initialization.constant = constant

This implement has been verified by both official implements and python implements, also manually caculated.

So it will be nice when a new class such as InnovationRepresentation which inherent from Representation and implement such changes.

@ChadFulton Can you take a look and give some advices before I moving forward?

ChadFulton · 2024-02-29T01:15:11Z

Thanks @Leoyzen for coming back to this, I will take a look soon (unfortunately, it might not be until this weekend)

2. Add fourier seasonal transform 3. Add some related tools

2. fix circular import issue

Leoyzen · 2024-03-05T06:08:42Z

I just created a class named InnovationsMLEModel to simplify the procedure implementing the SSOE Model in Kalman Filters.

The params and results has been verified with both Python tbats and R forecast package.

…aining

Leoyzen changed the title ~~ENH: New Seasonal TimeSeries Model:TBATS/BATS/MB~~ ENH: New Seasonal TimeSeries Model:TBATS/BATS/MTBATS Apr 14, 2016

Leoyzen mentioned this pull request Apr 14, 2016

ENH: New Seasonal TimeSeries Model:TBATS/BATS/MTBATS #2893

Closed

josef-pkt mentioned this pull request Apr 25, 2016

FAQ: modelling seasonality in time series #2906

Open

josef-pkt added comp-tsa type-enh labels May 3, 2016

josef-pkt reviewed May 3, 2016
View reviewed changes

josef-pkt mentioned this pull request May 3, 2016

Box-Cox transform (some code needed: lambda estimator) #1309

Closed

N-Wouda reviewed Jul 19, 2016
View reviewed changes

Leoyzen mentioned this pull request Jul 19, 2016

WIP: Power transforms (specifically: Box-Cox) mixin #2925

Closed

ChadFulton reviewed Jul 20, 2016
View reviewed changes

github-advanced-security bot found potential problems Feb 28, 2024

View reviewed changes

Leoyzen added 14 commits March 5, 2024 11:00

1. Add Tbats Model

773fec5

2. Add fourier seasonal transform 3. Add some related tools

Add Tbats Model and tools to statespace api file

b27a655

1. fix build error

32a7aad

2. fix circular import issue

inplace boxcox with new branch feature of statsmodels

e24ffd0

fixed new boxcox mixin issues

b9772ba

change the default boxcox method to loglik to reduce the iter nums

89c737d

Change boxcox bounds behavior to new boxcox mixin

3cbe525

BUG: exog handing in start_params if data missing

87066bd

using std ordereddict

850bade

update the model defination

3a5f539

new innovation state space form

1b12c3d

some typo fix

9312332

adopt to new InnovationsMLEModel

9bacab3

move error epsilon to the last row/col

e5b9e1d

Leoyzen force-pushed the tbats branch from a01dc1d to e5b9e1d Compare March 5, 2024 03:01

Leoyzen added 11 commits March 7, 2024 13:38

preposition check admissible, and speed up tbats caculation.

1ea26bb

rego parameter assignment to avoid complex and covariance problem.

2254129

set bias adjustment default to true

14f5fb6

start params default to array to avoid dtype changes during training

5179bc4

reorg paramter update

ba33529

fix filter method when using initial state

2edd40b

remove unnessaray loglikeobs method override

a24c852

adding logic of set_xlim in plot_components

c88dd90

reorder imports

b594f80

fixup! start params default to array to avoid dtype changes during tr…

f2c58ef

…aining

fix slice assignment issues.

be6a9ac

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: New Seasonal TimeSeries Model:TBATS/BATS/MTBATS #2892

ENH: New Seasonal TimeSeries Model:TBATS/BATS/MTBATS #2892

Leoyzen commented Apr 14, 2016

coveralls commented Apr 14, 2016

josef-pkt May 3, 2016

Leoyzen May 24, 2016

coveralls commented Jul 19, 2016 •

edited

N-Wouda Jul 19, 2016

coveralls commented Jul 19, 2016 •

edited

coveralls commented Jul 20, 2016 •

edited

ChadFulton commented Jul 20, 2016

ChadFulton Jul 20, 2016

Leoyzen Jul 20, 2016

ChadFulton Jul 20, 2016

Leoyzen Jul 20, 2016

ChadFulton Jul 20, 2016

Leoyzen commented Jul 20, 2016

coveralls commented Jul 20, 2016 •

edited

ChadFulton Jul 20, 2016

ChadFulton commented Jul 20, 2016

ChadFulton commented Jul 20, 2016

ChadFulton commented Jul 20, 2016

Leoyzen commented Jul 21, 2016

ChadFulton commented Jul 21, 2016

github-advanced-security bot left a comment

Leoyzen commented Feb 28, 2024 •

edited

ChadFulton commented Feb 29, 2024

Leoyzen commented Mar 5, 2024

ENH: New Seasonal TimeSeries Model:TBATS/BATS/MTBATS #2892

Are you sure you want to change the base?

ENH: New Seasonal TimeSeries Model:TBATS/BATS/MTBATS #2892

Conversation

Leoyzen commented Apr 14, 2016

coveralls commented Apr 14, 2016

Choose a reason for hiding this comment

Choose a reason for hiding this comment

coveralls commented Jul 19, 2016 • edited

Choose a reason for hiding this comment

coveralls commented Jul 19, 2016 • edited

coveralls commented Jul 20, 2016 • edited

ChadFulton commented Jul 20, 2016

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Leoyzen commented Jul 20, 2016

coveralls commented Jul 20, 2016 • edited

Choose a reason for hiding this comment

ChadFulton commented Jul 20, 2016

ChadFulton commented Jul 20, 2016

ChadFulton commented Jul 20, 2016

Leoyzen commented Jul 21, 2016

ChadFulton commented Jul 21, 2016

github-advanced-security bot left a comment

Choose a reason for hiding this comment

Leoyzen commented Feb 28, 2024 • edited

ChadFulton commented Feb 29, 2024

Leoyzen commented Mar 5, 2024

coveralls commented Jul 19, 2016 •

edited

coveralls commented Jul 19, 2016 •

edited

coveralls commented Jul 20, 2016 •

edited

coveralls commented Jul 20, 2016 •

edited

Leoyzen commented Feb 28, 2024 •

edited