In [1]:
# EWMA

$\text{EWMA}_t = \alpha y_t + (1-\alpha)\text{EWMA}_{t-1}$

- $\text{EWMA}_t$ is the exponentially weighted moving average at the current time point $t$
- $\alpha$ is the weight given to $y_t$, with $0 < \alpha \le 1$; the prior average $\text{EWMA}_{t-1}$ receives the complementary weight $1-\alpha$
- $y_t$ is the actual value of the time-series at time point $t$
- $\text{EWMA}_{t-1}$ is the result of the EWMA equation for the prior time point
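The recursion above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function name `ewma` is hypothetical, and seeding the recursion with the first observation is an assumption, since the formula itself does not say how $\text{EWMA}_0$ is chosen.

```python
def ewma(values, alpha):
    """Return the EWMA series for `values` using smoothing factor `alpha`."""
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = []
    for y in values:
        if not smoothed:
            # Assumption: seed the recursion with the first observation.
            smoothed.append(float(y))
        else:
            # EWMA_t = alpha * y_t + (1 - alpha) * EWMA_{t-1}
            smoothed.append(alpha * y + (1 - alpha) * smoothed[-1])
    return smoothed


series = [10.0, 12.0, 11.0, 15.0]
print(ewma(series, alpha=0.5))  # [10.0, 11.0, 11.0, 13.0]
```

A larger `alpha` makes the average react faster to new observations; a smaller one gives a smoother series.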

In [2]:
# Regression

<h4>$y = mx + b$</h4>

- $y$ is the predicted value
- $m$ is the slope of the regression line
- $x$ is the value of the predictor (independent variable)
- $b$ is the bias, i.e., intercept
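As a rough sketch, $m$ and $b$ can be estimated from data by ordinary least squares. The formula above only gives the form of the line, so the choice of least squares and the function name `fit_line` are assumptions made for this illustration.

```python
def fit_line(xs, ys):
    """Estimate slope m and intercept b of y = m*x + b by least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope is the co-variation of x and y divided by the variation of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    # The fitted line always passes through the point of means.
    b = mean_y - m * mean_x
    return m, b


m, b = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
print(m, b)  # 2.0 1.0
print(m * 4 + b)  # predicted y for x = 4
```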

In [3]:
# Autoregression

<h4>$y_{t} = c + \phi_{1}y_{t-1} + \phi_{2}y_{t-2} + \dots + \phi_{p}y_{t-p} + \varepsilon_{t}$</h4>

- $y_t$ is the value of the time-series at time point $t$
- $c$ is a constant (an intercept)
- $\phi_1, \phi_2, \ldots, \phi_p$ are the coefficients for lags $y_{t-1}, y_{t-2}, \ldots, y_{t-p}$
- $y_{t-1}, y_{t-2}, \ldots, y_{t-p}$ are the actual values at lags $t-1$, $t-2$, etc.
- $p$ is the number of lags included
- $\varepsilon_t$ is the error term at time $t$, i.e., the part of $y_t$ not explained by the lags
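To make the lag indexing concrete, here is a minimal sketch of a one-step AR($p$) prediction with the error term left out. The function name `ar_predict` and the coefficient values are hypothetical; in practice $c$ and the $\phi_i$ would be estimated from data.

```python
def ar_predict(history, c, phis):
    """Predict y_t from the last p values in `history` (oldest first)."""
    p = len(phis)
    if len(history) < p:
        raise ValueError("need at least p past values")
    # Reverse so that lags[0] is y_{t-1}, lags[1] is y_{t-2}, and so on.
    lags = history[::-1][:p]
    return c + sum(phi * y for phi, y in zip(phis, lags))


# AR(2) with illustrative coefficients: 0.5 + 0.6 * 3.0 + 0.3 * 2.0
print(ar_predict([1.0, 2.0, 3.0], c=0.5, phis=[0.6, 0.3]))
```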

In [4]:
# Autocorrelation

<h3>$\rho_k = \frac{ Cov(y_t, y_{t-k}) }{ \sqrt{ Var(y_t) \cdot Var(y_{t-k}) } }$</h3>

- $\rho_k$ is the autocorrelation at lag $k$, i.e., the correlation between $y_t$ and $y_{t-k}$
- $y_t$ is the value at time $t$
- $y_{t-k}$ is the value of the time series at time $t$ lagged by $k$ periods
- $Cov(y_t, y_{t-k})$ is the covariance between $y_t$ and $y_{t-k}$
- $Var(y_t)$ and $Var(y_{t-k})$ are the variances of $y_t$ and $y_{t-k}$ respectively
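One way to compute this from a sample is to correlate the series with a copy of itself shifted by $k$ periods. The sketch below follows the formula term by term; the function name `autocorr` is hypothetical, and note that many libraries use a slightly different estimator based on the overall mean and variance of the series.

```python
from math import sqrt


def autocorr(values, k):
    """Correlation between y_t and y_{t-k} over all available pairs."""
    if k < 1 or len(values) - k < 2:
        raise ValueError("need k >= 1 and at least two overlapping pairs")
    current = values[k:]   # y_t
    lagged = values[:-k]   # y_{t-k}
    n = len(current)
    mean_c = sum(current) / n
    mean_l = sum(lagged) / n
    cov = sum((a - mean_c) * (b - mean_l) for a, b in zip(current, lagged)) / n
    var_c = sum((a - mean_c) ** 2 for a in current) / n
    var_l = sum((b - mean_l) ** 2 for b in lagged) / n
    # Undefined (division by zero) if either segment is constant.
    return cov / sqrt(var_c * var_l)


print(autocorr([1, 2, 3, 4, 5], k=1))        # steadily rising series
print(autocorr([1, -1, 1, -1, 1, -1], k=1))  # alternating series
```

A steadily rising series gives a lag-1 value close to 1, while a strictly alternating series gives a value close to -1.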

In [5]:
# Single Exponential Smoothing

$\begin{align*}
\hat{y}_t & = \alpha y_{t-1} + (1-\alpha) \hat{y}_{t-1} \\
\hat{y}_{t+1} & = \alpha y_t + (1-\alpha) \hat{y}_t \\
\end{align*}$

- $\hat{y}_t$ and $\hat{y}_{t-1}$ are the predictions at time $t$ and $t-1$ respectively
- $\alpha$ is the smoothing constant: the most recent observation is weighted by $\alpha$, and the prior prediction is weighted by $1-\alpha$

$\begin{align*}
\hat{y}_3 & = \alpha y_2 + (1-\alpha) \ell_1 \\
 & = \alpha y_2 + (1-\alpha)[\alpha y_1 + (1-\alpha) \ell_0] \\
\hat{y}_4 & = \alpha y_3 + \alpha(1-\alpha) y_2 + \alpha(1-\alpha)^2 y_1 + (1-\alpha)^3 \ell_0
\end{align*}$

- $\hat{y}_3$ and $\hat{y}_4$ are the predictions for time-steps $t=3$ and $t=4$ respectively
- $\alpha$ is the smoothing constant
- $\ell_t$ is the smoothed forecast at time $t$

$\begin{align*}
  \text{Forecast equation}  && \hat{y}_{t+h} & = \ell_{t}\\
  \text{Smoothing equation} && \ell_{t}        & = \alpha y_{t} + (1 - \alpha)\ell_{t-1}
\end{align*}$

- $\hat{y}_{t+h}$ is the forecast for $h$ steps ahead of time point $t$; it equals $\ell_t$ for every $h$, so the forecast is flat
- $\ell_t$ is the smoothed forecast at time $t$
- $\alpha$ is the smoothing constant
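The forecast and smoothing equations can be sketched as a short loop. The function name `ses_level` is hypothetical, and the initial level $\ell_0$ is passed in explicitly because the equations do not say how it is chosen.

```python
def ses_level(values, alpha, initial_level):
    """Run the smoothing equation over `values` and return the final level."""
    level = initial_level  # l_0
    for y in values:
        # l_t = alpha * y_t + (1 - alpha) * l_{t-1}
        level = alpha * y + (1 - alpha) * level
    return level


# The forecast is the last level, for any horizon h.
forecast = ses_level([2.0, 4.0, 6.0], alpha=0.5, initial_level=2.0)
print(forecast)  # 4.5
```

With $\alpha = 0.5$ and $\ell_0 = 2$, the expanded form gives the same number: $0.5 \cdot 6 + 0.25 \cdot 4 + 0.125 \cdot 2 + 0.125 \cdot 2 = 4.5$.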

In [6]:
# Adjusted R^2

<h4>$\bar{R}^2 = 1 - \frac{(1-R^2)(T-1)}{T-k-1}$</h4>

- $R^2$ is the unadjusted coefficient of determination
- $T$ is the total number of time points (observations)
- $k$ is the number of predictors, not counting the intercept
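A small sketch of the adjustment, assuming $R^2$ has already been computed elsewhere. The function and argument names are hypothetical.

```python
def adjusted_r2(r2, n_obs, n_predictors):
    """Adjust R^2 for the number of predictors (T = n_obs, k = n_predictors)."""
    dof = n_obs - n_predictors - 1
    if dof <= 0:
        raise ValueError("need more observations than predictors plus one")
    return 1 - (1 - r2) * (n_obs - 1) / dof


# Same R^2, more predictors: the adjusted value drops.
print(adjusted_r2(0.9, n_obs=21, n_predictors=2))
print(adjusted_r2(0.9, n_obs=21, n_predictors=10))
```

The penalty grows with $k$, so adding a predictor only raises the adjusted value if it improves $R^2$ enough to offset the lost degree of freedom.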

In [None]:
# MSE

<h4>$$MSE = \frac{1}{n} \sum_{i=1}^n (y_i - \hat{y}_i)^2$$</h4>

In [7]:
# RMSE

<h4>$$RMSE = \sqrt{MSE} = \sqrt{ \frac{1}{n} \sum_{i=1}^n (y_i - \hat{y}_i)^2 }$$</h4>

In [8]:
# MAE

$$MAE = \frac{1}{n} \sum_{i=1}^n \left| y_i - \hat{y}_i \right|$$

In [None]:
# MAPE

<h4>$$MAPE = \frac{1}{n} \sum_{i=1}^n \left| \frac{y_i - \hat{y}_i}{y_i} \right| \times 100$$</h4>
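The four error metrics (MSE, RMSE, MAE, MAPE) can be sketched side by side. The function names are hypothetical, and MAE and MAPE use absolute errors, following their standard definitions, so that positive and negative errors do not cancel.

```python
from math import sqrt


def mse(actual, predicted):
    """Mean of the squared errors."""
    return sum((y - y_hat) ** 2 for y, y_hat in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Square root of the MSE, in the same units as the data."""
    return sqrt(mse(actual, predicted))


def mae(actual, predicted):
    """Mean of the absolute errors."""
    return sum(abs(y - y_hat) for y, y_hat in zip(actual, predicted)) / len(actual)


def mape(actual, predicted):
    """Mean absolute error as a percentage of the actual values."""
    # Undefined when any actual value is zero.
    return 100 * sum(abs((y - y_hat) / y) for y, y_hat in zip(actual, predicted)) / len(actual)


actual = [100.0, 200.0, 400.0]
predicted = [110.0, 190.0, 400.0]
for metric in (mse, rmse, mae, mape):
    print(metric.__name__, metric(actual, predicted))
```

In this example the errors are -10, 10, and 0, so a signed average would report zero error even though two of the three predictions are off.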

In [9]:
# Double Exp Smoothing

$\begin{align*}
  \text{Forecast equation}&& \hat{y}_{t+h} &= \ell_{t} + hb_{t} \\
  \text{Level equation}   && \ell_{t} &= \alpha y_{t} + (1 - \alpha)(\ell_{t-1} + b_{t-1})\\
  \text{Trend equation}   && b_{t}    &= \beta^*(\ell_{t} - \ell_{t-1}) + (1 -\beta^*)b_{t-1}
\end{align*}$

- $\hat{y}_{t+h}$ is the prediction for $h$ steps ahead of time point $t$
- $\ell_t$ is the level (the smoothed estimate of the series) at time $t$
- $h$ is the number of time steps ahead of $t$ to forecast for
- $b_t$ is an estimate of the slope (trend) at time point $t$
- $\alpha$ is the smoothing parameter for the level
- $\beta^*$ is the smoothing parameter for the trend
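The three equations can be sketched as a single pass over the data. The function name `holt_forecast` is hypothetical, and the initial level and trend are passed in explicitly because the equations do not specify how to choose them.

```python
def holt_forecast(values, alpha, beta, h, initial_level, initial_trend):
    """Return the h-step-ahead forecast after smoothing over `values`."""
    level, trend = initial_level, initial_trend
    for y in values:
        prev_level = level
        # Level equation: blend the observation with the previous one-step forecast.
        level = alpha * y + (1 - alpha) * (prev_level + trend)
        # Trend equation: blend the latest change in level with the previous trend.
        trend = beta * (level - prev_level) + (1 - beta) * trend
    # Forecast equation: extend the last level along the last trend.
    return level + h * trend


# A perfectly linear series (y_t = 2t) with matching initial values.
print(holt_forecast([2.0, 4.0, 6.0], alpha=0.5, beta=0.5, h=2,
                    initial_level=0.0, initial_trend=2.0))  # 10.0
```

Because the series is exactly linear and the initial trend matches its slope, the level tracks the data and the forecast continues the line.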

In [10]:
# Damped Double Exp Smoothing

$\begin{align*}
  \text{Forecast equation}&& \hat{y}_{t+h} &= \ell_{t} + (\phi+\phi^2 + \dots + \phi^{h}) ~b_{t} \\
  \text{Level equation}   && \ell_{t} &= \alpha y_{t} + (1 - \alpha)(\ell_{t-1} + \phi ~b_{t-1})\\
  \text{Trend equation}   && b_{t} &= \beta^*(\ell_{t} - \ell_{t-1}) + (1 -\beta^*) ~\phi ~b_{t-1} \\
\end{align*}$

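- $\phi$ is the damping parameter, with $0 < \phi < 1$; setting $\phi = 1$ recovers the undamped double exponential smoothing equations
- The remaining symbols are as defined for double exponential smoothing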
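A minimal sketch of the damped equations follows. The function name `damped_holt_forecast` is hypothetical, and the initial level and trend are supplied by the caller as an assumption.

```python
def damped_holt_forecast(values, alpha, beta, phi, h, initial_level, initial_trend):
    """Return the h-step-ahead forecast using a damped trend."""
    level, trend = initial_level, initial_trend
    for y in values:
        prev_level = level
        # Level equation: the previous trend is shrunk by phi before use.
        level = alpha * y + (1 - alpha) * (prev_level + phi * trend)
        # Trend equation: the carried-over trend is also shrunk by phi.
        trend = beta * (level - prev_level) + (1 - beta) * phi * trend
    # Forecast equation: phi + phi^2 + ... + phi^h multiplies the trend.
    damping = sum(phi ** i for i in range(1, h + 1))
    return level + damping * trend


data = [2.0, 4.0, 6.0]
undamped = damped_holt_forecast(data, 0.5, 0.5, 1.0, 2, 0.0, 2.0)
damped = damped_holt_forecast(data, 0.5, 0.5, 0.9, 2, 0.0, 2.0)
print(undamped, damped)
```

With `phi=1.0` the result matches the undamped method; with `phi=0.9` the forecast for this rising series comes out lower, since each further step adds less trend.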