## Overview of ARMA

ARMA stands for Autoregressive Moving Average. It is a widely used statistical model for forecasting stationary time series. ARMA combines two simpler models:

- **Autoregressive (AR) model:** This uses past values (lags) of the time series itself to predict future values.
- **Moving Average (MA) model:** This uses past forecast errors (residuals) to improve predictions.

The related ARIMA model (Autoregressive Integrated Moving Average) adds an "Integrated" step, which differences the series to make it stationary. ARMA has no such step and assumes the time series is already stationary.

In essence, ARMA models the current value of a time series as a linear combination of previous values and past errors. It is suited to data whose statistical properties, such as mean and variance, do not change over time.

## What is a Stationary Time Series?

A stationary time series is a sequence of data points whose mean and variance are constant over time and whose autocorrelation depends only on the lag between two observations, not on when they occur. This stability means that patterns in the data do not systematically change, making the process easier to model and forecast.

Non-stationary data often show trends or seasonal patterns. In ARMA modeling, the data needs to be stationary, which may require preprocessing steps like differencing to remove trends.
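
As a minimal sketch of the differencing step mentioned above (the function name `difference` is illustrative, not taken from any library), first-order differencing replaces each value with its change from the previous one. Applied to a toy series with a linear trend, it leaves a constant series:

```python
def difference(series, lag=1):
    """Return the lag-differenced series: x[t] - x[t - lag]."""
    return [series[t] - series[t - lag] for t in range(lag, len(series))]


# A toy series with a linear trend (its mean grows over time, so it is not stationary).
trend = [2 * t + 5 for t in range(6)]   # [5, 7, 9, 11, 13, 15]
print(difference(trend))                 # the trend is removed: every value is 2
```

Real data rarely becomes stationary this cleanly, so the differenced series should still be inspected before fitting an ARMA model.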

## Autoregressive (AR) Process

The AR process predicts the current value based on a linear combination of previous values. For an AR process of order p, denoted as AR(p), the current value is calculated using the last p observations weighted by coefficients (usually called phi coefficients).

For example, an AR(2) model predicts the current value based on the immediate past two values. The model "feeds back" on itself, meaning the series depends on its own history.

Higher-order AR models (larger p) include more past values, capturing longer-range dependencies in the data.
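
The AR(p) recursion described above can be sketched in plain Python. This is an illustration under stated assumptions, not a fitting routine: the names `ar_predict` and `simulate_ar`, the zero start-up values, and the Gaussian noise are all choices made here for the example.

```python
import random


def ar_predict(history, phi, c=0.0):
    """One-step AR(p) prediction: c + phi[0]*x[t-1] + ... + phi[p-1]*x[t-p]."""
    p = len(phi)
    return c + sum(phi[i] * history[-1 - i] for i in range(p))


def simulate_ar(phi, n, sigma=1.0, seed=0):
    """Simulate n values of a zero-mean AR(p) process driven by Gaussian noise."""
    rng = random.Random(seed)
    p = len(phi)
    x = [0.0] * p                      # zero start-up values (an assumption)
    for _ in range(n):
        x.append(ar_predict(x, phi) + rng.gauss(0.0, sigma))
    return x[p:]                       # drop the artificial start-up values


# AR(2): the prediction uses the two most recent observations.
print(ar_predict([1.0, 2.0], phi=[0.5, 0.25]))   # 0.5*2.0 + 0.25*1.0 = 1.25
series = simulate_ar([0.5, -0.3], n=200)
```

The most recent observation is weighted by the first coefficient, which is why the history is indexed from the end.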

## Moving Average (MA) Process

In contrast, an MA process models the current value based on past error terms rather than past values themselves. These errors are random shocks, assumed to be white noise and estimated in practice by the model's residuals. An MA process of order q, MA(q), uses the last q error terms weighted by coefficients (theta coefficients) to explain the current observation.

This lets the model account for shocks whose effect lingers briefly: a shock at one time step influences the next q observations and then drops out entirely.

An AR or MA process of order zero has no dependence on the past: it reduces to white noise (randomness without structure), plus a constant mean if one is included.
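
A rough sketch of an MA(q) process, assuming Gaussian white-noise shocks (the function names `simulate_ma` and `autocorr` are made up for this example). It also checks the sample autocorrelation of a simulated MA(1) series against the values theory predicts:

```python
import random


def simulate_ma(theta, n, mu=0.0, sigma=1.0, seed=0):
    """Simulate n values of an MA(q) process:
    x[t] = mu + e[t] + theta[0]*e[t-1] + ... + theta[q-1]*e[t-q]."""
    rng = random.Random(seed)
    q = len(theta)
    e = [rng.gauss(0.0, sigma) for _ in range(n + q)]   # white-noise shocks
    return [
        mu + e[t] + sum(theta[j] * e[t - 1 - j] for j in range(q))
        for t in range(q, n + q)
    ]


def autocorr(x, lag):
    """Sample autocorrelation of x at the given lag."""
    m = sum(x) / len(x)
    num = sum((x[t] - m) * (x[t - lag] - m) for t in range(lag, len(x)))
    den = sum((v - m) ** 2 for v in x)
    return num / den


ma1 = simulate_ma([0.8], n=5000)
print(round(autocorr(ma1, 1), 2))   # theory: 0.8 / (1 + 0.8**2), about 0.49
print(round(autocorr(ma1, 3), 2))   # theory: 0 for any lag beyond q = 1
```

With an empty `theta` list the same function produces plain white noise, the order-zero case.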

## Combining AR and MA: ARMA Process

An ARMA process of order (p, q), written ARMA(p, q), combines both the AR(p) and MA(q) processes. This means the model uses both lagged observations and past errors to predict the current value.
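
Written out, one common convention for this model is the following. The constant $c$ and the symbols $X_t$, $\varepsilon_t$, $\varphi_i$, and $\theta_j$ are notation introduced here, and some texts and software use the opposite sign on the $\theta_j$ terms:

$$
X_t = c + \sum_{i=1}^{p} \varphi_i X_{t-i} + \varepsilon_t + \sum_{j=1}^{q} \theta_j \varepsilon_{t-j}
$$

Here $\varepsilon_t$ is white noise, the first sum is the AR(p) part, and the second sum is the MA(q) part.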

- Setting q = 0 yields a pure AR model.
- Setting p = 0 yields a pure MA model.
- Choosing p and q suitably allows modeling a wide range of stationary time series more flexibly and accurately.
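
The special cases in the list above can be illustrated with a small simulator. This is a sketch only: `simulate_arma`, the zero start-up values, the Gaussian shocks, and the `burn_in` length are assumptions made for the example.

```python
import random


def simulate_arma(phi, theta, n, sigma=1.0, seed=0, burn_in=100):
    """Simulate a zero-mean ARMA(p, q) process:
    x[t] = sum_i phi[i]*x[t-1-i] + e[t] + sum_j theta[j]*e[t-1-j]."""
    rng = random.Random(seed)
    p, q = len(phi), len(theta)
    x = [0.0] * p                      # artificial start-up values
    e = [0.0] * q                      # artificial start-up shocks
    for _ in range(n + burn_in):
        shock = rng.gauss(0.0, sigma)
        ar_part = sum(phi[i] * x[-1 - i] for i in range(p))
        ma_part = sum(theta[j] * e[-1 - j] for j in range(q))
        x.append(ar_part + shock + ma_part)
        e.append(shock)
    return x[len(x) - n:]              # discard start-up and burn-in values


arma11 = simulate_arma(phi=[0.6], theta=[0.4], n=300)   # ARMA(1, 1)
pure_ar = simulate_arma(phi=[0.6], theta=[], n=300)     # q = 0: AR(1)
pure_ma = simulate_arma(phi=[], theta=[0.4], n=300)     # p = 0: MA(1)
```

Discarding a burn-in period reduces the influence of the artificial zero start-up values on the returned series.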

With suitable choices of p, q, and the phi and theta coefficients, an ARMA model can approximate the autocorrelation structure of a very broad class of zero-mean stationary processes to a desired accuracy.

## Understanding Model Order (p and q)

- **Order p** is the number of past values (lags) used in the AR part.
- **Order q** is the number of past errors used in the MA part.

Identifying the correct values of p and q is crucial for a good model.

## Tools for Order Identification: ACF and PACF

Two key analytical tools help identify the order of AR and MA components from observed data:

- **Autocorrelation Function (ACF):** Measures the correlation of a time series with its own lagged values. For an MA(q) process, the ACF cuts off after lag q: autocorrelations can be non-zero up to lag q and are theoretically zero at every higher lag. In a plot of sample data, "zero" means staying within the confidence bounds.
- **Partial Autocorrelation Function (PACF):** Measures the correlation between the time series and its lagged values after removing the effects of the intervening lags. For an AR(p) process, the PACF cuts off after lag p: partial autocorrelations can be non-zero up to lag p and are theoretically zero at every higher lag.

In simple terms:

- ACF helps identify the MA order (q).
- PACF helps identify the AR order (p).
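
To make the two functions concrete, here is a minimal pure-Python sketch. The sample ACF follows the usual definition. The PACF is computed from the autocorrelations with the Durbin-Levinson recursion, which is one standard method among several. The function names are chosen for this example.

```python
def acf(x, max_lag):
    """Sample autocorrelations r[0..max_lag] of the series x (r[0] is 1)."""
    n = len(x)
    m = sum(x) / n
    den = sum((v - m) ** 2 for v in x)
    return [
        sum((x[t] - m) * (x[t - k] - m) for t in range(k, n)) / den
        for k in range(max_lag + 1)
    ]


def pacf_from_acf(r):
    """Partial autocorrelations from autocorrelations r[0..K],
    using the Durbin-Levinson recursion."""
    out = [1.0]
    prev = []                          # AR coefficients of the order k-1 fit
    for k in range(1, len(r)):
        num = r[k] - sum(prev[j] * r[k - 1 - j] for j in range(k - 1))
        den = 1.0 - sum(prev[j] * r[j + 1] for j in range(k - 1))
        phi_kk = num / den             # partial autocorrelation at lag k
        prev = [prev[j] - phi_kk * prev[k - 2 - j] for j in range(k - 1)] + [phi_kk]
        out.append(phi_kk)
    return out


# The theoretical ACF of an AR(1) process with phi = 0.5 is 0.5**k.
r = [0.5 ** k for k in range(5)]
print(pacf_from_acf(r))   # lag 1 equals 0.5; lags 2 and beyond are 0 (the PACF cut-off)
```

On real data, `pacf_from_acf(acf(x, max_lag))` gives the sample PACF, whose values beyond the true AR order are small but not exactly zero.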

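For mixed ARMA models, both the ACF and the PACF tail off gradually instead of cutting off, so the plots alone rarely pin down p and q. Information criteria (such as AIC or BIC) and residual diagnostics are typically used to compare candidate orders.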
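
As a rough sketch of criterion-based order selection, the block below handles only the pure AR case, because estimating MA coefficients needs numerical optimization that is normally left to a statistics library. It fits AR(p) for each p from 0 to `max_p` with the Yule-Walker equations (solved by the Durbin-Levinson recursion) and scores each fit with an approximate AIC of n·log(σ²) + 2p. The function name and this simplified AIC form, which drops constant terms, are assumptions made for illustration.

```python
import math
import random


def ar_order_by_aic(x, max_p):
    """Return (best_p, aic_values) for AR(0..max_p) fitted via Yule-Walker."""
    n = len(x)
    m = sum(x) / n
    c0 = sum((v - m) ** 2 for v in x) / n          # sample variance
    r = [
        sum((x[t] - m) * (x[t - k] - m) for t in range(k, n)) / (n * c0)
        for k in range(max_p + 1)
    ]
    sigma2 = c0                                     # AR(0) innovation variance
    aic = [n * math.log(sigma2)]
    prev = []
    for k in range(1, max_p + 1):
        num = r[k] - sum(prev[j] * r[k - 1 - j] for j in range(k - 1))
        den = 1.0 - sum(prev[j] * r[j + 1] for j in range(k - 1))
        phi_kk = num / den
        prev = [prev[j] - phi_kk * prev[k - 2 - j] for j in range(k - 1)] + [phi_kk]
        sigma2 *= 1.0 - phi_kk ** 2                 # variance left after adding lag k
        aic.append(n * math.log(sigma2) + 2 * k)
    best_p = min(range(max_p + 1), key=lambda p: aic[p])
    return best_p, aic


# Simulate an AR(2) series (phi = 0.5, -0.3) and see which order AIC prefers.
rng = random.Random(42)
x = [0.0, 0.0]
for _ in range(3000):
    x.append(0.5 * x[-1] - 0.3 * x[-2] + rng.gauss(0.0, 1.0))
best_p, aic = ar_order_by_aic(x[2:], max_p=6)
print(best_p)   # at least 2 here; AIC can select a slightly higher order than the true one
```

BIC replaces the `2 * k` penalty with `k * math.log(n)`, which penalizes extra parameters more heavily on long series.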

## Practical Notes

- High-order AR or MA models (e.g., p or q = 10) are rare because distinguishing the exact order becomes difficult and models can overfit.
- The starting point for identifying orders p and q is usually to visually inspect ACF and PACF plots from the data.
- The ARMA model assumes a stationary time series. If the data are not stationary, differencing or other preprocessing is needed first (ARIMA builds the differencing step into the model).

## Summary

- The ARMA model is a flexible, powerful tool for modeling stationary time series by combining autoregressive and moving average components.
- It predicts the current value from past values and past errors.
- Order selection is facilitated by examining autocorrelations (ACF) and partial autocorrelations (PACF).
- ARMA can approximate a very broad class of zero-mean stationary time series given appropriate orders and coefficients.
