# Time Series Analysis
Time series analysis (TSA) studies how a response variable behaves over time, with time as the independent variable. When predicting or forecasting the target variable, the time variable serves as the reference point.

A time series is a sequence of observations recorded in time order at successive, discrete intervals. The interval can be years, months, weeks, days, hours, minutes, or seconds.

![image.png](attachment:image.png)

Real-world applications of TSA include weather forecasting models, stock market predictions, signal processing, and control systems.

![image-2.png](attachment:image-2.png)

## Objectives of Time Series Analysis
* To understand how a time series behaves and which factors affect a variable at different points in time.
* To gain insight into how the dataset's features change over time and what follows from those changes.
* To support predicting future values of the time series variable.
* Assumption: TSA rests on one main assumption, stationarity, which means that the statistical properties of the process do not depend on the origin of time.

**To perform time series analysis, follow these steps:**

* Collecting and cleaning the data
* Visualizing the key feature against time
* Observing the stationarity of the series
* Developing charts to understand its nature.
* Model building – AR, MA, ARMA and ARIMA
* Extracting insights from prediction


**Time series data supports several kinds of time-based analysis:**

* Forecasting: Predicting any value for the future.
* Segmentation: Grouping similar items together.
* Classification: Classifying a set of items into given classes.
* Descriptive analysis: Analyzing a given dataset to find out what it contains.
* Intervention analysis: Effect of changing a given variable on the outcome.

## Components of Time Series Analysis
The components of a time series are:

![image.png](attachment:image.png)

* Trend: A long-term movement in the data with no fixed interval, continuing across the timeline. A trend can be positive, negative, or null (no trend).
* Seasonality: Shifts that repeat at a regular, fixed interval across the timeline. The pattern may look like a bell curve or a sawtooth.
* Cyclical: Movements with no fixed interval, whose timing and pattern are uncertain.
* Irregularity: Unexpected situations, events, or scenarios that cause spikes over a short time span.
* Stationary: A time series whose statistical properties are the same over time is stationary; the properties remain the same anywhere in the series. A stationary series has a constant mean, variance, and covariance. Many time series models, including the ones listed above, assume the data is stationary.
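
As a rough illustration of how these components combine, the sketch below builds a synthetic series from a trend, a repeating seasonal pattern, and irregular noise, then separates them again with a centered moving average. The additive model, the 7-step period, and every numeric value are assumptions chosen for the example, not values from a real dataset.

```python
import numpy as np

rng = np.random.default_rng(0)
period = 7                      # assumed seasonal period (e.g. weekly pattern in daily data)
n = period * 60
t = np.arange(n)

# Build an additive series: trend + seasonality + irregular noise
pattern = np.array([-3.0, -1.0, 0.0, 2.0, 4.0, 0.0, -2.0])  # sums to zero over one period
trend = 0.05 * t
seasonal = np.tile(pattern, n // period)
noise = rng.normal(0.0, 0.5, n)
series = trend + seasonal + noise

# Estimate the trend with a centered moving average one full period wide.
# Averaging over a whole period cancels the zero-sum seasonal pattern.
half = period // 2
trend_est = np.convolve(series, np.ones(period) / period, mode="valid")

# Remove the estimated trend, then average by position within the period
detrended = series[half:n - half] - trend_est
phases = t[half:n - half] % period
seasonal_est = np.array([detrended[phases == p].mean() for p in range(period)])

# Whatever is left is the irregular component
residual = detrended - seasonal_est[phases]

print("true seasonal pattern:     ", pattern)
print("estimated seasonal pattern:", np.round(seasonal_est, 2))
```

The moving average loses `period // 2` points at each end, which is why the detrended series is shorter than the input.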

## Data Types of Time Series
There are two major types: stationary and non-stationary.

**Stationary**: The dataset follows the rules of thumb below, without the Trend, Seasonality, Cyclical, and Irregularity components of the time series.

* The mean should be constant over time.
* The variance should be constant over time.
* The covariance between two observations should depend only on the lag between them, not on the time at which they are observed.

**Non-Stationary**: If the mean, variance, or covariance changes with respect to time, the dataset is called non-stationary.

![image.png](attachment:image.png)
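
A quick informal check of these rules is to compare summary statistics across different parts of the series. The sketch below does this for two synthetic series; the helper `half_stats` and the generated data are assumptions made for illustration, and this comparison is only a visual-style sanity check, not a formal test.

```python
import numpy as np

rng = np.random.default_rng(42)
n = 400

# White noise: constant mean and variance, so it is stationary
stationary = rng.normal(0.0, 1.0, n)
# Linear trend plus noise: the mean drifts upward, so it is non-stationary
trending = 0.05 * np.arange(n) + rng.normal(0.0, 1.0, n)

def half_stats(x):
    """Return (mean, variance) of the first half and of the second half of x."""
    mid = len(x) // 2
    first, second = x[:mid], x[mid:]
    return (first.mean(), first.var()), (second.mean(), second.var())

for name, x in [("stationary", stationary), ("trending", trending)]:
    (m1, v1), (m2, v2) = half_stats(x)
    print(f"{name:>10}: mean {m1:6.2f} -> {m2:6.2f}, variance {v1:5.2f} -> {v2:5.2f}")
```

For the stationary series the two halves should report similar means, while the trending series shows a clear shift in mean between halves.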

## Methods to Check Stationarity

During the TSA model preparation workflow, we must assess whether the dataset is stationary or not. Two statistical tests are commonly used for this:

* Augmented Dickey-Fuller (ADF) Test
* Kwiatkowski-Phillips-Schmidt-Shin (KPSS) Test

### Augmented Dickey-Fuller (ADF) Test or Unit Root Test
The ADF test is one of the most popular statistical tests for stationarity. It uses the following hypotheses:

* Null Hypothesis (H0): Series has a unit root, so it is non-stationary
* Alternate Hypothesis (HA): Series is stationary
  * p-value > 0.05: Fail to reject H0
  * p-value <= 0.05: Reject H0 in favor of HA
  
### Kwiatkowski–Phillips–Schmidt–Shin (KPSS) Test
The KPSS test uses a null hypothesis (H0) that the time series is stationary around a deterministic trend, against the alternative of a unit root. This is the reverse of the ADF test, so a p-value <= 0.05 here is evidence that the series is non-stationary. Since the models used in TSA assume stationary data, we have to ensure that the dataset is stationary.

## Converting Non-Stationary Into Stationary

There are three methods available for this conversion – detrending, differencing, and transformation.

**Detrending**
It involves removing the trend effects from the given dataset and showing only the differences in values from the trend. This makes cyclical patterns easier to identify.

![image.png](attachment:image.png)
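
One simple way to detrend, assuming the trend is roughly linear, is to fit a straight line and subtract it. The sketch below uses `numpy.polyfit` on synthetic data; the linear-trend assumption and all values are illustrative, and a curved trend would need a higher-degree fit or a different method.

```python
import numpy as np

rng = np.random.default_rng(1)
t = np.arange(200)

# Synthetic series: linear trend + a cycle of length 50 + noise
series = 2.0 + 0.3 * t + 5.0 * np.sin(2 * np.pi * t / 50) + rng.normal(0.0, 0.5, t.size)

# Fit a degree-1 polynomial (straight line); polyfit returns highest degree first
slope, intercept = np.polyfit(t, series, deg=1)
fitted_trend = intercept + slope * t

# Keep only the deviations from the fitted trend
detrended = series - fitted_trend

print(f"estimated slope: {slope:.3f}")
print(f"mean of detrended series: {detrended.mean():.2e}")
```

After subtraction, the cycle of length 50 remains in `detrended` and is easier to see because the upward drift is gone.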

**Differencing**
Differencing transforms the series into a new time series made of the changes between consecutive observations. It removes the series' dependence on time and stabilizes the mean, which reduces trend and seasonality.

* $Y'_t = Y_t - Y_{t-1}$
* $Y_t$ = value at time $t$; $Y'_t$ = differenced value at time $t$

![image-2.png](attachment:image-2.png)
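
A small sketch of differencing with `numpy.diff` is shown below. The two example series are assumptions chosen so the effect is easy to verify by hand: one difference removes a linear trend, and two differences remove a quadratic one.

```python
import numpy as np

t = np.arange(10)

# Pure linear trend: 7, 10, 13, ...
linear = 3.0 * t + 7.0
first_diff = np.diff(linear)          # Y_t - Y_{t-1}; one element shorter than the input
print(first_diff)                     # every entry equals the slope, 3.0

# Quadratic trend: one difference still trends, a second difference is constant
quadratic = t.astype(float) ** 2
second_diff = np.diff(quadratic, n=2)
print(second_diff)                    # every entry equals 2.0
```

Each round of differencing shortens the series by one observation, since the first value has no predecessor to subtract.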

**Transformation**
This includes three different methods: Power Transform, Square Root, and Log Transform. The most commonly used one is the Log Transform.
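
To see why the log transform is popular, the sketch below applies it to a synthetic series whose fluctuations grow with its level. The exponential growth rate, the multiplicative noise, and the helper `half_std_ratio` are assumptions for illustration. A log transform needs strictly positive values.

```python
import numpy as np

rng = np.random.default_rng(7)
t = np.arange(1, 101)

# Exponential growth with multiplicative noise: swings get larger as the level rises
series = np.exp(0.05 * t) * rng.lognormal(mean=0.0, sigma=0.1, size=t.size)

log_series = np.log(series)    # log transform
sqrt_series = np.sqrt(series)  # square root transform (a milder power transform)

def half_std_ratio(x):
    """Spread of step-to-step changes in the second half relative to the first half."""
    d = np.diff(x)
    mid = d.size // 2
    return d[mid:].std() / d[:mid].std()

print(f"raw  series ratio: {half_std_ratio(series):.2f}")
print(f"sqrt series ratio: {half_std_ratio(sqrt_series):.2f}")
print(f"log  series ratio: {half_std_ratio(log_series):.2f}")
```

A ratio near 1 means the variability is about the same early and late in the series, which is what the log transform should achieve for this kind of data.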

## Moving Average Methodology
The Moving Average is a commonly used time series method. It smooths out random short-term variations, which makes the underlying components of the time series easier to see.

The **Moving Average (MA)**, or **Rolling Mean**, is calculated by averaging the values of the time series within a window of k periods.

The types of moving averages:

* Simple Moving Average (SMA)
* Cumulative Moving Average (CMA)
* Exponential Moving Average (EMA)

### Simple Moving Average (SMA)
The Simple Moving Average (SMA) calculates the unweighted mean of the previous M or N points. The size of the sliding window is chosen based on the amount of smoothing wanted: increasing M or N improves smoothing, but the average then reacts more slowly to recent changes and loses short-term detail.

![image.png](attachment:image.png)
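
A minimal sketch of the SMA follows, assuming the usual definition as the unweighted mean of the last `window` points. The function name `simple_moving_average` and the sample values are made up for illustration.

```python
import numpy as np

def simple_moving_average(values, window):
    """Unweighted mean over each run of `window` consecutive points."""
    values = np.asarray(values, dtype=float)
    if not 1 <= window <= values.size:
        raise ValueError("window must be between 1 and the number of values")
    kernel = np.ones(window) / window
    # mode="valid" keeps only positions where the full window fits
    return np.convolve(values, kernel, mode="valid")

prices = [1, 2, 3, 4, 5, 6]
sma3 = simple_moving_average(prices, 3)
print(sma3)  # means of (1,2,3), (2,3,4), (3,4,5), (4,5,6)
```

The result has `len(values) - window + 1` entries because the first `window - 1` points do not have a full window behind them.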

### Cumulative Moving Average (CMA)
The CMA is the unweighted mean of all past values up to the current time.

![image-2.png](attachment:image-2.png)
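
The CMA can be sketched as a running sum divided by the number of points seen so far. The function name `cumulative_moving_average` and the sample values are assumptions for illustration.

```python
import numpy as np

def cumulative_moving_average(values):
    """Mean of all values from the start of the series up to each position."""
    values = np.asarray(values, dtype=float)
    counts = np.arange(1, values.size + 1)  # 1, 2, 3, ... points seen so far
    return np.cumsum(values) / counts

readings = [1, 2, 3, 4]
cma = cumulative_moving_average(readings)
print(cma)  # means of (1), (1,2), (1,2,3), (1,2,3,4)
```

Unlike the SMA, the window here keeps growing, so every new point has less influence on the average than the one before it.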

### Exponential Moving Average (EMA)
EMA is mainly used to identify trends and filter out noise. The weight of each element decreases gradually the older it is, so recent data points get more weight than historical ones. Compared with SMA, the EMA reacts faster to changes and is more sensitive.

α is the smoothing factor.

* It has a value between 0 and 1.
* It represents the weighting applied to the most recent period.

![image-3.png](attachment:image-3.png)
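
A minimal sketch of the EMA follows, assuming the common recursive form EMA_t = α·x_t + (1 − α)·EMA_{t−1} with the first EMA value set to the first observation. The function name `exponential_moving_average`, the initialization choice, and the sample values are assumptions; other initializations are also in use.

```python
def exponential_moving_average(values, alpha):
    """EMA_t = alpha * x_t + (1 - alpha) * EMA_{t-1}, seeded with the first value."""
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in the interval (0, 1]")
    if len(values) == 0:
        return []
    ema = [float(values[0])]  # assumption: start the recursion at the first observation
    for x in values[1:]:
        ema.append(alpha * x + (1.0 - alpha) * ema[-1])
    return ema

data = [1, 2, 3]
print(exponential_moving_average(data, alpha=0.5))  # [1.0, 1.5, 2.25]
```

A larger α makes the average follow recent values more closely, while a smaller α gives a smoother line. For data held in pandas, `Series.ewm(alpha=..., adjust=False).mean()` follows the same recursion.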