# Time Series Analysis

## Introduction
Time series analysis is a set of statistical techniques for working with time series data, that is, observations recorded at successive points in time or over successive intervals. It is often used to identify and analyze trends. Data used in statistical analysis generally falls into three types:

1. Time series data: A set of observations on the values that a variable takes at different times.
2. Cross-sectional data: Data of one or more variables, collected at the same point in time.
3. Pooled data: A combination of time series data and cross-sectional data.


## Components of Time Series
### 1. Trend:
A trend exists when there is a long-term increase or decrease in the data. It does not have to be linear. Sometimes we will refer to a trend as "changing direction", when it might go from an increasing trend to a decreasing trend.

### 2. Stationarity:
A time series is said to be stationary if its statistical properties, such as mean and variance, remain constant over time. Most statistical forecasting methods are based on the assumption that the time series can be rendered approximately stationary (i.e., "stationarized") through the use of mathematical transformations. A stationarized series is relatively easy to predict: you simply predict that its statistical properties will be the same in the future as they have been in the past!
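
One common transformation for removing a trend is differencing. The sketch below is a minimal illustration, assuming a made-up series with a linear trend; the function name `difference` is hypothetical and not taken from any library.

```python
def difference(series, lag=1):
    """Return the lag-differenced series: x[t] - x[t - lag]."""
    return [series[t] - series[t - lag] for t in range(lag, len(series))]


# Hypothetical series with a linear trend, so its mean rises over time.
trended = [3 + 2 * t for t in range(10)]
stationarized = difference(trended)

print(trended[:5])        # [3, 5, 7, 9, 11]
print(stationarized[:5])  # [2, 2, 2, 2, 2]
```

After one round of differencing the linear trend is gone and the values no longer drift. Real data usually needs a check (visual or statistical) to confirm that the differenced series looks stationary.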

### 3. Seasonality:
A seasonal pattern exists when a series is influenced by seasonal factors (e.g., the quarter of the year, the month, or day of the week). Seasonality is always of a fixed and known period.
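
Because the period is fixed and known, a simple way to look at a seasonal pattern is to average the observations that fall at the same position in each cycle. This is a rough sketch on invented quarterly numbers; `seasonal_means` is a hypothetical helper name.

```python
from statistics import mean


def seasonal_means(series, period):
    """Average the observations that share the same position in the cycle."""
    return [mean(series[pos::period]) for pos in range(period)]


# Hypothetical quarterly data for 3 years, with Q4 consistently highest.
quarterly = [10, 12, 11, 20, 11, 13, 12, 21, 12, 14, 13, 22]
print(seasonal_means(quarterly, period=4))  # [11, 13, 12, 21]
```

This ignores any trend in the data, so it is only a first look; removing the trend first gives a cleaner seasonal estimate.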

### 4. Cyclic Pattern:
A cyclic pattern exists when data exhibit rises and falls that are not of a fixed period. These fluctuations usually last at least 2 years.

### 5. Autocorrelation: 
Autocorrelation is the correlation of a time series with a lagged copy of itself. It summarizes the strength of the relationship between an observation in a time series and the observations at prior time steps.
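
As an illustration, the usual sample autocorrelation at a given lag can be computed directly from its definition. The function below is a minimal sketch that assumes a non-constant series; the name `autocorrelation` and the example data are made up for this note.

```python
def autocorrelation(series, lag):
    """Sample autocorrelation of `series` at the given non-negative lag."""
    n = len(series)
    mean = sum(series) / n
    # Total variation around the mean (assumed non-zero).
    denom = sum((x - mean) ** 2 for x in series)
    # Co-variation between each value and the value `lag` steps earlier.
    num = sum((series[t] - mean) * (series[t - lag] - mean)
              for t in range(lag, n))
    return num / denom


# Hypothetical alternating series: neighbours move in opposite directions.
alternating = [1, -1] * 10
print(autocorrelation(alternating, 1))  # strongly negative
print(autocorrelation(alternating, 2))  # strongly positive
```

A value near 1 or -1 means that observations at that lag are strongly related; a value near 0 means little linear relationship.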


### 6. Lag:
A lag is a fixed offset in time between observations. In a lag plot, one set of observations in a time series is plotted (lagged) against a second, later set of data, for example each value against the value k time steps after it.
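
The pairs that a lag plot would show can be built with plain slicing. This is a small sketch; `lagged_pairs` is a hypothetical name and the data is invented.

```python
def lagged_pairs(series, lag):
    """Pair each observation with the one `lag` steps later: (x[t], x[t + lag])."""
    if lag < 1 or lag >= len(series):
        raise ValueError("lag must be between 1 and len(series) - 1")
    return list(zip(series[:-lag], series[lag:]))


values = [5, 7, 9, 11]
print(lagged_pairs(values, 1))  # [(5, 7), (7, 9), (9, 11)]
print(lagged_pairs(values, 2))  # [(5, 9), (7, 11)]
```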

### 7. White Noise:
White noise is a series of uncorrelated random values with a constant mean (usually zero) and a constant standard deviation. The theoretical autocorrelation function of white noise is zero at every nonzero lag, and equals 1 at lag 0.
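
To see this property on simulated data, the sketch below draws Gaussian white noise and prints its sample autocorrelation at a few lags. The seed, sample size, and helper name are arbitrary choices for illustration. In a finite sample the values at nonzero lags are close to zero rather than exactly zero.

```python
import random


def autocorrelation(series, lag):
    """Sample autocorrelation of `series` at the given non-negative lag."""
    n = len(series)
    mean = sum(series) / n
    denom = sum((x - mean) ** 2 for x in series)
    num = sum((series[t] - mean) * (series[t - lag] - mean)
              for t in range(lag, n))
    return num / denom


rng = random.Random(42)  # fixed seed so the run is repeatable
noise = [rng.gauss(0.0, 1.0) for _ in range(2000)]

for lag in range(4):
    print(lag, round(autocorrelation(noise, lag), 3))
```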

### 8. Moving Average:
A moving average is a calculation used to analyze data points by creating a series of averages of different subsets of the full data set. It is also called a moving mean (MM) or rolling mean and is a type of finite impulse response filter.
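
A simple (unweighted) moving average can be sketched in a few lines. The function name `moving_average` and the window size are illustrative choices, not a reference to a specific library.

```python
def moving_average(series, window):
    """Mean of each run of `window` consecutive values."""
    if window < 1 or window > len(series):
        raise ValueError("window must be between 1 and len(series)")
    return [sum(series[i:i + window]) / window
            for i in range(len(series) - window + 1)]


print(moving_average([1, 2, 3, 4, 5], window=3))  # [2.0, 3.0, 4.0]
```

The output is shorter than the input by `window - 1` values, because the first full window only becomes available at the `window`-th observation.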

### 9. ARIMA:
ARIMA stands for AutoRegressive Integrated Moving Average. It is a generalization of the simpler AutoRegressive Moving Average and adds the notion of integration. This acronym is descriptive, capturing the key aspects of the model itself. Briefly, they are:

- AR: Autoregression. A model that uses the dependent relationship between an observation and some number of lagged observations.
- I: Integrated. The use of differencing of raw observations (e.g. subtracting an observation from an observation at the previous time step) in order to make the time series stationary.
- MA: Moving Average. A model that uses the dependency between an observation and the residual errors (forecast errors) from previous time steps.
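
To make the AR and I parts concrete, here is a deliberately small sketch of a one-step forecast in the spirit of ARIMA(1, 1, 0): difference once, fit a single autoregressive coefficient by least squares with no intercept, then undo the differencing. The MA part is left out (q = 0), the helper names are hypothetical, and a real analysis would normally rely on a statistics library rather than this hand-rolled version.

```python
def difference(series):
    """First difference: x[t] - x[t - 1]."""
    return [cur - prev for prev, cur in zip(series, series[1:])]


def fit_ar1(values):
    """Least-squares estimate of phi in x[t] = phi * x[t - 1] + error."""
    num = sum(cur * prev for prev, cur in zip(values, values[1:]))
    den = sum(prev * prev for prev in values[:-1])
    return num / den


def forecast_next(series):
    """One-step forecast: difference, apply AR(1), then integrate back."""
    diffs = difference(series)      # "I": difference once (d = 1)
    phi = fit_ar1(diffs)            # "AR": one lagged term (p = 1)
    next_diff = phi * diffs[-1]     # predicted next change
    return series[-1] + next_diff   # undo the differencing


# Made-up series whose increments halve at every step: 8, 4, 2, 1, 0.5.
series = [0, 8, 12, 14, 15, 15.5]
print(fit_ar1(difference(series)))  # 0.5
print(forecast_next(series))        # 15.75
```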

### 10. Forecasting:
Forecasting is the process of making predictions of the future based on past and present data and most commonly by analysis of trends. A commonplace example might be estimation of some variable of interest at some specified future date.

### 11. Outliers:
An outlier is an observation that lies an abnormal distance from other values in a random sample from a population. In a time series, an outlier is a data point that is significantly different from other data points in the series.
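
One simple way to flag such points is to measure how far each value is from the median, in units of the median absolute deviation (MAD). This is a sketch under assumptions: the 3.5 threshold and the 0.6745 scaling constant are common conventions rather than something this note prescribes, the function name is hypothetical, and the method ignores trend and seasonality.

```python
from statistics import median


def mad_outliers(series, threshold=3.5):
    """Indices of points whose modified z-score exceeds `threshold`."""
    center = median(series)
    deviations = [abs(x - center) for x in series]
    mad = median(deviations)
    if mad == 0:
        return []  # no spread to compare against
    return [i for i, dev in enumerate(deviations)
            if 0.6745 * dev / mad > threshold]


readings = [10, 11, 10, 12, 11, 50, 10, 11]
print(mad_outliers(readings))  # [5]
```

For series with a strong trend or seasonal pattern, applying the same idea to the residuals after removing those components tends to work better than applying it to the raw values.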




## Time Series Analysis Techniques

### 1. EDA (Exploratory Data Analysis on Time Series)
Exploratory Data Analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. In time series analysis, EDA is used to understand the data, patterns, and relationships within the data.

### 2. Forecasting (Regression Analysis)
Regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome variable') and one or more independent variables (often called 'predictors', 'covariates', or 'features'). In time series analysis, regression analysis is used to forecast future values based on historical data.
- ARIMA (AutoRegressive Integrated Moving Average)
- SARIMA (Seasonal AutoRegressive Integrated Moving-Average)
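
As the simplest illustration of regression-based forecasting, the sketch below fits a straight line to the series by ordinary least squares, using the time index 0, 1, 2, ... as the only predictor, and extends it forward. The function names are hypothetical. ARIMA and SARIMA are considerably richer models that this example does not implement.

```python
def fit_linear_trend(series):
    """Least-squares line y = intercept + slope * t, with t = 0, 1, 2, ..."""
    n = len(series)
    t_mean = (n - 1) / 2
    y_mean = sum(series) / n
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(series))
    sxx = sum((t - t_mean) ** 2 for t in range(n))
    slope = sxy / sxx
    intercept = y_mean - slope * t_mean
    return intercept, slope


def forecast(series, steps):
    """Extend the fitted line `steps` time points past the end of the data."""
    intercept, slope = fit_linear_trend(series)
    n = len(series)
    return [intercept + slope * (n + h) for h in range(steps)]


history = [2, 4, 6, 8]
print(fit_linear_trend(history))   # (2.0, 2.0)
print(forecast(history, steps=2))  # [10.0, 12.0]
```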

### 3. Forecasting (Classification Analysis)
Classification is a supervised machine learning task in which a model predicts the category of a data point. In time series analysis, classification is used to predict the category of future data points (for example, whether the next value goes up or down) based on historical data.
- SVM (Support Vector Machine)
- Gradient Boosting
- XGBoost
- Neural Networks (NN)
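
Before any of these classifiers can be used, the series has to be reshaped into labelled examples. One possible framing, shown here as a sketch, uses the last few values as features and labels each window by whether the next value is higher than the last value in the window. The function name and the up/down labelling are assumptions made for illustration. No classifier is trained here.

```python
def make_direction_dataset(series, width):
    """Build (features, labels): label is 1 if the next value rises, else 0."""
    features, labels = [], []
    for end in range(width, len(series)):
        window = series[end - width:end]   # the `width` most recent values
        features.append(window)
        labels.append(1 if series[end] > window[-1] else 0)
    return features, labels


X, y = make_direction_dataset([1, 2, 3, 2, 4], width=2)
print(X)  # [[1, 2], [2, 3], [3, 2]]
print(y)  # [1, 0, 1]
```

The resulting `X` and `y` have the shape that typical classifiers expect. When splitting into training and test sets, keeping the time order (train on earlier windows, test on later ones) avoids leaking future information.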

### Time Series Decomposition
Time series decomposition involves thinking of a series as a combination of level, trend, seasonality, and noise components. Decomposition provides a useful abstract model for thinking about time series generally and for better understanding problems during time series analysis and forecasting.
![image.png](attachment:image.png)

1. Trends (Long-term direction of the series)
2. Seasonality (Regular pattern of up and down fluctuations)
3. Residual (What remains after the trend and seasonality are removed, i.e. the noise)
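
A classical additive decomposition can be sketched as follows: estimate the trend with a centered moving average, average the detrended values by position in the cycle to get the seasonal component, and treat what is left as the residual. This is a minimal version assuming an additive model, a known period, and at least two full cycles of data; the function names are hypothetical.

```python
def centered_moving_average(series, period):
    """Trend estimate; None where the window does not fit at the edges."""
    n, half = len(series), period // 2
    trend = [None] * n
    for t in range(half, n - half):
        window = series[t - half:t + half + 1]
        total = sum(window)
        if period % 2 == 0:
            # Even period: the two end points each get half weight.
            total -= 0.5 * (window[0] + window[-1])
        trend[t] = total / period
    return trend


def decompose_additive(series, period):
    """Split series into trend + seasonal + residual components."""
    trend = centered_moving_average(series, period)
    detrended = [x - tr if tr is not None else None
                 for x, tr in zip(series, trend)]
    pattern = []
    for pos in range(period):
        values = [d for d in detrended[pos::period] if d is not None]
        pattern.append(sum(values) / len(values))
    offset = sum(pattern) / period          # force the pattern to sum to zero
    pattern = [p - offset for p in pattern]
    seasonal = [pattern[t % period] for t in range(len(series))]
    residual = [x - tr - s if tr is not None else None
                for x, tr, s in zip(series, trend, seasonal)]
    return trend, seasonal, residual


# Made-up series: linear trend plus a repeating 4-step pattern, no noise.
series = [t + [1, -1, 2, -2][t % 4] for t in range(16)]
trend, seasonal, residual = decompose_additive(series, period=4)
print(trend[2:6])
print(seasonal[:4])
```

Because the example series contains no noise, the residual is zero wherever the trend is defined. With real data the residual shows what the trend and seasonal components fail to explain.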

### 4. Spectral Analysis
Spectral analysis is the process of determining the frequency domain representation of a time domain signal and most commonly employs the Fourier transform. The Fourier transform converts the time domain signal into the frequency domain representation.
- Cycles (Technique to identify periodicity in time series data)
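
As a sketch of how the Fourier transform can reveal periodicity, the code below computes a naive discrete Fourier transform and reports the period with the most power. It is written for clarity rather than speed (it takes on the order of n squared operations, whereas a fast Fourier transform would normally be used), and the function names are hypothetical.

```python
import cmath
import math


def power_spectrum(series):
    """Return (frequency index k, power) for k = 1 .. n // 2."""
    n = len(series)
    mean = sum(series) / n
    centered = [x - mean for x in series]  # remove the constant level
    spectrum = []
    for k in range(1, n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(centered))
        spectrum.append((k, abs(coeff) ** 2))
    return spectrum


def dominant_period(series):
    """Period (in time steps) of the strongest frequency component."""
    k, _ = max(power_spectrum(series), key=lambda item: item[1])
    return len(series) / k


# Synthetic signal that repeats every 8 steps.
signal = [math.sin(2 * math.pi * t / 8) for t in range(64)]
print(dominant_period(signal))  # 8.0
```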


## Types of Data about Time and Time Series:
### Major Types of Time Data:
1. `Time Series Data`: A set of observations on the values that a variable takes at different times. (e.g. Stock Prices, Weather Data, etc.)
2. `Cross-Sectional Data`: Data of one or more variables, collected at the same point in time. (e.g. Census Data, Survey Data, etc.)
3. `Pooled Data`: A combination of time series data and cross-sectional data. (e.g. Stock Market Data for many companies recorded over many days, etc.)

### Other Types of Time Data:
1. Time Series Data
2. Time Stamped Data
3. Time Duration Data   
4. Time Interval Data
5. Time of Day Data
6. Time of Year Data
7. Time of Month Data
8. Time of Week Data
9. Time of Hour Data
10. Time of Minute Data
11. Time of Second Data
12. Time of Millisecond Data
13. Time of Microsecond Data
14. Time of Nanosecond Data
15. Time of Picosecond Data
16. Time of Femtosecond Data
17. Time of Attosecond Data
18. Time of Zeptosecond Data
19. Time of Yoctosecond Data

