# Task 1: Data analysis workflow and understanding the model and data

# Import necessary libraries

In [None]:

import sys
import os
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from statsmodels.tsa.arima.model import ARIMA

# Add the ../scripts directory to sys.path so its modules can be imported
sys.path.append(os.path.abspath('../scripts'))

## Importing functions from 'preprocessing_and_eda.py' module

In [2]:
from preprocessing_and_eda import (
    load_and_preprocess_data,
    plot_price_trend,
    fit_arima_model,
    plot_residuals,
)

## Step 1: Loading and Preprocessing Data

In [None]:

data_path = "../data/BrentOilPrices.csv"
df = load_and_preprocess_data(data_path)
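
`load_and_preprocess_data` is defined in `preprocessing_and_eda.py` and is not shown in this notebook. As a rough sketch of what such a loader might do, the function below parses dates, coerces prices to numbers, and drops rows with missing values. The name `load_and_preprocess_sketch` is hypothetical, and the `Date` and `Price` column names are assumptions that should be checked against the actual CSV.

```python
import io
import pandas as pd


def load_and_preprocess_sketch(source):
    """Hypothetical stand-in for load_and_preprocess_data.

    Assumes the CSV has 'Date' and 'Price' columns; adjust to the real file.
    """
    df = pd.read_csv(source)
    # Parse dates; unparseable entries become NaT instead of raising.
    df["Date"] = pd.to_datetime(df["Date"], errors="coerce")
    # Coerce prices to numbers; non-numeric entries become NaN.
    df["Price"] = pd.to_numeric(df["Price"], errors="coerce")
    # Drop rows where either field is missing, then index and sort by date.
    df = df.dropna(subset=["Date", "Price"])
    return df.set_index("Date").sort_index()


# Tiny in-memory sample with made-up values so the sketch runs without the data file.
sample_csv = io.StringIO(
    "Date,Price\n"
    "2020-01-03,68.60\n"
    "2020-01-02,66.25\n"
    "not-a-date,70.00\n"
    "2020-01-06,\n"
)
sample_df = load_and_preprocess_sketch(sample_csv)
print(sample_df)
```

Indexing by date keeps later steps (plotting, rolling statistics, time series models) from having to re-sort or re-parse the data.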


## Step 2: Exploratory Data Analysis (EDA)

### Plot price trend over time

In [None]:
plot_price_trend(df)
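
The body of `plot_price_trend` is not shown here either. A minimal sketch of a comparable trend plot follows, assuming a DataFrame with a `DatetimeIndex` and a `Price` column. The function name `plot_price_trend_sketch`, the 30-day window, and the axis label are illustrative assumptions, and the data is synthetic.

```python
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt


def plot_price_trend_sketch(df, window=30):
    """Hypothetical stand-in for plot_price_trend.

    Assumes a DataFrame with a DatetimeIndex and a 'Price' column.
    """
    fig, ax = plt.subplots(figsize=(12, 5))
    ax.plot(df.index, df["Price"], label="Daily price", linewidth=0.8)
    # A rolling mean smooths daily noise so the longer-term trend stands out.
    rolling = df["Price"].rolling(window=window, min_periods=1).mean()
    ax.plot(df.index, rolling, label=f"{window}-day rolling mean")
    ax.set_xlabel("Date")
    ax.set_ylabel("Price (USD per barrel)")  # unit assumed, not read from the data
    ax.set_title("Brent oil price over time")
    ax.legend()
    return ax


# Synthetic random-walk prices, used only so the sketch runs on its own.
rng = np.random.default_rng(0)
dates = pd.date_range("2020-01-01", periods=200, freq="D")
demo_df = pd.DataFrame({"Price": 60 + rng.normal(0, 1, 200).cumsum()}, index=dates)
ax = plot_price_trend_sketch(demo_df)
```

Returning the `Axes` object rather than calling `plt.show()` inside the function makes it easier to annotate the plot later, for example with event markers.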

## Step 3: Define Analysis Workflow and Assumptions

### Data Analysis Workflow
1. Load and Preprocess Data: Load the Brent oil prices data and preprocess it (parsing dates, handling missing values).
2. Exploratory Data Analysis (EDA): Explore general trends and statistical properties of the dataset.
3. Model Selection: Evaluate suitable time series models, such as ARIMA and GARCH.
4. Change Point Analysis: Detect major changes or shifts in Brent oil prices, potentially using models like Bayesian change point detection.
5. Interpretation and Insights: Generate insights on price fluctuations and prepare reports for stakeholders.
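
To make step 4 more concrete, here is a deliberately simplified sketch of change point detection. It is not the Bayesian approach mentioned above: it swaps in a least-squares search for a single shift in the mean, which is enough to show the idea. The function name `single_change_point` and the synthetic series are assumptions for illustration.

```python
import numpy as np


def single_change_point(values):
    """Return the index that best splits `values` into two constant-mean segments.

    Non-Bayesian simplification: picks the split minimizing total squared error.
    """
    values = np.asarray(values, dtype=float)
    n = len(values)
    best_idx, best_cost = None, np.inf
    # Require at least two points on each side of the split.
    for k in range(2, n - 1):
        left, right = values[:k], values[k:]
        cost = ((left - left.mean()) ** 2).sum() + ((right - right.mean()) ** 2).sum()
        if cost < best_cost:
            best_idx, best_cost = k, cost
    return best_idx


# Synthetic series with a mean shift at index 60 (for illustration only).
rng = np.random.default_rng(42)
series = np.concatenate([rng.normal(50, 1, 60), rng.normal(70, 1, 40)])
cp = single_change_point(series)
print("Estimated change point index:", cp)
```

A Bayesian treatment would instead place a prior on the change point location and the segment parameters and report a posterior distribution, which also conveys how uncertain the detected date is.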

### Assumptions and Limitations
- The data is assumed to be a reliable representation of historical Brent oil prices.
- Only historical prices are used; other economic factors are not directly included in this dataset.
- Forecasts built on historical prices cannot anticipate unforeseen political or economic events, which limits their reliability.

## Step 4: ARIMA Model Explanation and Fitting

### Fit the model and summarize results

In [None]:
arima_model = fit_arima_model(df)

## Step 5: Residuals Analysis

### Plot residuals of the model

In [None]:
plot_residuals(arima_model)
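
Beyond plotting, a quick numerical check is whether the residuals still show autocorrelation; if the model fits well, they should resemble white noise. The sketch below computes sample autocorrelations with NumPy and compares them to an approximate 95% band. The function name `residual_autocorr` is hypothetical, and the residuals here are simulated rather than taken from `arima_model`.

```python
import numpy as np


def residual_autocorr(residuals, max_lag=10):
    """Sample autocorrelation of residuals at lags 1..max_lag."""
    r = np.asarray(residuals, dtype=float)
    r = r - r.mean()
    denom = (r ** 2).sum()
    return np.array(
        [(r[lag:] * r[:-lag]).sum() / denom for lag in range(1, max_lag + 1)]
    )


# Simulated white-noise residuals as a stand-in for a well-specified model's output.
rng = np.random.default_rng(7)
resid = rng.normal(0, 1, 500)
acf_vals = residual_autocorr(resid)
# Approximate 95% band for white noise; values outside it hint at leftover structure.
band = 1.96 / np.sqrt(len(resid))
print("Lags outside the band:", np.flatnonzero(np.abs(acf_vals) > band) + 1)
```

With a fitted statsmodels result, the same function could be applied to its `resid` attribute, assuming `fit_arima_model` returns such an object.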

## Step 6: Summary and Next Steps

- We've outlined a data analysis workflow and performed basic EDA.
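- An ARIMA model was fitted to understand potential trends in the price series. Note that a non-seasonal ARIMA does not model seasonality; capturing it would require a seasonal specification.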
- Next steps include exploring additional models (e.g., GARCH) and implementing change point analysis to detect significant shifts.
- Assumptions, limitations, and reporting formats for stakeholders still need to be documented.
