# Exploratory Data Analysis (EDA) on Currency Data

In this notebook, we perform exploratory data analysis on historical currency data to understand its structure, visualize trends, and identify patterns that may inform model development.

In [1]:
# Import necessary libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

# Set visualization style (set_theme is the current name; sns.set is an older alias)
sns.set_theme(style='whitegrid')

In [2]:
# Load the dataset
data_path = '../data/transformed_currency_data.csv'
currency_data = pd.read_csv(data_path)

# Display the first few rows of the dataset
currency_data.head()
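
Beyond the first few rows, a compact per-column overview helps confirm the structure of the data before plotting. The sketch below is illustrative only: `structure_overview` is a hypothetical helper, and the toy frame with its `rate` column is an assumption standing in for `currency_data`.

```python
import numpy as np
import pandas as pd

def structure_overview(df: pd.DataFrame) -> pd.DataFrame:
    """Summarize each column: dtype, non-null count, and number of distinct values."""
    return pd.DataFrame({
        'dtype': df.dtypes.astype(str),
        'non_null': df.notnull().sum(),
        'n_unique': df.nunique(),  # NaN is not counted as a distinct value
    })

# Hypothetical toy frame standing in for currency_data
toy_structure = pd.DataFrame({
    'Date': ['2024-01-01', '2024-01-02', '2024-01-03'],
    'rate': [1.10, np.nan, 1.10],
})
overview = structure_overview(toy_structure)
print(toy_structure.shape)
print(overview)
```

Calling `structure_overview(currency_data)` in the notebook would show, for instance, whether `Date` was read as text rather than as a datetime type.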

In [3]:
# Check for missing values
missing_values = currency_data.isnull().sum()
missing_values[missing_values > 0]
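
Raw counts are easier to judge next to the share of rows they represent. As a minimal sketch, assuming a hypothetical helper `summarize_missing` and a toy frame with made-up columns `rate_a` and `rate_b` (neither is part of the dataset), the check above could be extended like this:

```python
import numpy as np
import pandas as pd

def summarize_missing(df: pd.DataFrame) -> pd.DataFrame:
    """Return count and percentage of missing values per column, worst first."""
    counts = df.isnull().sum()
    summary = pd.DataFrame({
        'missing_count': counts,
        'missing_pct': (counts / len(df) * 100).round(2),
    })
    # Keep only the columns that actually contain gaps
    summary = summary[summary['missing_count'] > 0]
    return summary.sort_values('missing_count', ascending=False)

# Hypothetical toy frame standing in for currency_data
toy_missing = pd.DataFrame({
    'Date': pd.date_range('2024-01-01', periods=4, freq='D'),
    'rate_a': [1.10, np.nan, 1.12, np.nan],
    'rate_b': [0.90, 0.91, np.nan, 0.93],
})
missing_summary = summarize_missing(toy_missing)
print(missing_summary)
```

The same call on `currency_data` would list only the columns with gaps, which makes it easier to decide whether to drop, fill, or ignore them.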

In [4]:
# Visualize the distribution of currency exchange rates
plt.figure(figsize=(12, 6))
sns.histplot(currency_data['Exchange_Rate_Column'], bins=30, kde=True)
plt.title('Distribution of Currency Exchange Rates')
plt.xlabel('Exchange Rate')
plt.ylabel('Frequency')
plt.show()

In [5]:
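# Time series analysis of currency exchange rates
# Convert dates to datetime so the x-axis is spaced chronologically, not by string labels
dates = pd.to_datetime(currency_data['Date'])
plt.figure(figsize=(14, 7))
plt.plot(dates, currency_data['Exchange_Rate_Column'], label='Exchange Rate')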
plt.title('Currency Exchange Rate Over Time')
plt.xlabel('Date')
plt.ylabel('Exchange Rate')
plt.xticks(rotation=45)
plt.legend()
plt.show()
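
Day-to-day noise can hide the underlying trend in a raw line plot, and a rolling mean is one common way to smooth it. The following is a sketch under stated assumptions: `add_rolling_mean` is a hypothetical helper, the `rate` column and the window of 2 are chosen only for the toy example, and the notebook itself does not prescribe a smoothing method.

```python
import pandas as pd

def add_rolling_mean(df: pd.DataFrame, value_col: str, window: int = 7,
                     date_col: str = 'Date') -> pd.DataFrame:
    """Return a date-sorted copy of df with a rolling-mean column added."""
    out = df.copy()
    out[date_col] = pd.to_datetime(out[date_col])
    # Rolling windows are positional, so rows must be in chronological order
    out = out.sort_values(date_col).reset_index(drop=True)
    out[f'{value_col}_rolling_{window}'] = (
        out[value_col].rolling(window=window, min_periods=window).mean()
    )
    return out

# Hypothetical toy frame standing in for currency_data (deliberately unsorted)
toy_series = pd.DataFrame({
    'Date': ['2024-01-03', '2024-01-01', '2024-01-02', '2024-01-04'],
    'rate': [1.3, 1.1, 1.2, 1.4],
})
smoothed = add_rolling_mean(toy_series, 'rate', window=2)
print(smoothed)
```

The new column can be passed to `plt.plot` alongside the raw series. The first `window - 1` rows are left as NaN because `min_periods` equals the window size, so the smoothed line starts slightly later than the raw one.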

In [6]:
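# Correlation heatmap
plt.figure(figsize=(12, 10))
# Restrict to numeric columns; non-numeric columns such as 'Date' would otherwise raise an error in pandas 2.x
correlation_matrix = currency_data.corr(numeric_only=True)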
sns.heatmap(correlation_matrix, annot=True, fmt='.2f', cmap='coolwarm', square=True)
plt.title('Correlation Heatmap')
plt.show()
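
With many features, a heatmap can be hard to read at a glance, so listing the strongest pairs as a table is a useful companion. This is a minimal sketch: `top_correlations` is a hypothetical helper, and the columns `a`, `b`, and `c` are toy data, not columns from the dataset.

```python
import numpy as np
import pandas as pd

def top_correlations(df: pd.DataFrame, n: int = 5) -> pd.Series:
    """Return the n column pairs with the largest absolute Pearson correlation."""
    corr = df.corr(numeric_only=True)
    # Keep the upper triangle only, so each pair appears once and the diagonal is excluded
    mask = np.triu(np.ones(corr.shape, dtype=bool), k=1)
    pairs = corr.where(mask).stack().dropna()
    return pairs.sort_values(key=lambda s: s.abs(), ascending=False).head(n)

# Hypothetical toy frame standing in for currency_data
toy_corr = pd.DataFrame({
    'Date': ['2024-01-01', '2024-01-02', '2024-01-03', '2024-01-04', '2024-01-05'],
    'a': [1.0, 2.0, 3.0, 4.0, 5.0],
    'b': [2.0, 4.0, 6.0, 8.0, 10.0],
    'c': [5.0, 3.0, 4.0, 1.0, 2.0],
})
strongest = top_correlations(toy_corr, n=3)
print(strongest)
```

Sorting by absolute value keeps strong negative relationships near the top instead of burying them at the bottom of the list.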

## Conclusion

In this exploratory data analysis, we checked for missing values, visualized the distribution of currency exchange rates, analyzed trends over time, and examined correlations between features. These findings will guide the development of our predictive model.