# Machine Temperature Dataset – Exploration

This notebook explores `machine_temperature_system_failure.csv`, a machine temperature time series from the Numenta Anomaly Benchmark (NAB).  
We’ll inspect the data, plot it, and compute rolling statistics to observe trends and variability.


In [None]:
# Import libraries
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

In [None]:
# Load the data, parse the timestamp column as datetimes, and use it as the index
url = "https://raw.githubusercontent.com/hjalalin/nab-anomaly-detection/main/data/realKnownCause/machine_temperature_system_failure.csv"
df = pd.read_csv(url, parse_dates=['timestamp'], index_col='timestamp')


## Quick Overview
Inspect the shape, column types, and statistical summary of the data.


In [None]:
print("Shape:", df.shape)
print()
print("Info:")
df.info()  # info() prints its report itself and returns None, so it is not wrapped in print()
print()
print("Statistical summary:")
print(df.describe())
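
Beyond shape and summary statistics, a time series usually deserves a quick check for missing values, duplicate timestamps, and irregular sampling. The sketch below is one possible way to do that; `summarize_quality` is a hypothetical helper, not part of the original notebook, and the `demo` frame holds made-up values with an assumed 5-minute spacing so the block runs without a download.

```python
import pandas as pd


def summarize_quality(frame: pd.DataFrame) -> dict:
    """Return basic data-quality facts for a frame indexed by timestamps."""
    # Time difference between consecutive rows
    steps = frame.index.to_series().diff().dropna()
    return {
        "missing_values": int(frame.isna().sum().sum()),
        "duplicate_timestamps": int(frame.index.duplicated().sum()),
        "is_sorted": bool(frame.index.is_monotonic_increasing),
        # Most frequent gap between rows, or None for a single-row frame
        "most_common_step": steps.mode().iloc[0] if not steps.empty else None,
    }


# Synthetic stand-in for df (hypothetical values, assumed 5-minute spacing)
demo = pd.DataFrame(
    {"value": [70.1, 70.4, None, 71.0]},
    index=pd.date_range("2024-01-01", periods=4, freq="5min"),
)
report = summarize_quality(demo)
print(report)
```

Calling `summarize_quality(df)` on the loaded data should report the same four facts for the real series.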


## Plot Time Series
Visualize the temperature trend over time to look for patterns or abnormalities.


In [None]:
plt.figure(figsize=(14, 4))
plt.plot(df.index, df['value'], label='Machine Temperature')
plt.title("Machine Temperature Over Time")
plt.xlabel("Time")
plt.ylabel("Temperature")  # the CSV carries no unit information; confirm the unit before labeling it
plt.legend()
plt.tight_layout()
plt.show()
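
The rolling statistics mentioned in the introduction could be computed along these lines. This is a minimal sketch: `add_rolling_stats` and the column names `rolling_mean` and `rolling_std` are assumptions introduced here, and the `demo` frame is synthetic so the block runs on its own.

```python
import pandas as pd


def add_rolling_stats(frame: pd.DataFrame, window: int, column: str = "value") -> pd.DataFrame:
    """Return a copy of frame with rolling mean and rolling std columns added."""
    out = frame.copy()
    roll = out[column].rolling(window)  # fixed-size window counted in rows
    out["rolling_mean"] = roll.mean()   # local trend
    out["rolling_std"] = roll.std()     # local variability (sample std)
    return out


# Synthetic stand-in for df (hypothetical values)
demo = pd.DataFrame(
    {"value": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]},
    index=pd.date_range("2024-01-01", periods=6, freq="5min"),
)
stats = add_rolling_stats(demo, window=3)
print(stats)
```

With the notebook's data, `add_rolling_stats(df, window=...)` would add the same two columns, which can then be drawn over the raw series with `plt.plot`. The first `window - 1` rows are `NaN` because the window is not yet full, and a suitable window size depends on the sampling interval of the data.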
