# 🌍 COVID-19 Global Trends Analysis

This notebook analyzes COVID-19 cases, deaths, and vaccination trends across selected countries using data from [Our World in Data](https://ourworldindata.org/coronavirus).

## 1️⃣ Data Loading

In [None]:
import pandas as pd

# Load the dataset (ensure 'owid-covid-data.csv' is in your working directory)
df = pd.read_csv("owid-covid-data.csv")

# Preview the dataset
df.shape, df.columns.to_list(), df.head()
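
The full file carries many more columns than this notebook uses. One option is to read only the needed columns and parse dates at load time. The following is a minimal sketch, assuming the column names used later in this notebook; `load_covid_data` and `USED_COLUMNS` are hypothetical names, and the inline CSV is made-up sample data standing in for the real file.

```python
import io
import pandas as pd

# Columns used later in this notebook (assumed to exist in the CSV).
USED_COLUMNS = [
    "location", "date", "total_cases", "new_cases",
    "total_deaths", "new_deaths", "total_vaccinations",
]

def load_covid_data(source):
    """Read only the needed columns and parse 'date' while loading.

    `source` can be a file path or any file-like object.
    """
    return pd.read_csv(source, usecols=USED_COLUMNS, parse_dates=["date"])

# Made-up sample rows standing in for 'owid-covid-data.csv'.
sample_csv = io.StringIO(
    "iso_code,location,date,total_cases,new_cases,total_deaths,new_deaths,total_vaccinations\n"
    "KEN,Kenya,2021-03-01,100,5,2,0,\n"
    "IND,India,2021-03-01,200,10,4,1,50\n"
)
sample_df = load_covid_data(sample_csv)
print(sample_df.dtypes)
```

With the real file, the call would be `load_covid_data("owid-covid-data.csv")`; the later `pd.to_datetime` step then becomes a no-op.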

## 2️⃣ Data Cleaning

In [None]:
# Focus on selected countries (.copy() avoids SettingWithCopyWarning on later assignments)
countries = ['Kenya', 'India', 'United States']
df = df[df['location'].isin(countries)].copy()

# Drop missing dates
df = df.dropna(subset=['date'])

# Convert date to datetime
df['date'] = pd.to_datetime(df['date'])

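# Fill missing numeric values.
# Cumulative totals are carried forward within each country so reporting gaps
# do not appear as drops to zero; daily counts and leading gaps are filled with 0.
df = df.sort_values(['location', 'date'])
cumulative_cols = ['total_cases', 'total_deaths', 'total_vaccinations']
daily_cols = ['new_cases', 'new_deaths']
df[cumulative_cols] = df.groupby('location')[cumulative_cols].ffill().fillna(0)
df[daily_cols] = df[daily_cols].fillna(0)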

df.head()
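
Cumulative columns such as `total_vaccinations` are often reported irregularly, so the way gaps are filled changes the shape of the plotted curve. The sketch below compares zero-fill with a per-country forward-fill. The numbers are made up for illustration and are not taken from the dataset.

```python
import pandas as pd

NAN = float("nan")  # marks a day with no report

# Made-up cumulative series with reporting gaps.
demo = pd.DataFrame({
    "location": ["Kenya"] * 4 + ["India"] * 4,
    "date": pd.to_datetime(
        ["2021-03-01", "2021-03-02", "2021-03-03", "2021-03-04"] * 2
    ),
    "total_vaccinations": [NAN, 10, NAN, 30, 100, NAN, NAN, 400],
})

# Zero-fill: every gap becomes 0, so the cumulative curve appears to drop.
zero_filled = demo["total_vaccinations"].fillna(0)

# Forward-fill within each country, then 0 for gaps before the first report.
forward_filled = (
    demo.sort_values(["location", "date"])
        .groupby("location")["total_vaccinations"]
        .ffill()
        .fillna(0)
)

# Both Series keep the original row labels, so they align by index here.
print(pd.DataFrame({"zero_filled": zero_filled, "forward_filled": forward_filled}))
```

Forward-filling keeps each country's cumulative total non-decreasing, which is usually what a "total over time" line plot is meant to show.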

## 3️⃣ Exploratory Data Analysis (EDA)

In [None]:
import matplotlib.pyplot as plt

plt.figure(figsize=(12,6))
for country in countries:
    subset = df[df['location'] == country]
    plt.plot(subset['date'], subset['total_cases'], label=country)
plt.title("Total COVID-19 Cases Over Time")
plt.xlabel("Date")
plt.ylabel("Total Cases")
plt.legend()
plt.grid(True)
plt.show()

In [None]:
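# Calculate death rate (cumulative deaths / cumulative cases).
# Rows with zero cases would give NaN or inf, so they are set to 0.
cases = df['total_cases'].where(df['total_cases'] > 0)
df['death_rate'] = (df['total_deaths'] / cases).fillna(0)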
df[['location', 'date', 'death_rate']].tail()
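
`tail()` on the full frame shows only the last few rows, which may all belong to a single country. A per-country summary of the most recent death rate can be sketched as follows. `latest_death_rate` is a hypothetical helper, and the sample values are made up rather than taken from the dataset.

```python
import pandas as pd

def latest_death_rate(frame):
    """Return the most recent death rate (deaths / cases) per country."""
    # Last row per country after ordering by date.
    latest = frame.sort_values("date").groupby("location").tail(1)
    # Guard against division by zero: zero cases become NaN, then rate 0.
    cases = latest["total_cases"].where(latest["total_cases"] > 0)
    rate = (latest["total_deaths"] / cases).fillna(0)
    return pd.Series(
        rate.to_numpy(), index=latest["location"].to_numpy(), name="death_rate"
    )

# Made-up sample values for illustration only.
sample = pd.DataFrame({
    "location": ["Kenya", "Kenya", "India", "India", "United States"],
    "date": pd.to_datetime(
        ["2021-01-01", "2021-01-02", "2021-01-01", "2021-01-02", "2021-01-01"]
    ),
    "total_cases": [100, 200, 1000, 2000, 0],
    "total_deaths": [2, 5, 10, 30, 0],
})

rates = latest_death_rate(sample)
print(rates)
```

In the notebook, the same helper could be called as `latest_death_rate(df)` after the cleaning step.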

## 4️⃣ Vaccination Progress

In [None]:
plt.figure(figsize=(12,6))
for country in countries:
    subset = df[df['location'] == country]
    plt.plot(subset['date'], subset['total_vaccinations'], label=country)
plt.title("Vaccination Progress Over Time")
plt.xlabel("Date")
plt.ylabel("Total Vaccinations")
plt.legend()
plt.grid(True)
plt.show()

## 5️⃣ Key Insights & Observations

- Kenya’s vaccine rollout started later than those of India and the U.S.
- The death rate (total deaths divided by total cases) decreased as total vaccinations increased.
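
The rollout-timing observation above can be checked directly instead of being read off the plot. The following is a minimal sketch, assuming "rollout start" means the first date on which `total_vaccinations` is greater than zero; `first_vaccination_date` is a hypothetical helper, and the sample dates are made up, not real figures.

```python
import pandas as pd

def first_vaccination_date(frame):
    """Return, per country, the earliest date with total_vaccinations > 0."""
    vaccinated = frame[frame["total_vaccinations"] > 0]
    return vaccinated.groupby("location")["date"].min()

# Made-up sample rows for illustration only (not real rollout dates).
sample = pd.DataFrame({
    "location": ["Kenya", "Kenya", "India", "India", "United States", "United States"],
    "date": pd.to_datetime([
        "2021-03-01", "2021-03-05",
        "2021-01-10", "2021-01-16",
        "2020-12-10", "2020-12-14",
    ]),
    "total_vaccinations": [0, 500, 0, 1000, 0, 2000],
})

first_dates = first_vaccination_date(sample)
print(first_dates.sort_values())
```

Running `first_vaccination_date(df)` on the cleaned notebook data would give the actual dates to compare.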