# 🦠 COVID-19 Global Data Tracker
**Author**: Don  
**Description**: This notebook analyzes global COVID-19 data: cases, deaths, and vaccinations using Python (pandas, matplotlib, seaborn).

## 1️⃣ Data Collection
We use the [Our World in Data](https://ourworldindata.org/covid-data) COVID-19 dataset.
- File: `owid-covid-data.csv`
- Source: Our World in Data

In [None]:
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Load dataset
df = pd.read_csv("owid-covid-data.csv")

# Display column names and first few rows
print(df.columns)
df.head()
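
The OWID file is wide, so loading only the columns the notebook plots can keep memory use down. Below is a minimal sketch of that idea. It reads from an in-memory string instead of the real file so it runs anywhere; `csv_text`, `wanted`, and `extra_column` are made-up stand-ins, not part of the dataset.

```python
import io

import pandas as pd

# Made-up stand-in for a few rows of the CSV (not real OWID values)
csv_text = """location,date,total_cases,new_cases,extra_column
Country A,2021-01-01,100,5,x
Country A,2021-01-02,110,10,y
"""

# Columns we actually need; everything else is skipped at read time
wanted = ["location", "date", "total_cases", "new_cases"]

sample = pd.read_csv(
    io.StringIO(csv_text),   # swap in "owid-covid-data.csv" for the real file
    usecols=wanted,
    parse_dates=["date"],    # parse the date column while loading
)

print(sample.dtypes)
```

With the real file, the same `usecols` and `parse_dates` arguments would apply, provided the listed column names exist in your copy of the CSV.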

## 2️⃣ Data Exploration
- View column info and check for missing values.

In [None]:
df.info()
missing = df.isnull().sum()
missing[missing > 0]
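
Raw missing counts are hard to compare across columns, so a percentage view can help decide which columns are usable. A small sketch on a made-up frame; `missing_share` is a hypothetical helper, not a pandas function.

```python
import numpy as np
import pandas as pd

# Tiny illustrative frame (made-up values, not OWID data)
sample = pd.DataFrame({
    "location": ["Country A", "Country A", "Country B", "Country B"],
    "total_cases": [1.0, 2.0, np.nan, 4.0],
    "total_vaccinations": [np.nan, np.nan, np.nan, 10.0],
})


def missing_share(frame: pd.DataFrame) -> pd.Series:
    """Percent of missing values per column, largest first, gaps only."""
    share = frame.isnull().mean() * 100
    return share[share > 0].sort_values(ascending=False)


report = missing_share(sample)
print(report)
```

Calling `missing_share(df)` on the loaded dataset would give the same kind of report for the real columns.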

## 3️⃣ Data Cleaning
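- Convert the `date` column to datetime
- Filter to three countries: Kenya, USA (stored as `United States` in the `location` column), and India
- Drop rows where `total_cases` or `total_deaths` is missing
- Fill the remaining gaps in the columns used for plotting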

In [None]:
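df['date'] = pd.to_datetime(df['date'])
countries = ['Kenya', 'United States', 'India']
# .copy() avoids SettingWithCopyWarning on the assignments below
df = df[df['location'].isin(countries)].copy()
df = df.dropna(subset=['total_cases', 'total_deaths']).sort_values(['location', 'date'])
# Daily counts: treat unreported days as 0
df['new_cases'] = df['new_cases'].fillna(0)
# Cumulative vaccinations: carry the last reported value forward per country,
# then use 0 for dates before the first report (a blanket fillna(0) would make the curve drop to 0)
df['total_vaccinations'] = df.groupby('location')['total_vaccinations'].ffill().fillna(0)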
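
Cumulative columns such as `total_vaccinations` are only reported on some days, so how the gaps are filled changes the shape of the plot. Here is a minimal sketch, on made-up numbers, comparing a zero fill with carrying the last reported value forward within each country. The frame `toy` and the columns `zero_filled` and `carried_forward` are illustrative names only.

```python
import numpy as np
import pandas as pd

# Made-up cumulative series with reporting gaps (not OWID data)
toy = pd.DataFrame({
    "location": ["Country A"] * 4 + ["Country B"] * 4,
    "date": pd.to_datetime(
        ["2021-03-01", "2021-03-02", "2021-03-03", "2021-03-04"] * 2
    ),
    "total_vaccinations": [np.nan, 100.0, np.nan, 250.0,
                           500.0, np.nan, np.nan, 900.0],
})
toy = toy.sort_values(["location", "date"])

# Option 1: zero fill, which makes a cumulative curve fall back to 0 on gap days
toy["zero_filled"] = toy["total_vaccinations"].fillna(0)

# Option 2: forward fill per country, then 0 only before the first report
toy["carried_forward"] = (
    toy.groupby("location")["total_vaccinations"].ffill().fillna(0)
)

print(toy)
```

Only the forward-filled column stays non-decreasing, which is what a cumulative total should look like when plotted.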

## 4️⃣ Exploratory Data Analysis (EDA)
### Total COVID-19 Cases Over Time

In [None]:
plt.figure(figsize=(10, 6))
for country in countries:
    country_df = df[df['location'] == country]
    plt.plot(country_df['date'], country_df['total_cases'], label=country)

plt.title('Total COVID-19 Cases Over Time')
plt.xlabel('Date')
plt.ylabel('Total Cases')
plt.legend()
plt.grid(True)
plt.show()

### Daily New Cases Comparison

In [None]:
plt.figure(figsize=(10, 6))
for country in countries:
    country_df = df[df['location'] == country]
    plt.plot(country_df['date'], country_df['new_cases'], label=country)

plt.title('Daily New COVID-19 Cases')
plt.xlabel('Date')
plt.ylabel('New Cases')
plt.legend()
plt.grid(True)
plt.show()
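
Daily counts tend to be spiky because some countries report in batches, so a 7-day rolling average is often easier to compare. Below is a sketch on made-up numbers. `new_cases_7d` is a name introduced here for illustration; it is not assumed to exist in the dataset.

```python
import pandas as pd

dates = pd.date_range("2021-01-01", periods=10, freq="D")

# Made-up data: Country A reports 70 cases once a week, Country B reports 10 daily
toy = pd.DataFrame({
    "location": ["Country A"] * 10 + ["Country B"] * 10,
    "date": list(dates) * 2,
    "new_cases": [0, 70, 0, 0, 0, 0, 0, 0, 70, 0] + [10] * 10,
})
toy = toy.sort_values(["location", "date"])

# Rolling mean computed separately per country so windows never mix countries;
# min_periods=7 leaves the first six days of each country as NaN
toy["new_cases_7d"] = (
    toy.groupby("location")["new_cases"]
       .transform(lambda s: s.rolling(window=7, min_periods=7).mean())
)

print(toy)
```

Plotting `new_cases_7d` instead of `new_cases` with the same loop as above would give smoother lines; in this toy data both countries average 10 cases per day once a full week is available.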

## 5️⃣ Vaccination Progress
### Cumulative Vaccinations

In [None]:
plt.figure(figsize=(10, 6))
for country in countries:
    country_df = df[df['location'] == country]
    plt.plot(country_df['date'], country_df['total_vaccinations'], label=country)

plt.title('Cumulative COVID-19 Vaccinations')
plt.xlabel('Date')
plt.ylabel('Total Vaccinations')
plt.legend()
plt.grid(True)
plt.show()
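
Raw vaccination totals favour large countries, so normalising by population makes the comparison fairer. This sketch assumes a `population` column is available alongside `total_vaccinations` (check your copy of the CSV); the numbers are made up and `doses_per_100` is an illustrative name. Note that `total_vaccinations` counts doses, not people, so the result can exceed 100.

```python
import pandas as pd

# Made-up totals and populations (not OWID data)
toy = pd.DataFrame({
    "location": ["Country A", "Country B"],
    "total_vaccinations": [5_000_000.0, 700_000_000.0],
    "population": [50_000_000.0, 1_400_000_000.0],
})

# Doses administered per 100 people
toy["doses_per_100"] = toy["total_vaccinations"] * 100 / toy["population"]

print(toy[["location", "doses_per_100"]])
```

Here Country B has administered far more doses in absolute terms, and the per-100 figure shows how much of that gap remains after accounting for population.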

## 6️⃣ Insights & Reporting
**Key Findings**:
- 📌 The USA had the highest cumulative cases among the selected countries, followed by India.
- 📌 The USA had the fastest vaccination rollout initially.
- 📌 Kenya showed a slower vaccination increase over time.
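
Findings like these can be checked directly from the data rather than read off the plots. Here is a minimal sketch, on made-up numbers with placeholder country names, of ranking countries by their most recent cumulative case count; `latest` and `top_country` are names introduced for illustration.

```python
import pandas as pd

# Made-up cumulative cases for three placeholder countries (not OWID data)
toy = pd.DataFrame({
    "location": ["Country A"] * 2 + ["Country B"] * 2 + ["Country C"] * 2,
    "date": pd.to_datetime(["2021-01-01", "2021-01-02"] * 3),
    "total_cases": [100.0, 150.0, 400.0, 900.0, 20.0, 30.0],
})

# Latest reported value per country, ranked from highest to lowest
latest = (
    toy.sort_values("date")
       .groupby("location")["total_cases"]
       .last()
       .sort_values(ascending=False)
)

top_country = latest.idxmax()
print(latest)
print("Highest cumulative cases:", top_country)
```

Running the same chain on the cleaned `df` would show which of the three selected countries leads in the actual data.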