# Introduction

This project builds a set of clear, accessible visualizations and interactive dashboards from the COVID-19 dataset published by *Our World in Data*. It shows how the pandemic unfolded across countries and continents, what role demographics played, and how vaccination progressed.

---

## Dataset

- Source: [Our World in Data – COVID-19 dataset](https://ourworldindata.org/covid-cases)  
- Scope: global coverage of COVID-19 cases, deaths, testing, hospitalizations, and vaccinations, enriched with demographic and economic indicators (population, life expectancy, GDP per capita, etc.).

---

## Project Stages

- **00_Introduction** – project overview and structure  
- **01_DataQuality** – completeness and quality checks, duplicates  
- **02_BasicGraphs** – first visual exploration (static and interactive charts)  
- **03_Dashboard1** – COVID-19 world map  
- **04_Dashboard2** – country trends over time  
- **05_Dashboard3** – top N countries in vaccinations (absolute vs. per capita, with a slider for N)  
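
As an illustration of the checks in the **01_DataQuality** stage, here is a minimal sketch of a completeness and duplicate check in pandas. The toy rows and the column names (`location`, `date`, `new_cases`) are assumptions made for this example; the modified dataset in this repository may use different names.

```python
import pandas as pd

# Toy rows standing in for the real dataset (values are made up;
# column names are assumptions and may differ in the actual file).
df = pd.DataFrame({
    "location": ["A", "A", "B", "B"],
    "date": ["2021-01-01", "2021-01-01", "2021-01-01", "2021-01-02"],
    "new_cases": [10.0, 10.0, None, 5.0],
})

# Completeness: percentage of missing values per column.
missing_pct = df.isna().mean().mul(100).round(1)

# Duplicates: rows that repeat an earlier (location, date) pair.
dup_mask = df.duplicated(subset=["location", "date"], keep="first")
n_duplicates = int(dup_mask.sum())

# Keep only the first occurrence of each (location, date) pair.
deduplicated = df[~dup_mask].reset_index(drop=True)

print(missing_pct)
print(f"duplicate rows: {n_duplicates}")
```

Treating `(location, date)` as the unique key is a design choice for this sketch; a different key may suit the real data better.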

---

## Visualizations

The project focuses on the following key questions:

- **Which countries have the largest populations?**  
- **How do population size and life expectancy compare across countries?**  
- **How did the number of cases and deaths grow over time in selected countries?**  
- **Which countries vaccinated the highest share of their population?**  

Dashboards then allow users to explore these questions interactively.
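
The last question above can be answered with a short ranking step. The sketch below is one possible approach, not necessarily the code used in the notebooks: the helper `top_vaccinated` is hypothetical, the rows are toy data, and the column names (`location`, `date`, `population`, `people_vaccinated`) are assumptions.

```python
import pandas as pd

def top_vaccinated(df: pd.DataFrame, n: int = 10, per_capita: bool = True) -> pd.DataFrame:
    """Rank countries by their latest vaccination figure.

    Hypothetical helper; assumes the columns 'location', 'date',
    'population' and 'people_vaccinated' exist.
    """
    # Take the most recent non-missing vaccination record per country.
    latest = (
        df.dropna(subset=["people_vaccinated"])
          .sort_values("date")
          .groupby("location", as_index=False)
          .last()
    )
    # Share of the population with at least one dose, in percent.
    latest["vaccinated_pct"] = latest["people_vaccinated"] / latest["population"] * 100
    metric = "vaccinated_pct" if per_capita else "people_vaccinated"
    columns = ["location", "people_vaccinated", "vaccinated_pct"]
    return latest.nlargest(n, metric)[columns].reset_index(drop=True)

# Toy data with made-up values, for illustration only.
toy = pd.DataFrame({
    "location": ["A", "A", "B", "C"],
    "date": ["2021-01-01", "2021-02-01", "2021-02-01", "2021-02-01"],
    "population": [100, 100, 1000, 10],
    "people_vaccinated": [10, 50, 200, 9],
})

by_share = top_vaccinated(toy, n=2, per_capita=True)
by_count = top_vaccinated(toy, n=2, per_capita=False)
print(by_share)
print(by_count)
```

The two rankings differ because a large country can lead in absolute doses while a small one leads in population share, which is the contrast the vaccination dashboard explores.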

---

## Libraries Used

- **pandas** – data processing and analysis  
- **numpy** – numerical operations  
- **seaborn** – static visualizations  
- **matplotlib** – static visualizations and formatting  
- **plotly** – interactive charts  
- **dash** – interactive dashboards  
- **threading** and **socket** – running Dash apps in Colab  
- **google.colab.output** – displaying dashboards inside Google Colab  
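
To show how **threading** and **socket** fit together here, the following is a minimal sketch of one common pattern: ask the operating system for a free port, then start the server in a background thread so the notebook cell does not block. The helper names `find_free_port` and `run_in_background` are hypothetical, and the commented Dash and Colab usage assumes an app object named `app`; the notebooks may do this differently.

```python
import socket
import threading

def find_free_port() -> int:
    """Ask the OS for an unused TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        s.bind(("127.0.0.1", 0))
        return s.getsockname()[1]

def run_in_background(target, *args, **kwargs) -> threading.Thread:
    """Start `target` in a daemon thread so the calling cell returns at once."""
    thread = threading.Thread(target=target, args=args, kwargs=kwargs, daemon=True)
    thread.start()
    return thread

# Hypothetical usage inside Colab, assuming a Dash app object named `app`
# (not defined in this sketch):
#
#   port = find_free_port()
#   run_in_background(app.run, port=port, debug=False)
#   from google.colab import output
#   output.serve_kernel_port_as_iframe(port)
```

Binding to port 0 lets the OS choose the port, which avoids collisions when several dashboards run in the same Colab session. There is a small window between releasing the port and the server binding it, which is usually acceptable in a notebook.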

---

## Project Structure

- **data/** – contains the source dataset  
  - **COVID_19_DATASET** – COVID-19 dataset (Our World in Data, modified)  

- **notebooks/** – Jupyter notebooks with step-by-step workflow  
  - **00_Introduction.ipynb** – project overview and structure  
  - **01_DataQuality.ipynb** – completeness and quality checks, duplicates  
  - **02_BasicGraphs.ipynb** – first visual exploration (static and interactive charts)  
  - **03_Dashboard1.ipynb** – COVID-19 world map  
  - **04_Dashboard2.ipynb** – country trends over time  
  - **05_Dashboard3.ipynb** – top N countries in vaccinations (absolute vs. per capita, with a slider for N)  

- **assets/** – optional screenshots or images for README  
