# 00 — Project Description

## 1. Introduction and Motivation
The **<u>OECD Composite Leading Indicators (CLI)</u>** are designed to anticipate turning points in overall **<u>economic activity</u>** relative to trend. They combine multiple **<u>leading indicators</u>** (e.g., order books, confidence indices, production expectations) to provide early-cycle signals.

After the global shock in **<u>2020</u>**, many economies exhibited divergent recovery paths. Understanding how the **<u>CLI</u>** behaved across countries may reveal early signs of slowdown or rebound.
This project explores and visualizes OECD **<u>CLI</u>** dynamics across countries and examines their relationships with key **<u>macroeconomic indicators</u>**.

---

## 2. Research Questions
- **RQ1:** How did the **<u>CLI</u>** change between the periods **before and after 2020**?
- **RQ2:** Which **<u>countries</u>** experienced the **largest variations** in **<u>CLI</u>** during this period?
- **RQ3:** Is there a statistically significant **<u>correlation</u>** between **<u>CLI</u>** and **<u>GDP growth</u>** (or other **<u>macroeconomic indicators</u>**)?
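
One possible way to operationalize RQ1 and RQ2 is to compare each country's mean CLI before and after a cutoff month and to rank countries by the range (maximum minus minimum) of their values. The sketch below is illustrative only: the country codes `AAA` and `BBB`, the sample values, the `2020-01` cutoff, and the function names are assumptions, not part of the dataset or of `src/utils.py`. It uses only the standard library so it runs without extra dependencies.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records: (country code, "YYYY-MM" period, CLI value).
# The values are made up for illustration; they are not OECD data.
SAMPLE = [
    ("AAA", "2019-11", 100.0),
    ("AAA", "2019-12", 100.2),
    ("AAA", "2020-04", 92.0),
    ("AAA", "2020-05", 94.0),
    ("BBB", "2019-11", 99.5),
    ("BBB", "2019-12", 99.7),
    ("BBB", "2020-04", 98.0),
    ("BBB", "2020-05", 98.4),
]


def summarize(records, cutoff="2020-01"):
    """Return per-country pre/post means, their difference, and the value range.

    Periods are zero-padded "YYYY-MM" strings, so plain string comparison
    orders them chronologically.
    """
    by_country = defaultdict(list)
    for country, period, value in records:
        by_country[country].append((period, value))

    summary = {}
    for country, obs in by_country.items():
        pre = [v for p, v in obs if p < cutoff]
        post = [v for p, v in obs if p >= cutoff]
        if not pre or not post:
            continue  # skip countries without data on both sides of the cutoff
        values = [v for _, v in obs]
        summary[country] = {
            "pre_mean": mean(pre),
            "post_mean": mean(post),
            "change": mean(post) - mean(pre),
            "range": max(values) - min(values),
        }
    return summary


def rank_by_range(summary):
    """Country codes ordered from largest to smallest CLI range."""
    return sorted(summary, key=lambda c: summary[c]["range"], reverse=True)


summary = summarize(SAMPLE)
ranking = rank_by_range(summary)
print(ranking)
```

The range is only one candidate measure of variation; standard deviation or peak-to-trough decline would be equally reasonable choices and may rank countries differently.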

---

## 3. Objectives
This analysis aims to:
1. Understand the **<u>structure</u>** and **<u>behavior</u>** of OECD **<u>CLI</u>** data.
2. Visualize **<u>pre-/post-2020</u>** trends across multiple **<u>countries</u>**.
3. Identify countries with the **<u>largest CLI fluctuations</u>**.
4. Examine the **<u>relationship</u>** between **<u>CLI</u>** and **<u>GDP growth</u>** (and related indicators).

---

## 4. Methodology Overview
We follow a structured process aligned with **<u>IMRAD</u>** and **<u>CRISP-DM</u>**:
1. **<u>Data Understanding</u>** — review dataset structure and **<u>field definitions</u>**.
2. **<u>Data Preparation</u>** — handle **<u>missing values</u>**, normalize time fields, filter relevant series.
3. **<u>Exploratory Analysis</u>** — time-series visualization and **<u>country-level comparison</u>**.
4. **<u>Correlation Study</u>** — quantify relationships between **<u>CLI</u>** and **<u>GDP growth</u>** (and others).
5. **<u>Discussion & Conclusion</u>** — interpret findings, note **<u>limitations</u>**, outline **<u>future work</u>**.
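
As a rough illustration of the Data Preparation step above, the following sketch filters one series, normalizes the time field, and drops rows with missing or unparsable values. The column names `LOCATION`, `SUBJECT`, `TIME`, and `Value`, the subject code `CLI`, and the sample rows are assumptions made for this example; the actual headers and codes in the CSV files should be confirmed during Data Understanding. Dropping incomplete rows is only one way to handle missing values; interpolation may suit some series better.

```python
import csv
import io
from datetime import date

# Hypothetical CSV content; column names and codes are assumed, not verified.
SAMPLE_CSV = """LOCATION,SUBJECT,TIME,Value
AAA,CLI,2019-11,100.1
AAA,CLI,2019-12,
AAA,OTHER,2019-12,55.0
BBB,CLI,2020-01,99.4
BBB,CLI,not-a-date,99.0
"""


def parse_month(text):
    """Convert 'YYYY-MM' to a date on the first of that month, or None."""
    try:
        year, month = text.strip().split("-")
        return date(int(year), int(month), 1)
    except ValueError:
        return None


def prepare(handle, subject="CLI"):
    """Return sorted (location, period, value) tuples for one series.

    Rows belonging to other series, rows with an unparsable period, and
    rows with a missing or non-numeric value are dropped.
    """
    rows = []
    for row in csv.DictReader(handle):
        if row["SUBJECT"] != subject:
            continue
        period = parse_month(row["TIME"])
        raw = (row["Value"] or "").strip()
        if period is None or not raw:
            continue
        try:
            value = float(raw)
        except ValueError:
            continue
        rows.append((row["LOCATION"], period, value))
    rows.sort(key=lambda r: (r[0], r[1]))
    return rows


cleaned = prepare(io.StringIO(SAMPLE_CSV))
for location, period, value in cleaned:
    print(location, period.isoformat(), value)
```

For a real file, the same function could be called with `open(path, newline="")` in place of the in-memory sample.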

---

## 5. Expected Outcomes
- A clear picture of **<u>CLI</u>** trends before, during, and after **<u>2020</u>**.
- Cross-country visualization of **<u>recovery dynamics</u>**.
- Statistical insight into whether **<u>CLI</u>** helps anticipate **<u>GDP growth</u>**.
- A **<u>reproducible</u>** analysis pipeline with code, figures, and documentation.
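
Whether the CLI helps anticipate GDP growth could be probed with a lagged Pearson correlation, pairing the CLI at period t with GDP growth at period t + lag. The sketch below assumes both series are already aligned to the same frequency and have equal length; the function names and the sample numbers are hypothetical and chosen only so that a one-period lead is visible.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


def lagged_correlation(leading, target, lag):
    """Correlate leading[t] with target[t + lag] for a non-negative lag."""
    if lag < 0:
        raise ValueError("lag must be non-negative")
    if lag == 0:
        return pearson(leading, target)
    return pearson(leading[:-lag], target[lag:])


# Made-up series: the target follows the leading series one period later.
cli = [1, 2, 3, 4, 5, 4, 3]
gdp_growth = [0, 2, 4, 6, 8, 10, 8]

r_lag0 = lagged_correlation(cli, gdp_growth, 0)
r_lag1 = lagged_correlation(cli, gdp_growth, 1)
print(round(r_lag0, 3), round(r_lag1, 3))
```

A higher correlation at a positive lag is consistent with a leading relationship but does not by itself establish predictive power; with short samples, the significance of each coefficient should be checked separately.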

---

## 6. Project Structure
```text
oecd-leading-indicators/
│
├── data/
│   ├── MEI_20022020103548670.csv
│   └── MEI_26032020094401290.csv
│
├── notebooks/
│   ├── 00_project_description.ipynb
│   ├── 01_intro_and_dataset.ipynb
│   ├── 02_data_dictionary.ipynb
│   ├── 03_cli_trends_analysis.ipynb
│   ├── 04_correlation_study.ipynb
│   └── 05_summary_and_discussion.ipynb
│
├── src/
│   └── utils.py
│
├── reports/
│   └── .gitkeep
│
├── figs/
│   └── .gitkeep
│
├── README.md
└── .gitignore
```

---

## 7. Next Steps
At this stage, the project framework and research questions have been established.
The next phase focuses on performing the core analysis tasks:

1. Complete the **<u>data preparation</u>** for OECD **<u>CLI</u>** fields.
2. Clean and preprocess the dataset for **<u>trend analysis</u>**.
3. Visualize **<u>country-level CLI changes</u>** to identify major patterns.
4. Conduct **<u>correlation studies</u>** between **<u>CLI</u>** and other **<u>macroeconomic indicators</u>**, and summarize findings.

---

## 8. References
- **OECD Composite Leading Indicators (CLI)** — Official OECD source describing the concept and methodology of CLI.
  [https://data.oecd.org/leadind/composite-leading-indicator-cli.htm](https://data.oecd.org/leadind/composite-leading-indicator-cli.htm)
- **Kaggle Dataset (Leading Indicators – OECD)** — Primary dataset used in this project, containing monthly CLI data for multiple countries.
  [https://www.kaggle.com/datasets/alenavorushilova/leading-indicators-oecd](https://www.kaggle.com/datasets/alenavorushilova/leading-indicators-oecd)
- **OECD MEI (Main Economic Indicators) Portal** — Provides related macroeconomic indicators such as GDP, which are used for correlation analysis.
  [https://stats.oecd.org/](https://stats.oecd.org/)
- **CRISP-DM Framework** — Data science process model that guides this project’s workflow from understanding to evaluation.
  [https://en.wikipedia.org/wiki/Cross-industry_standard_process_for_data_mining](https://en.wikipedia.org/wiki/Cross-industry_standard_process_for_data_mining)
- **IMRAD Structure (Scientific Writing Model)** — Scientific paper structure used to organize this project into logical sections (Introduction, Methods, Results, Discussion).
  [https://en.wikipedia.org/wiki/IMRAD](https://en.wikipedia.org/wiki/IMRAD)

