# ECON 138 — Midterm Exam  
### *How Do Maternal and Child Health Outcomes Differ Across Southeast Asia?*

**Objective:**  
This exam will assess your ability to perform *exploratory data analysis (EDA)* and create *visual stories* using real-world microdata.  
You will use the **Demographic and Health Survey (DHS) Phase 7 – Birth Recode (BR)** datasets from Southeast Asian countries to compare maternal and child health outcomes.

## Background
The DHS provides nationally representative data on fertility, maternal and child health, and family planning.  
Each “BR” dataset contains one record per live birth reported by women aged 15–49.

| Country | DHS file | Phase | Year |
|----------|-----------|-------|------|
| Cambodia | `KHBR73FL.DTA` | 7 | 2014 |
| Indonesia | `IDBR71FL.DTA` | 7 | 2017 |
| Myanmar | `MMBR71FL.DTA` | 7 | 2016 |
| Philippines | `PHBR71FL.DTA` | 7 | 2017 |
| Timor-Leste | `TLBR71FL.DTA` | 7 | 2016 |

## Instructions

Your output should **not only consist of tables and charts**, but also **narrative insights and interpretations**.  
For every visualization or descriptive table you produce, include a short **Markdown discussion (2–4 sentences)** explaining:
- What the data show  
- How patterns differ across countries or socio-economic groups  
- What potential economic, demographic, or policy interpretations can be drawn  

These narratives are essential for demonstrating your understanding of the data and your ability to communicate findings effectively, similar to how economists write data briefs or empirical summaries.


## Data Preparation
1. Load each `.dta` file using `pandas.read_stata()`.  
2. Create a `country` column (e.g., “Cambodia”, “Indonesia”) and concatenate all countries into one DataFrame.  
3. Apply the survey sampling weights in every summary statistic, so that means and shares are weighted rather than raw sample averages.
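
The preparation steps above could be sketched roughly as follows. This is only an illustration under stated assumptions: the `FILES` mapping, the helper names `load_birth_recodes` and `add_weight`, and the `wt` column are hypothetical, the files are assumed to sit in the working directory, and the weight is assumed to follow the standard DHS recode convention (woman's sample weight in `v005`, stored with six implied decimal places). Check the recode documentation for your files before relying on it.

```python
import pandas as pd

# Hypothetical mapping: country name -> local path of its BR file.
FILES = {
    "Cambodia": "KHBR73FL.DTA",
    "Indonesia": "IDBR71FL.DTA",
    "Myanmar": "MMBR71FL.DTA",
    "Philippines": "PHBR71FL.DTA",
    "Timor-Leste": "TLBR71FL.DTA",
}

def load_birth_recodes(files, columns=None):
    """Read each BR file, tag it with its country, and stack the results."""
    frames = []
    for country, path in files.items():
        # convert_categoricals=False keeps numeric codes instead of value labels,
        # which avoids label mismatches when stacking countries.
        df = pd.read_stata(path, columns=columns, convert_categoricals=False)
        df["country"] = country
        frames.append(df)
    return pd.concat(frames, ignore_index=True)

def add_weight(df, raw="v005"):
    """Rescale the raw weight (assumed to carry six implied decimals)."""
    out = df.copy()
    out["wt"] = out[raw] / 1_000_000
    return out

# Toy frame standing in for real data, just to show the rescaling.
toy = add_weight(pd.DataFrame({"v005": [1_500_000, 500_000]}))
```

With the real files in place, `add_weight(load_birth_recodes(FILES))` would return the pooled DataFrame. Passing a short `columns` list keeps memory use manageable, since BR files are wide.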

## Tasks

### A. Maternal Demographics
1. Plot a **histogram** and **kernel density plot** of maternal age at birth for each country.  
2. Create **boxplots** of maternal age grouped by country and by **urban/rural**.  
3. Show a **stacked bar chart** of educational attainment by country.  
4. Compute and plot the **mean education level** by wealth quintile across countries.  
5. Visualize **age–education profiles** using density plots.



### B. Fertility and Birth Patterns
1. Plot **birth order distributions** across countries.  
2. Compute **mean birth order by mother’s age group** and plot as a line chart per country.  
3. Create a **heatmap** of mean birth order by education level and wealth quintile (pooled data).  
4. Plot the distribution of the **total number of births per woman** by country to visualize variation in fertility.
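
For the heatmap task, one possible way to build the underlying table of weighted cell means is sketched below. The helper `weighted_mean_table` is hypothetical, and the column names are assumptions based on the usual DHS recode naming (`bord` for birth order, `v106` for education level, `v190` for wealth quintile, plus a rescaled weight column `wt`). The toy data are made up purely to show the mechanics.

```python
import numpy as np
import pandas as pd

def weighted_mean_table(df, value, row, col, weight="wt"):
    """Weighted mean of `value` in every (row, col) cell, as a 2-D table."""
    tmp = df[[value, row, col, weight]].dropna()
    # Weighted mean = sum(w * x) / sum(w), computed within each cell.
    tmp = tmp.assign(_wx=tmp[value] * tmp[weight])
    sums = tmp.groupby([row, col])[["_wx", weight]].sum()
    cell_means = sums["_wx"] / sums[weight]
    # Move the column variable from the index to the columns.
    return cell_means.unstack(col)

# Toy data standing in for the pooled DataFrame (values are invented).
toy = pd.DataFrame({
    "bord": [1, 3, 2, 4],
    "v106": [0, 0, 1, 1],
    "v190": [1, 1, 1, 2],
    "wt":   [1.0, 3.0, 2.0, 1.0],
})
table = weighted_mean_table(toy, "bord", "v106", "v190")
```

The resulting table has education levels as rows and wealth quintiles as columns, so it can be passed directly to a heatmap function such as `seaborn.heatmap`. Cells with no observations come out as `NaN`.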


### C. Health Service Utilization
1. Plot **average number of ANC visits** by country and by **urban/rural**.  
2. Compute the **share of births in health facilities** by country.  
3. Show a **grouped bar chart** comparing skilled birth attendance by wealth quintile and country.  
4. Create a **scatter plot** of maternal age vs. number of ANC visits, colored by residence (urban/rural).

### D. Child Survival Outcomes
1. Compute the **infant survival rate** for each country as the weighted mean of a binary survival indicator (1 = survived, 0 = died) × 100.  
2. Plot a **bar chart** of survival rate by country.  
3. Create a **heatmap** of survival rate by education and wealth.  
4. Plot survival probability by **maternal age group** to visualize age-related risk.  
5. Compare survival by place of delivery (facility vs. home).
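
A weighted rate for a 0/1 indicator, by country or by maternal age group, might be computed along these lines. This is a minimal sketch, not a prescribed solution: `weighted_rate` is a hypothetical helper, and the columns `alive`, `age_at_birth`, and `wt` are placeholder names that you would need to construct from the actual DHS variables. The age bins are also just one possible choice.

```python
import numpy as np
import pandas as pd

def weighted_rate(df, indicator, by, weight="wt"):
    """Weighted mean of a 0/1 indicator within each group, times 100."""
    tmp = df[[indicator, by, weight]].dropna()
    # observed=True drops empty categories when `by` is categorical.
    num = (tmp[indicator] * tmp[weight]).groupby(tmp[by], observed=True).sum()
    den = tmp[weight].groupby(tmp[by], observed=True).sum()
    return 100 * num / den

# Toy data with invented values, standing in for the pooled DataFrame.
toy = pd.DataFrame({
    "country":      ["A", "A", "B", "B"],
    "alive":        [1, 0, 1, 1],
    "age_at_birth": [17, 24, 31, 42],
    "wt":           [3.0, 1.0, 1.0, 1.0],
})

by_country = weighted_rate(toy, "alive", "country")

# Five-year maternal age groups, closed on the left: [15, 20), [20, 25), ...
toy["age_group"] = pd.cut(
    toy["age_at_birth"], bins=[15, 20, 25, 30, 35, 40, 45, 50], right=False
)
by_age = weighted_rate(toy, "alive", "age_group")
```

The same helper works for any binary outcome and grouping column, for example a facility-delivery indicator grouped by country.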

## Evaluation Criteria

| Component | Points |
|------------|--------|
| Data import, cleaning, and preparation | 15 |
| Variable engineering and weighting | 15 |
| Descriptive statistics | 15 |
| Visualization variety and design | 30 |
| Interpretation and narrative | 20 |
| Organization and readability | 5 |
| **Total** | **100** |