# 🧰 3.0 Introduction to Data‑Handling

Welcome to the wild swamp of data wrangling! In this notebook, we’ll chart the river from raw inputs to tasty summaries, using nutrition and food‑science examples (with a few hippo-themed jokes to keep things hippo-thetical).

### Learning Objectives

- Map out the end‑to‑end steps from raw inputs to aggregated summaries  
- Identify and connect to key nutrition and food‑science data sources  
- Get familiar with the Python libraries and environment setup  

### Key Datasets

- **`hippo_nutrients.csv`**: Experimental macronutrient and micronutrient profiles for hippopotamus feeding trials  
- **FoodData Central API**: REST API access to USDA’s FoodData Central for comprehensive food composition data  
- **`clinical_trial_data.xlsx`**: Simulated dietary intervention results from a randomized trial  
- **Local SQLite database (`fns_db.sqlite`)**: Consolidated tables for nutrient reference values and cohort metadata  
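
As a rough preview of what "connecting" to these sources looks like, here is a minimal sketch using pandas and the standard-library `sqlite3` module. It does not read the real course files: the CSV text, the `reference_values` table, and every column name below are hypothetical stand-ins, so the real schemas may well differ. The Excel file would be read in a similar way with `pd.read_excel`, and the FoodData Central API through HTTP requests; neither is shown here because they need the real file and network access.

```python
import io
import sqlite3

import pandas as pd

# Stand-in for hippo_nutrients.csv: column names and values are invented for illustration.
csv_text = """animal_id,nutrient,value
H1,Iron,2.5
H1,Calcium,50.0
H2,Iron,3.1
"""
# pd.read_csv accepts a file path or any file-like object.
nutrients = pd.read_csv(io.StringIO(csv_text))

# Stand-in for fns_db.sqlite: an in-memory database with a hypothetical table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE reference_values (nutrient TEXT, rda REAL)")
conn.executemany(
    "INSERT INTO reference_values VALUES (?, ?)",
    [("Iron", 8.0), ("Calcium", 1000.0)],
)
conn.commit()

# Pull the result of a SQL query straight into a DataFrame.
reference = pd.read_sql_query("SELECT * FROM reference_values", conn)
conn.close()

print(nutrients.shape, reference.shape)
```

Section 3.2 covers the real connections in detail; the point here is only that each source ends up as a pandas DataFrame.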



In [None]:
# Installs requirements and the editable package.
# (The comment sits on its own line so it is not passed to the script as arguments.)
%run ../../bootstrap.py

import fns_toolkit as fns

---
### Verify Your Environment

Run the following to confirm your versions and ensure everything’s ready for our hippo expedition.

In [None]:
import sys
import pandas as pd
import numpy as np

print(f"Python version: {sys.version.split()[0]}")
print(f"pandas version: {pd.__version__}")
print(f"numpy version: {np.__version__}")

---
### 🚀 Roadmap

Here’s our journey through Part 3:

1. **3.1 Principles of Tidy Data**: Learn the core rules that make data hippo-riffic.  
2. **3.2 Importing & Connecting**: Pull in datasets from CSVs, APIs, and databases.  
3. **3.3 Data Cleaning & Quality Checks**: Handle missing values and inconsistent measurements.  
4. **3.4 Reshaping & Merging Datasets**: Tame wide and long data, and join multiple tables.  
5. **3.5 Transformation & Feature Engineering**: Create new variables like flavanol ratios.  
6. **3.6 Aggregation & Summarization**: Summarize intake by cohort and export results.  
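
To make the roadmap concrete, here is a compressed sketch of a few of these stages on a toy table. The cohort labels, column names, and numbers are all invented for illustration, and dropping missing rows is just one simple cleaning choice among several; the later sections treat each step properly and in the order listed above.

```python
import pandas as pd

# Hypothetical wide-format intake table; cohorts and values are invented for illustration.
raw = pd.DataFrame(
    {
        "cohort": ["A", "A", "B", "B"],
        "protein_g": [50.0, None, 70.0, 80.0],
        "fibre_g": [20.0, 25.0, None, 30.0],
    }
)

# Reshape (3.4): wide to long, one row per cohort/nutrient measurement.
long = raw.melt(id_vars="cohort", var_name="nutrient", value_name="amount_g")

# Clean (3.3): drop measurements that are missing.
clean = long.dropna(subset=["amount_g"])

# Aggregate (3.6): mean intake per cohort and nutrient.
summary = clean.groupby(["cohort", "nutrient"], as_index=False)["amount_g"].mean()

print(summary)
```

Reshaping before cleaning means a missing protein value does not throw away the fibre measurement recorded in the same row.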

Ready to get your feet wet? Let’s dive into 3.1 next. May your code be ever crash-free and your data as tidy as a basking hippo!