# 🧪 Analysing Food Composition Data with pandas

In this notebook, we'll work with a small food composition dataset. You'll learn how to:
- Load a dataset using `pandas`
- Explore and clean data
- Calculate summary statistics
- Create simple visualisations


## 📥 Load the Data

In [None]:
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Load dataset (use your own path or upload to Colab if necessary)
df = pd.read_csv('food_sample_data.csv')
df
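
Before moving on, it can help to confirm that the file contains the columns the later cells rely on. The sketch below is one possible check, not part of the original notebook: `EXPECTED_COLUMNS` and `load_food_data` are hypothetical names, and the inline CSV text is a made-up stand-in for `food_sample_data.csv` with placeholder values rather than real composition data.

```python
import io

import pandas as pd

# Columns referenced by the cells in this notebook.
EXPECTED_COLUMNS = ["Product", "Category", "Energy_kcal", "Sugar_g", "Protein_g"]


def load_food_data(source):
    """Read a CSV and fail early if an expected column is missing."""
    data = pd.read_csv(source)
    missing = [col for col in EXPECTED_COLUMNS if col not in data.columns]
    if missing:
        raise ValueError(f"Missing expected columns: {missing}")
    return data


# Made-up stand-in for food_sample_data.csv (illustrative values only).
sample_csv = io.StringIO(
    "Product,Category,Energy_kcal,Sugar_g,Protein_g\n"
    "Example cola,Beverage,40,10.0,0.0\n"
    "Example yoghurt,Dairy,60,5.0,4.0\n"
)
sample_df = load_food_data(sample_csv)
print(sample_df.head())
```

With the real file, the same helper could be called as `load_food_data('food_sample_data.csv')`, assuming the file uses these column names.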

## 📊 Basic Exploration

In [None]:
# Column names, dtypes and non-null counts
df.info()
# Summary statistics for the numeric columns
df.describe()
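
The introduction also mentions cleaning. A minimal sketch of two common checks is shown here, under the assumption that problems would show up as missing values or as numbers stored as text; the sample file may have neither. The `raw` frame is a made-up stand-in with deliberately planted problems, and `nutrient_cols` is a name introduced for this example.

```python
import pandas as pd

# Made-up stand-in rows (illustrative values only) with two planted problems:
# energy values stored as text and one missing sugar value.
raw = pd.DataFrame({
    "Product": ["Example cola", "Example yoghurt", "Example bar"],
    "Category": ["Beverage", "Dairy", "Snack"],
    "Energy_kcal": ["40", "60", "450"],
    "Sugar_g": [10.0, None, 30.0],
    "Protein_g": [0.0, 4.0, 6.0],
})

nutrient_cols = ["Energy_kcal", "Sugar_g", "Protein_g"]

# Count missing values per column.
missing_counts = raw.isna().sum()
print(missing_counts)

clean = raw.copy()
# Convert nutrient columns to numbers; entries that cannot be parsed become NaN.
clean[nutrient_cols] = clean[nutrient_cols].apply(pd.to_numeric, errors="coerce")
# Drop rows that lack any nutrient value, then renumber the index.
clean = clean.dropna(subset=nutrient_cols).reset_index(drop=True)
print(clean)
```

Dropping rows is only one option; whether to drop, fill, or keep missing values depends on what the analysis is for.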

## 🍭 Visualise Sugar Content by Product

In [None]:
plt.figure(figsize=(10, 5))
sns.barplot(data=df, x='Product', y='Sugar_g')
plt.title('Sugar Content per 100 g/ml by Product')
plt.ylabel('Sugar (g)')
# Right-align the rotated labels so each one ends under its bar
plt.xticks(rotation=45, ha='right')
plt.tight_layout()
plt.show()
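
Bar charts are often easier to read when the bars are sorted. One way to get a sorted product order with `pandas` is sketched below; `df_demo` holds made-up placeholder values and `product_order` is a name introduced for this example.

```python
import pandas as pd

# Made-up stand-in rows (illustrative values only).
df_demo = pd.DataFrame({
    "Product": ["Example cola", "Example yoghurt", "Example bar"],
    "Sugar_g": [10.0, 5.0, 30.0],
})

# Product names sorted from highest to lowest sugar content.
product_order = (
    df_demo.sort_values("Sugar_g", ascending=False)["Product"].tolist()
)
print(product_order)
```

The resulting list can be passed to `sns.barplot` through its `order` parameter so the bars are drawn in that sequence.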

## 📂 Mean Nutrient Content by Category

In [None]:
df.groupby('Category')[['Energy_kcal', 'Sugar_g', 'Protein_g']].mean()
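
A category mean is easier to interpret alongside the number of products behind it. The sketch below uses named aggregation to report both; the `demo` frame contains made-up placeholder values, and the output column names (`n_products`, `mean_sugar_g`, and so on) are chosen for this example only.

```python
import pandas as pd

# Made-up stand-in rows (illustrative values only).
demo = pd.DataFrame({
    "Category": ["Beverage", "Beverage", "Dairy"],
    "Energy_kcal": [40, 20, 60],
    "Sugar_g": [10.0, 4.0, 5.0],
    "Protein_g": [0.0, 0.0, 4.0],
})

summary = (
    demo.groupby("Category")
    .agg(
        # Number of products with a non-missing sugar value.
        n_products=("Sugar_g", "count"),
        mean_energy_kcal=("Energy_kcal", "mean"),
        mean_sugar_g=("Sugar_g", "mean"),
        mean_protein_g=("Protein_g", "mean"),
    )
    .round(1)
)
print(summary)
```

A mean based on one or two products says little about the category as a whole, which is why the count is kept next to it.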

## ✅ Summary
- You loaded and explored a simple food composition dataset
- You used `pandas` for grouping and summary statistics
- You created a bar plot using `matplotlib` and `seaborn`

This type of analysis is useful in both public health and food product development contexts.