---

# ✏️ Exercises: Exploring Other Datasets (scikit-learn)

To extend your practice beyond the Iris dataset, try applying the **same EDA workflow** to other datasets available in `scikit-learn`.  
This will help you generalize the process of **loading data, checking quality, visualizing distributions, detecting correlations, and generating hypotheses**.

## 🔹 Suggested Datasets

1. **Wine Dataset** (`load_wine`)  
   - Classification dataset with 178 samples and 13 features describing chemical properties of wines.  
   - Task: Explore how the three wine classes differ in terms of alcohol content, phenols, and color intensity.  

2. **Breast Cancer Dataset** (`load_breast_cancer`)  
   - Classification dataset with 569 samples and 30 features related to cell nuclei characteristics.  
   - Task: Investigate which features appear most discriminative between malignant and benign tumors.  

3. **Digits Dataset** (`load_digits`)  
   - Classification dataset with 1,797 samples of 8x8 grayscale images of handwritten digits (64 pixel features, 10 classes).  
   - Task: Perform basic EDA to visualize how many samples each digit class has and how pixel intensities correlate.  
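
Before starting, it can help to confirm the sizes quoted above. The snippet below is a minimal sketch that loads each dataset and reports its shape and number of classes; the `loaders` and `shapes` dictionaries are names introduced here for illustration only.

```python
from sklearn.datasets import load_breast_cancer, load_digits, load_wine

# Map a short label to each scikit-learn loader function
loaders = {
    "wine": load_wine,
    "breast_cancer": load_breast_cancer,
    "digits": load_digits,
}

shapes = {}
n_classes = {}
for name, loader in loaders.items():
    bunch = loader()                      # Bunch with .data and .target arrays
    shapes[name] = bunch.data.shape       # (n_samples, n_features)
    n_classes[name] = len(set(bunch.target))
    print(f"{name}: {shapes[name][0]} samples, "
          f"{shapes[name][1]} features, {n_classes[name]} classes")
```

For Digits, the 64 features are the flattened pixels of each 8x8 image.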

---

## 🔹 Exercise Instructions

For each dataset:  
1. **Load** the dataset by importing its loader (`from sklearn.datasets import load_<dataset_name>`) and calling it.  
2. Convert it into a **pandas DataFrame** for easier handling.  
3. Perform:  
   - Missing value check  
   - Descriptive statistics  
   - Distribution plots (histograms, boxplots)  
   - Correlation heatmaps  
   - Pairplots (if feasible; with many features, as in Breast Cancer or Digits, plot only a small subset)  
4. **Compare classes** and propose **hypotheses** about feature importance and separability.  
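
The steps above could be sketched as follows for the Breast Cancer dataset. This is one possible outline, not a prescribed solution: `bunch_to_frame` is a hypothetical helper, plotting only the first four features is an arbitrary choice to keep the figures readable, and the standardized gap between class means is just one rough way to rank features for hypothesis generation.

```python
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.datasets import load_breast_cancer


def bunch_to_frame(bunch):
    """Hypothetical helper: features as columns plus a 'target' column."""
    df = pd.DataFrame(bunch.data, columns=bunch.feature_names)
    df["target"] = bunch.target
    return df


cancer = load_breast_cancer()
df = bunch_to_frame(cancer)
features = df.drop(columns="target")

# Missing value check: count missing entries per column
missing = df.isna().sum()
print("Columns with missing values:", int((missing > 0).sum()))

# Descriptive statistics (transposed so each feature is a row)
print(features.describe().T[["mean", "std", "min", "max"]].head())

# Distribution plots for a small subset of features
subset = list(features.columns[:4])
features[subset].hist(bins=20, figsize=(8, 6))
features[subset].plot(kind="box", subplots=True, layout=(1, 4), figsize=(10, 3))

# Correlation heatmap of all features
corr = features.corr()
fig, ax = plt.subplots(figsize=(8, 7))
im = ax.imshow(corr, cmap="coolwarm", vmin=-1, vmax=1)
fig.colorbar(im, ax=ax, label="Pearson correlation")
ax.set_title("Feature correlations")

# Compare classes: gap between class means, in units of each feature's std
class_means = df.groupby("target").mean()
gap = (class_means.loc[0] - class_means.loc[1]).abs() / features.std()
print("Class labels:", list(cancer.target_names))
print(gap.sort_values(ascending=False).head())

plt.show()
```

Features with a large standardized gap are reasonable candidates for a hypothesis such as "this feature separates the two classes well", which you can then check against the histograms and boxplots.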

---

## 📘 Example Starter Code

```python
from sklearn.datasets import load_wine
import pandas as pd

# Load dataset
wine = load_wine()
df_wine = pd.DataFrame(wine.data, columns=wine.feature_names)
df_wine['target'] = wine.target

# Quick check
print(df_wine.head())
print(df_wine['target'].value_counts())
```

The first five rows of `df_wine`:

| | alcohol | malic_acid | ash | alcalinity_of_ash | magnesium | total_phenols | flavanoids | nonflavanoid_phenols | proanthocyanins | color_intensity | hue | od280/od315_of_diluted_wines | proline | target |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 14.23 | 1.71 | 2.43 | 15.6 | 127.0 | 2.8 | 3.06 | 0.28 | 2.29 | 5.64 | 1.04 | 3.92 | 1065.0 | 0 |
| 1 | 13.2 | 1.78 | 2.14 | 11.2 | 100.0 | 2.65 | 2.76 | 0.26 | 1.28 | 4.38 | 1.05 | 3.4 | 1050.0 | 0 |
| 2 | 13.16 | 2.36 | 2.67 | 18.6 | 101.0 | 2.8 | 3.24 | 0.3 | 2.81 | 5.68 | 1.03 | 3.17 | 1185.0 | 0 |
| 3 | 14.37 | 1.95 | 2.5 | 16.8 | 113.0 | 3.85 | 3.49 | 0.24 | 2.18 | 7.8 | 0.86 | 3.45 | 1480.0 | 0 |
| 4 | 13.24 | 2.59 | 2.87 | 21.0 | 118.0 | 2.8 | 2.69 | 0.39 | 1.82 | 4.32 | 1.04 | 2.93 | 735.0 | 0 |
