- Iris Flower Dataset: Suitable for demonstrating simple machine learning classification tasks
- Property Value Small: A toy dataset to show the linear relationship between house size and price
- Property Value: A dataset containing ~1,100 entries with 4 features (year_built, volume_interior, distance, lot_size) and 1 label (taxable_value)
- Name Gender Dataset: A CSV file with ~8,000 first names and their gender, male (m) or female (f)
- Name Region Dataset: A CSV file with ~8,000 first names and the region each belongs to (English, French, German, Spanish, Italian, African, Eastasia, Middleeast, Slavic)
- Spam Classification Dataset: ~5,000 SMS messages, each classified as either spam or no-spam
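
The CSV-based datasets above pair one text field with one label, so they can be read into (feature, label) tuples before training. The following is a minimal sketch using only the Python standard library. The column names `name` and `gender`, the helper `load_labeled_rows`, and the inline sample rows are assumptions made for illustration; the actual files may use a different header or no header at all.

```python
import csv
import io
from collections import Counter

# Hypothetical sample in an assumed "name,gender" layout.
# These rows are illustrative and are not taken from the real file.
SAMPLE_CSV = "name,gender\nAnna,f\nPeter,m\nMaria,f\n"


def load_labeled_rows(text, feature_col="name", label_col="gender"):
    """Parse CSV text into (feature, label) pairs, skipping incomplete rows."""
    reader = csv.DictReader(io.StringIO(text))
    rows = []
    for record in reader:
        feature = (record.get(feature_col) or "").strip()
        label = (record.get(label_col) or "").strip()
        if feature and label:
            rows.append((feature, label))
    return rows


rows = load_labeled_rows(SAMPLE_CSV)

# Checking the label distribution is a quick sanity test before training.
label_counts = Counter(label for _, label in rows)

print(rows)
print(label_counts)
```

To read from disk instead, the file contents could be passed in with something like `load_labeled_rows(open(path, encoding="utf-8").read())`, where `path` points at the downloaded CSV.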