Skip to content

vanraak/datadepot

Repository files navigation

Package datadepot

Package ‘datadepot’

Title DataDepot

Description

The datadepot package provides a collection of datasets used in the book Data Science Foundations and Machine Learning with Python.

URL https://github.com/vanraak/datadepot

Depends Python (>= 3.10) and Pandas (>2.0)

License GPL (>= 2)

Repository Pypi

Authors Jeroen van Raak

Maintainer Jeroen van Raak, j.j.f.vanraak@uva.nl

NeedsCompilation no

Installation

pip install datadepot

or:

conda install conda-forge::datadepot

Usage

import datadepot
df=datadepot.load("<dataset>")

Replace with the name of the dataset, such as "bank", "house", or "churn".

Example

df=datadepot.load('bank') # Load the bank dataset.

Datasets

The following datasets are included:

  • adult
  • bank
  • cereal
  • churn
  • churn_ibm
  • churn_mlc
  • covid
  • credit
  • credit_card
  • cpu
  • diamonds
  • drug
  • gapminder
  • hotel_city
  • hotel_resort
  • house
  • house_price
  • insurance
  • las_vegas
  • loan
  • machine_failure
  • mpg
  • red_wines
  • vehicle
  • white_wines
  • wholesale

Documentation

The full documentation is available at: https://github.com/vanraak/datadepot/blob/main/README.pdf

About

Accompanying package for Data Science Foundations and Machine Learning: Python

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages