DataSet library is used to load toy datasets in Pharo. The datasets are loaded as DataFrame objects.
To install DataSet, go to the Playground (
Ctrl+OW) in your fresh Pharo image and execute the following Metacello script (select it and press Do-it button or
Metacello new baseline: 'DataSet'; repository: 'github://AtharvaKhare/DataSet'; load.
To see all the available datasets, use
To load a dataset, using
DataSet loadXYZ. For example:
df := DataSet loadBoston.
Loading a new dataset involves downloading it's csv to local filesystem followed by reading it in DataFrame. You can preemptively download datasets by:
"Downloads a single dataset" DataSet downloadBoston. "Downloads all datasets" DataSet downloadAll.
This command skips downloading if dataset already exists. DataSets are stored in
data folder of this repo in your filesystem.