pycleandata

Retrieve and clean data sets for use in machine learning experiments. See also pygendata.

Usage

To process all configured data sets:

$ python3 cleandata.py

Or to specify a single data set using the key from data.yml:

$ python3 cleandata.py <dataset_key>

Name		Name	Last commit message	Last commit date
Latest commit History 105 Commits
_cache		_cache
_notes		_notes
offline_data		offline_data
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
Makefile		Makefile
README.md		README.md
cleandata.py		cleandata.py
data.yml		data.yml
dataset.py		dataset.py
requirements.txt		requirements.txt