Project for Hood College CS 522: Data Mining - Fall 2018
-
Datasets folder: Contains the final .csv dataset (proj_data_final_v1) used for analysis. Subfolders include:
-
Originals folder: Contains the original Farmers Market (FM) dataset and original US Census population dataset. Due to size limitations, original Chronic Disease Indicator (CDI) dataset could not be uploaded. CDI file in this folder represents CDI data only from 2016.
-
Metadata folder: Contains CDI dataset metadata file (json format) and reference article. Also contains webpage references for the FM and population datasets.
-
Processed folder: Contains two interim FM datasets, with and without discretization ranges. Two additional FM datasets have full state names, used for mapping. Also contains an interim CDI dataset that includes CDI abbreviations.
-
-
Jupyter Notebooks folder: Contains final project notebook along with two preliminary notebooks for looking at FM and CDI statistics.
-
shapefiles folder: Contains the shapefiles used for mapping. From the matplotlib GitHub account.