Scripts and workflows for obtaining and formating datasets from external projects for use in GUODA
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
BHL
Genbank
SRTM
iDigBio
wikidata for now, make reference to http site to avoid #51 Jun 29, 2018
.gitignore
LICENSE
README.md

README.md

GUODA Datasets

DOI

This repository holds code for retrieving and formatting datasets for use with the GUODA service. Right now the target is to take data and generated well- formed Spark dataframes and write them out as parquet files.

This is also a place to discuss what data should be made availible in GUODA and what format it should be in.