This repo is two things:
- A Rust script for converting the specially formatted NOAA CDO data into CSVs
- A few Node scripts for creating DuckDB databases out of the Parquet files built from those CSVs*
*This repo does not include CSV -> Parquet processing, but that's because you can grab this utility instead.
- Create a Makefile for downloading the NOAA data. For now you can grab it from the source (you want `ghcnd-stations.txt` and `ghcnd_gsn.tar.gz`).
- Publish the built CSV/Parquet/DuckDB artifacts to GitHub releases, maybe.
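
To give a feel for what "specially formatted" means: `ghcnd-stations.txt` is a fixed-width text file, so each field lives at a fixed column range rather than between delimiters. The sketch below is not the converter in this repo, just a minimal illustration of the idea. The column offsets are taken from the GHCN-Daily readme as best I recall them (check them against the readme shipped with the data), and the names `Station`, `field`, `parse_station`, and `to_csv_row` are made up for this example.

```rust
/// A subset of the fields in one row of `ghcnd-stations.txt` (hypothetical struct).
#[derive(Debug, PartialEq)]
struct Station {
    id: String,
    latitude: f64,
    longitude: f64,
    elevation: f64,
    name: String,
}

/// Slice a fixed-width field by 1-based, inclusive column numbers and trim the padding.
/// Returns None if the line is too short to contain the field.
fn field(line: &str, start: usize, end: usize) -> Option<&str> {
    line.get(start - 1..end.min(line.len())).map(str::trim)
}

/// Parse one station line. Column ranges are an assumption based on the GHCN-Daily readme:
/// ID 1-11, LATITUDE 13-20, LONGITUDE 22-30, ELEVATION 32-37, NAME 42-71.
fn parse_station(line: &str) -> Option<Station> {
    Some(Station {
        id: field(line, 1, 11)?.to_string(),
        latitude: field(line, 13, 20)?.parse().ok()?,
        longitude: field(line, 22, 30)?.parse().ok()?,
        elevation: field(line, 32, 37)?.parse().ok()?,
        name: field(line, 42, 71)?.to_string(),
    })
}

/// Render a station as one CSV row, quoting the name and doubling embedded quotes.
fn to_csv_row(s: &Station) -> String {
    format!(
        "{},{},{},{},\"{}\"",
        s.id,
        s.latitude,
        s.longitude,
        s.elevation,
        s.name.replace('"', "\"\"")
    )
}

fn main() {
    // Illustrative sample row, built with explicit widths so the column layout is visible.
    let line = format!(
        "{:<11} {:>8} {:>9} {:>6} {:<2} {:<30}",
        "ACW00011604", "17.1167", "-61.7833", "10.1", "", "ST JOHNS COOLIDGE FLD"
    );

    println!("id,latitude,longitude,elevation,name");
    match parse_station(&line) {
        Some(station) => println!("{}", to_csv_row(&station)),
        None => eprintln!("skipping malformed line: {line:?}"),
    }
}
```

Slicing by byte offset works here because the station file is plain ASCII; `str::get` returns `None` rather than panicking if a line is truncated, so malformed rows can be skipped instead of aborting the whole conversion.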