Retrieves "county-level processed" files from Yu-Group (Berkeley) github repo and then compares files, producing:
- log with differences noted
- csv file with pairwise differences for each observation (cases or deaths for a given day)
- heatmaps of various slices of pairwise difference matrix
- csv file with processed data in a common format. This file might be useful for visually inspecting the counts for counties with large discrepancies.
4/27/2020 functionality: nytimes_infection vs usafacts_infection are compared.
Usage: bin/compareCSV.pl -outdir <dir>
-excludeIdentical (default) or -noex: suppress printing identical entries -fetch or -nofetch: hook to avoid multiple wgets (for testing)