US states no longer available in time series data??? #1578

Open
sean-mcclure opened this issue Mar 26, 2020 · 9 comments
Comments

@sean-mcclure sean-mcclure commented Mar 26, 2020

With the latest format change, the new time series files no longer list the US states. What happened to the US state time series data?

@drcanak drcanak commented Mar 26, 2020

There are endless posts in the "Issues" regarding changes to the data, and how those changes are breaking all manner of analyses people were doing that relied on this data.

To your question: the state-level data you are hoping to find (as am I) is no longer updated or available as it once was. The best option right now is the daily reports. The trouble there is that the CSV files change from one day to the next, both in which columns are present and in what order those columns appear. So it's not as easy as pulling them all down and simply merging them together.
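A minimal Python sketch of one way to cope with the shifting headers: map every known header variant to a single canonical name and read rows by name rather than by position. The alias table is an assumption based on the header variants I believe the daily reports have used, so check it against the actual files. `read_report` is a hypothetical helper, and the sample rows are made up for illustration.

```python
import csv
import io

# Assumed header variants mapped to one canonical name.
# Verify against the real daily report files before relying on this.
COLUMN_ALIASES = {
    "Province/State": "Province_State",
    "Country/Region": "Country_Region",
    "Last Update": "Last_Update",
    "Latitude": "Lat",
    "Longitude": "Long_",
}


def read_report(text):
    """Parse one daily report and return rows keyed by canonical column names."""
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        row = {}
        for name, value in raw.items():
            if name is None:  # extra unnamed fields in a malformed row
                continue
            # Drop a possible byte-order mark and stray whitespace from the header.
            clean = name.lstrip("\ufeff").strip()
            row[COLUMN_ALIASES.get(clean, clean)] = (value or "").strip()
        rows.append(row)
    return rows


# Made-up sample rows in two different layouts (not real counts).
OLD_LAYOUT = (
    "Province/State,Country/Region,Last Update,Confirmed,Deaths,Recovered\n"
    "Washington,US,2020-03-21T00:00:00,10,1,0\n"
)
NEW_LAYOUT = (
    "FIPS,Admin2,Province_State,Country_Region,Last_Update,Lat,Long_,"
    "Confirmed,Deaths,Recovered,Active,Combined_Key\n"
    '53033,King,Washington,US,2020-03-22 00:00:00,0,0,7,1,0,0,"King, Washington, US"\n'
)

old_rows = read_report(OLD_LAYOUT)
new_rows = read_report(NEW_LAYOUT)
print(old_rows[0]["Province_State"], new_rows[0]["Province_State"])
```

Because rows are looked up by column name, a change in column order does not matter; only a genuinely new header name requires adding an entry to the alias table.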

Just go through the "Issues" to see what a hot mess this became for people who have come to rely on this data. I don't get it. JH is full of super smart people. How they made such a mess of this, so quickly, is anyone's guess.

If anyone has any R code that can pull these daily update files down, clean them, and get them merged into one big, flat file, I sure would be appreciative. The time series at the State level is very helpful for modeling/planning purposes. My R skills are just not that good.
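Not R, but here is a hedged Python sketch of the merge step, which should port to R fairly directly: sum the confirmed counts per state for each daily file, then lay the dates out as columns to get one flat, wide table. It assumes the state and country headers appear under one of the two spellings listed in the code, and it works on CSV text already loaded into a dictionary keyed by ISO date; downloading the files is left out. All function names are hypothetical and the sample counts are made up.

```python
import csv
import io
from collections import defaultdict

# Assumed header variants; adjust if the real files use other spellings.
STATE_COLUMNS = ("Province_State", "Province/State")
COUNTRY_COLUMNS = ("Country_Region", "Country/Region")


def first_present(row, names):
    """Return the first non-empty value found under any of the given headers."""
    for name in names:
        if row.get(name):
            return row[name].strip()
    return ""


def state_totals(text, country="US"):
    """Sum Confirmed per state for one daily report (county rows are added up)."""
    totals = defaultdict(int)
    for row in csv.DictReader(io.StringIO(text)):
        if first_present(row, COUNTRY_COLUMNS) != country:
            continue
        state = first_present(row, STATE_COLUMNS)
        totals[state] += int(float(row.get("Confirmed") or "0"))
    return dict(totals)


def build_time_series(reports):
    """reports maps an ISO date string to CSV text; returns (dates, {state: counts})."""
    dates = sorted(reports)  # ISO dates sort chronologically as plain strings
    per_date = {d: state_totals(reports[d]) for d in dates}
    states = sorted({s for totals in per_date.values() for s in totals})
    return dates, {s: [per_date[d].get(s, 0) for d in dates] for s in states}


def to_csv(dates, series):
    """Write the wide table: one row per state, one column per date."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["Province_State"] + dates)
    for state, counts in series.items():
        writer.writerow([state] + counts)
    return out.getvalue()


# Made-up sample reports in two layouts (not real counts).
SAMPLE_REPORTS = {
    "2020-03-21": (
        "Province/State,Country/Region,Last Update,Confirmed,Deaths,Recovered\n"
        "Washington,US,2020-03-21T00:00:00,10,1,0\n"
        "New York,US,2020-03-21T00:00:00,20,2,0\n"
        "Hubei,China,2020-03-21T00:00:00,5,0,0\n"
    ),
    "2020-03-22": (
        "FIPS,Admin2,Province_State,Country_Region,Last_Update,Lat,Long_,"
        "Confirmed,Deaths,Recovered,Active,Combined_Key\n"
        '53033,King,Washington,US,2020-03-22 00:00:00,0,0,7,1,0,0,"King, Washington, US"\n'
        '53061,Snohomish,Washington,US,2020-03-22 00:00:00,0,0,5,0,0,0,"Snohomish, Washington, US"\n'
        '36061,New York City,New York,US,2020-03-22 00:00:00,0,0,25,2,0,0,"New York City, New York, US"\n'
    ),
}

dates, series = build_time_series(SAMPLE_REPORTS)
print(to_csv(dates, series))
```

A state missing from a given day's file is filled with 0 here; for modeling you may prefer to leave it blank or carry the previous value forward.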

@mdibenigno mdibenigno commented Mar 26, 2020

Couldn't agree more, @drcanak! I've been waiting patiently so I can refresh my R scripts with state-level updates, but it's not cool that it has taken so long. I did find this data source, which could be a good substitute, but I'm hesitant to recode everything: https://covidtracking.com/

@rks125 rks125 commented Mar 26, 2020

I've already posted temporary CSV files that give you what you had before. Use them if you like. I've included the Power Query in case you want to update the files yourself. I will update them daily until JH resolves the issue.

https://www.soothsawyer.com/john-hopkins-time-series-data-confirmed-case-csv-after-march-22-2020/?github=4

@drcanak drcanak commented Mar 26, 2020

Yes, thank you @rks125. I did see a post you made elsewhere, grabbed your files, fumbled around a bit not knowing anything about Power Query (oh, you have to install it ;-) ), and was able to get sheets that reflect the updates. Thank you for this!

@bfosten bfosten commented Mar 26, 2020

In case it's helpful for folks encountering this issue, we've set up a new repo, CovidAPI, which provides time series data for states and provinces (see Regions), as well as other formats of the Johns Hopkins data.

@sean-mcclure sean-mcclure commented Mar 27, 2020

Thanks @bfosten, the coviddata/covid-api is indeed much better.

covid19.js now uses this API as its data source, as it appears much more reliable. Please don't change the formatting :)

@sean-mcclure sean-mcclure commented Mar 29, 2020

And then he goes and changes the URLs, and apparently something with the formatting since now the data cannot be fetched. I give up. It appears it is too much for anyone to leave their original data formatting/destination alone. Data sources that change like this are useless.

@bfosten bfosten commented Mar 29, 2020

@sean-mcclure Apologies for this. Having "API" in the repo name was confusing people who just wanted CSVs, so it has been renamed from covid-api to coviddata (which matches the GitHub organization name). The format shouldn't have changed; I just tried covid19.js with the URLs, and the README examples I tried seem to be working, AFAICT. I've opened a PR with the changes, in case it's useful.

@kedionai kedionai commented Mar 29, 2020

No worries. I had to change the links for the CSV fetches but once I did that everything worked again. Thanks for not changing the formatting.
