This repository has been archived by the owner on Mar 10, 2023. It is now read-only.

Date format for Global Recovered counts differs from Global Confirmed and Global Deaths counts #1581

Open
MattTriano opened this issue Mar 26, 2020 · 6 comments

Comments

@MattTriano

For global recovered counts, the date format is "mm/dd/yyyy", but for global confirmed case counts and global death counts, the date format is "mm/dd/yy".

This isn't a major issue, but if you could pick one format for representing dates, it would make it more convenient to make visualizations from this data.

In any case, I really appreciate this data; thank you for maintaining it.

@ExpDev07

ExpDev07 commented Mar 26, 2020

Also, although it is not visible, the Province/State header in the recovered file starts with a byte order mark (\ufeff), so the key is actually '\ufeffProvince/State'. This breaks CSV parsing when you try to access the key with location["Province/State"].
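One way to sidestep the hidden \ufeff, sketched in Python: decode the file with the `utf-8-sig` codec, which drops a leading byte order mark if one is present. The helper name `read_rows` and the sample bytes are made up for illustration and are not taken from the repository's files.

```python
import csv
import io


def read_rows(raw_bytes: bytes) -> list:
    """Parse CSV bytes into dicts, dropping a leading UTF-8 BOM if present."""
    # "utf-8-sig" behaves like "utf-8" but strips a leading \ufeff.
    text = raw_bytes.decode("utf-8-sig")
    return list(csv.DictReader(io.StringIO(text)))


# Hypothetical sample mimicking a header that starts with a BOM.
sample = "\ufeffProvince/State,Country/Region,1/22/2020\n,Norway,0\n".encode("utf-8")
rows = read_rows(sample)

# The key is now plain "Province/State", without the hidden prefix.
print(list(rows[0].keys())[0] == "Province/State")
```

When reading from disk, passing `encoding="utf-8-sig"` to `open()` should have the same effect.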

Secondly, the row order should match across the three files, as it already does for confirmed and deaths. For example, Norway is on line 177 in both confirmed and deaths, which is good, but in recovered it is on line 175. This makes it much harder to group the three files into a single object.
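Until the row order lines up, a possible workaround is to group rows by location key instead of by line number. The sketch below assumes each file has already been parsed into a list of dicts and that the (Province/State, Country/Region) pair identifies a location in all three files; the function name, the `latest` column, and the sample values are hypothetical.

```python
def merge_by_location(confirmed, deaths, recovered):
    """Group rows from the three files under a (province, country) key."""
    merged = {}
    sources = (("confirmed", confirmed), ("deaths", deaths), ("recovered", recovered))
    for name, rows in sources:
        for row in rows:
            key = (row["Province/State"], row["Country/Region"])
            # Row position is ignored; only the location key matters.
            merged.setdefault(key, {})[name] = row
    return merged


# Hypothetical rows; the recovered list is deliberately in a different order.
confirmed = [
    {"Province/State": "", "Country/Region": "Norway", "latest": "10"},
    {"Province/State": "", "Country/Region": "Oman", "latest": "5"},
]
deaths = [
    {"Province/State": "", "Country/Region": "Norway", "latest": "2"},
    {"Province/State": "", "Country/Region": "Oman", "latest": "0"},
]
recovered = [
    {"Province/State": "", "Country/Region": "Oman", "latest": "3"},
    {"Province/State": "", "Country/Region": "Norway", "latest": "1"},
]

merged = merge_by_location(confirmed, deaths, recovered)
print(merged[("", "Norway")]["recovered"]["latest"])
```

A location missing from one file simply lacks that entry in its group, so callers would need to handle absent keys.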

@DeeKareithi

Agreed on the date format :)

@JeroenKools

Recovered is back?!

@ExpDev07

I was excited too. I think so, since they’ve renamed it to match the style of the two other files instead of removing it as they said in their announcement.

But it’s not really compatible at all with the two other files.

@DeeKareithi

DeeKareithi commented Mar 26, 2020

It is actually a little updated. The compatibility issue is caused by the column names. In all the other files, the date columns are formatted "mm/dd/20", while in recovered they are "mm/dd/2020". There is an easy fix for this. If you are cleaning your data in R:

colnames(recovered) <- gsub("/2020", "/20", colnames(recovered))
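For anyone not working in R, a rough Python equivalent is sketched below. It assumes the month/day/year order described at the top of this issue, and rather than trimming the year it converts both header styles to ISO dates so the three files end up with identical column names. The function name and the sample headers are illustrative only.

```python
from datetime import datetime


def normalize_date_header(label: str) -> str:
    """Return an ISO date for m/d/yy or m/d/yyyy headers; leave other labels alone."""
    for fmt in ("%m/%d/%y", "%m/%d/%Y"):
        try:
            return datetime.strptime(label, fmt).date().isoformat()
        except ValueError:
            continue  # try the next format
    return label  # not a date column, e.g. "Country/Region"


# Illustrative headers covering both year styles.
headers = ["Province/State", "Country/Region", "1/22/20", "1/22/2020"]
normalized = [normalize_date_header(h) for h in headers]
print(normalized)
```

Note that `%y` maps two-digit years 00 to 68 onto 2000 to 2068, which fits this dataset.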

@ExpDev07

ExpDev07 commented Mar 26, 2020

@DeeKareithi Well, that, and the fact that the indexes don’t match up. In confirmed and deaths, the indexes/line numbers match up for the locations, making it easy to merge them into one. With recovered, however, they don’t.
