-
Notifications
You must be signed in to change notification settings - Fork 18.5k
Date format for Global Recovered counts differs from Global Confirmed and Global Deaths counts #1581
Comments
Also, while not visible, the Province/State key in the recovered file has a leading Secondly, the indexing between the three files should match, like confirmed and deaths do. So for example, Norway is on line 177 in confirmed and deaths, which is good, however, in recovered, it's on line 175. This makes it a lot harder when trying to group the three files together into a single object. |
Agreed on the date format :) |
Recovered is back?! |
I was excited too. I think so as they’ve renamed it to the same style as the two other files and not removed it as they said in their announcement. But it’s not really compatible at all with the two other files. |
It is actually a little updated. The compatibility issue is being caused by the column names. In all others, the date format is "dd/mm/20" while in recovered it is "dd/mm/2020". There is an easy fix to this. If you are cleaning your data in R: colnames(recovered) <- gsub("/2020", "/20", colnames(recovered)) |
@DeeKareithi Well that, and the fact that the indexes don’t match up. In confirmed and deaths, the indexes/line number matches up for the locations, making it easy to merge them together into one. However, with recovered, it’s not. |
For global recovered counts, the date format is "mm/dd/yyyy", but for global confirmed case counts and global death counts, the date format is "mm/dd/yy".
This isn't a major issue, but if you could pick one format for representing dates, it would make it more convenient to make visualizations from this data.
In any case, I really appreciate this data; thank you for maintaining it.
The text was updated successfully, but these errors were encountered: