Skip to content
This repository has been archived by the owner on Dec 22, 2022. It is now read-only.

Data for México is wrong after 2020-04-16 #66

Closed
wlmb opened this issue Apr 18, 2020 · 7 comments
Closed

Data for México is wrong after 2020-04-16 #66

wlmb opened this issue Apr 18, 2020 · 7 comments

Comments

@wlmb
Copy link

wlmb commented Apr 18, 2020

All data for México seems wrong in files 2020-04-16.csv, 2020-04-17.csv and 2020-04-18.csv
A typical row reads

MX-AGU,Mexico,Aguascalientes,,0,0,0

The date is missing and there are zeroes for all entries. Yesterday some data was 'missing' but not '0'.

@pablodz
Copy link
Collaborator

pablodz commented Apr 18, 2020

Yes, it's true, we are having trouble running automated scripts. These scripts replace the "missing" to "zero" but we will use blank spaces to indicate absence of data, these days we are fixing it, we already have a program that detects errors. #59

An important fact is that the countries are not reporting the amount to the resolution that we want in the case of recovered. We recommend only using the data corresponding to "Confirmed" and "Deaths"

In the case of Mexico, this data has not been recovered since March, we will try to parse with the data that was recently released. Datos Abiertos México

By now, we have 12.61% of errors in Confirmed cases. Data detailed -> Errors.csv

Data it's still dirty, we need more colaborators to keep clean data.

Our script to detect errors https://bit.ly/2RNkNZc

@wlmb
Copy link
Author

wlmb commented Apr 18, 2020 via email

@pablodz
Copy link
Collaborator

pablodz commented Apr 18, 2020

Is it safe to use the lack of a date as a consistent indicator of missing data?

It would be best to be guided by decreasing data.

Thanks, we're algo looking for extra data to do a correlation study like in China (https://github.com/DataScienceResearchPeru/covid-19_latinoamerica_extra)

Recolección de data por país - DATA (1)
If you know something of these variables please contact us

@carranco-sga
Copy link
Collaborator

I'm maintaining Mexico's data. I've been a bit busy doing some major changes in my own repo in which I scrape the data out of the official pdfs and (soon!) the open data.
I'll push an update for the recent days in a couple of minutes

@carranco-sga
Copy link
Collaborator

carranco-sga commented Apr 18, 2020

I've just pushed the data to the repository, as well as replacing all the "missing" strings to blank spaces in Mexico's info. (8cff184)
Unless there's something additional, I think we can close down this issue.

@pablodz
Copy link
Collaborator

pablodz commented Apr 18, 2020

Yes, I'm creating a script to change 0 to '' (blank spaces), comming soon

@wlmb
Copy link
Author

wlmb commented Apr 18, 2020 via email

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants