This repository has been archived by the owner on Dec 5, 2022. It is now read-only.

Check if deaths <= cases <= tests #391

Closed
jzohrab opened this issue Aug 9, 2020 · 8 comments
Labels
duplicate (This issue or pull request already exists), from-cds (Transferred from https://github.com/covidatlas/coronadatascraper), validation

Comments

@jzohrab
Contributor

jzohrab commented Aug 9, 2020

Original issue https://github.com/covidatlas/coronadatascraper/issues/185, transferred here on Friday Mar 20, 2020 at 05:51 GMT


If data is provided, the check in the title (deaths <= cases <= tests) must hold; otherwise the data is invalid.
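A minimal sketch of what such a check could look like. The function name `checkCounts` and the field names `deaths`, `cases`, and `tests` are assumptions for illustration, not the scraper's confirmed schema. Missing fields are skipped rather than treated as zero, and the function returns warnings instead of throwing:

```javascript
// Hypothetical helper: the name and record shape are assumptions for illustration.
function checkCounts(record) {
  const warnings = [];
  const { deaths, cases, tests } = record;

  // Only compare fields that were actually provided as finite numbers.
  const has = (v) => typeof v === "number" && Number.isFinite(v);

  if (has(deaths) && has(cases) && deaths > cases) {
    warnings.push(`deaths (${deaths}) > cases (${cases})`);
  }
  if (has(cases) && has(tests) && cases > tests) {
    warnings.push(`cases (${cases}) > tests (${tests})`);
  }
  return warnings;
}

// Usage: an empty array means no inconsistency was found.
console.log(checkCounts({ deaths: 3, cases: 120, tests: 900 }));
console.log(checkCounts({ deaths: 5, cases: 4 }));
```

Returning a list of warnings leaves it to the caller to decide whether an inconsistency should reject the record or only flag it for review.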

@jzohrab added the from-cds and validation labels Aug 9, 2020
@jzohrab
Contributor Author

jzohrab commented Aug 9, 2020

(Transferred comment)

I thought about this a bit more and it may not work for a single day. I think you'd have to look at this cumulatively since most places are reporting active cases while deaths and tests are cumulative.

@jzohrab
Contributor Author

jzohrab commented Aug 9, 2020

(Transferred comment)

Additionally, check that deaths are not decreasing. If they are, something at least needs to be double-checked.
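A rough sketch of that check, assuming a per-location time series already sorted by date, with cumulative `deaths` on each entry. The name `findDeathDecreases` and the `{ date, deaths }` shape are hypothetical. Days with no reported deaths are skipped, so a gap does not count as a decrease:

```javascript
// Hypothetical helper: assumes `series` is sorted by date, oldest first.
function findDeathDecreases(series) {
  const flagged = [];
  let previous = null; // last entry that actually reported deaths

  for (const entry of series) {
    if (typeof entry.deaths !== "number") continue; // no data for this day
    if (previous !== null && entry.deaths < previous.deaths) {
      flagged.push({ date: entry.date, from: previous.deaths, to: entry.deaths });
    }
    previous = entry;
  }
  return flagged;
}

// Usage: each flagged entry is a day whose cumulative deaths dropped.
const sample = [
  { date: "2020-03-18", deaths: 10 },
  { date: "2020-03-19" },
  { date: "2020-03-20", deaths: 8 },
  { date: "2020-03-21", deaths: 12 },
];
console.log(findDeathDecreases(sample));
```

A flagged entry is a prompt for manual review rather than proof of bad data, since sources sometimes revise their counts downward.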

@jzohrab
Contributor Author

jzohrab commented Aug 9, 2020

(Transferred comment)

I don't think you can say the data is "invalid". It is what it is at this point. Reporting delays, etc., are likely to cause some odd numbers. I have simply been telling folks to expect odd numbers from time to time, which will hopefully clear up over a few days, e.g. when cases were 'transferred' from cruise ships to countries. My 2c.

@jzohrab
Contributor Author

jzohrab commented Aug 9, 2020

(Transferred comment)

If implemented, this can be added as a step in the steps folder, most likely called validate.js or something similar.

@jzohrab
Contributor Author

jzohrab commented Aug 9, 2020

(Transferred comment)

This may not work at all.

  • Deaths may include individuals who were never tested but were reported based on symptoms and/or post-mortem examination.
  • The number of tests can be the number of samples or the number of individuals. Best practice is to run at least 2 samples per individual. Many departments of health are not breaking the numbers out.

@jzohrab
Contributor Author

jzohrab commented Aug 9, 2020

(Transferred comment)

Thank you all for thinking about this. I have been wanting to see the number of tests administered, because you can only have as many positives as you have tested.

@jzohrab
Contributor Author

jzohrab commented Aug 9, 2020

(Transferred comment)

Are tests marked 0 if there are no records of tests for that location on that day?

@jzohrab
Contributor Author

jzohrab commented Aug 9, 2020

(Transferred comment)

Testing positive is one thing, but diagnostic criteria change, particularly when an epidemic outpaces testing capacity. At that point health authorities switch to other criteria, such as any viral pneumonia, X-ray abnormalities, etc. Italy and China, afaik, have made the switch. Marching orders in Italy are "test if it has clinical implications". Typically, if your case is not severe (well oxygenated, just fever, cough, and discomfort), they send you home under quarantine with palliative care. I think cases should be the best estimate there is; positive tests are a lower bound. If we insist on test-confirmed cases only, events may make our numbers irrelevant.

@jzohrab added the duplicate label Aug 9, 2020
@jzohrab closed this as completed Aug 9, 2020
1 participant