Regarding the proposed removal of recovered values #1282

Open
cipriancraciun opened this issue Mar 23, 2020 · 7 comments

Comments

@cipriancraciun cipriancraciun commented Mar 23, 2020

In issue #1250 it was announced that the recovered numbers are going to be removed from future versions of the datasets due to inconsistencies in reporting or issues in collecting these values.

However, I would suggest keeping these numbers, at least for the countries that do report them, and adding a disclaimer that these numbers are inaccurate. (Perhaps move these files to a separate folder to make that even clearer.)

I think the recovered values are essential for computing the number of active cases, which helps visualize the overall progress of the pandemic.
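For context, the usual dashboard arithmetic behind "active cases" can be sketched as below. This is only an illustration: the function name `active_cases` and the numbers are made up, and the clamp at zero is an assumption to guard against inconsistent reports, not something the dataset prescribes.

```python
def active_cases(confirmed: int, deaths: int, recovered: int) -> int:
    """Active cases as commonly derived: confirmed minus deaths minus recovered.

    The result is clamped at zero because inconsistent reporting can
    otherwise produce a negative count (an assumption of this sketch).
    """
    return max(confirmed - deaths - recovered, 0)


# Made-up numbers, for illustration only.
print(active_cases(confirmed=1000, deaths=20, recovered=300))
```

Without a recovered column, the last term is simply unknown, which is why so many derived dashboards are affected.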

Author

@cipriancraciun cipriancraciun commented Mar 23, 2020

I'll link to this issue other issues regarding the removal of recovered values, like:

Author

@cipriancraciun cipriancraciun commented Mar 24, 2020

I have updated my previous comment about other linked issues. It seems that more and more will pop up as the various derived datasets are updated.

@KeulRalf KeulRalf commented Mar 25, 2020

I agree. I would love to get recovered numbers wherever they are available, even though they are not complete for all countries. Currently, Active is the most important number we report.
If some countries don't deliver them, I would prefer to get them at least for those countries whose data is trustworthy.
Thank you for everyone's effort here.

@lkounadis lkounadis commented Mar 25, 2020

Please add recovered. It's really important to know. It actually helps people feel "safer", if you get my drift.

@karen2k karen2k commented Mar 25, 2020

Assuming people heal completely in at most N days, the number of recovered cases is equal to the delta of the confirmed cases minus deaths minus confirmed cases N days ago. You can find a visualization here and play with N=14 or N=21: https://covid-19.karenishe.com/
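A minimal sketch of one possible reading of this idea, not necessarily the exact formula used on the linked site: if every case is resolved (recovered or dead) within N days, then everyone confirmed by day t-N is resolved by day t, so cumulative recovered is roughly confirmed[t-N] minus deaths[t]. The function names, the clamp at zero, and the sample series are all assumptions made for illustration.

```python
from typing import List


def estimate_recovered(confirmed: List[int], deaths: List[int], n_days: int) -> List[int]:
    """Estimate cumulative recovered per day from cumulative confirmed and deaths.

    Assumes every case resolves within n_days, so the cases confirmed up to
    day t - n_days are either recovered or dead by day t.
    """
    if len(confirmed) != len(deaths):
        raise ValueError("confirmed and deaths must have the same length")
    recovered = []
    for t in range(len(confirmed)):
        # No case can be assumed resolved before n_days have passed.
        resolved = confirmed[t - n_days] if t >= n_days else 0
        # Deaths may include recent cases, so clamp the estimate at zero.
        recovered.append(max(resolved - deaths[t], 0))
    return recovered


def estimate_active(confirmed: List[int], deaths: List[int], n_days: int) -> List[int]:
    """Active = confirmed - deaths - estimated recovered, day by day."""
    recovered = estimate_recovered(confirmed, deaths, n_days)
    return [c - d - r for c, d, r in zip(confirmed, deaths, recovered)]


# Made-up cumulative series, for illustration only.
confirmed = [10, 20, 40, 80, 160]
deaths = [0, 0, 1, 2, 4]
print(estimate_recovered(confirmed, deaths, n_days=2))
print(estimate_active(confirmed, deaths, n_days=2))
```

Under this reading, the active estimate reduces to confirmed[t] minus confirmed[t-N] whenever the clamp does not kick in, so the result is very sensitive to the choice of N.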

@Akuukis Akuukis commented Mar 26, 2020

Most dashboards use "recovered" to calculate "active" cases. I agree with JHU that recovered is badly counted and thus better not used at all, which means the "active" cases are not accurate either.

IMHO the best solution is to use the model by Tomas Pueyo, which estimates "actual" cases based on deaths rather than recovered cases. I made a dashboard that does this: https://pandemic-estimator.net

I'm happy to chat more in this direction. Ping me in an issue on your own dashboard's repository or create one here.

Author

@cipriancraciun cipriancraciun commented Mar 26, 2020

I think there are conflicting needs here (for conflicting reasons), hence a lot of "drama" on this subject. Let me explain:

  • some people just try to build nice graphical representations -- thus even an estimate would be OK; (however they aren't statistics / math experts, thus they just want a "number", "any number";)
  • some people try to build statistical models, and they use the recovered values as inputs or as validation; (thus they want "the" number;)

Now JHU, because they say recovery data collection is unreliable, decided to just drop it for every country, which basically left both categories without any input.

@Akuukis can you provide us with a simple formula that one can apply in a "normal" programming language (e.g. Python, Ruby, Java, etc.), rather than a "maths" programming language (like Julia, R, etc.)?

If I can implement it, I'll add it to my own dataset.


I've looked at the "methodology" section of the linked calculator, and although the formulas seem "simple", they require 4 parameters which I assume are quite dependent on the country's context...

Thus I am skeptical that any "computed" recovery values are accurate enough to be useful.
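To make the concern concrete, here is a rough sketch of a deaths-based back-calculation in the spirit of Tomas Pueyo's article, as I understand it. It is not the methodology of the linked calculator, and its three parameters need not match the four mentioned above. The function name and every parameter value are assumptions chosen for illustration.

```python
def estimate_actual_cases(
    deaths: float,
    fatality_rate: float,
    days_infection_to_death: float,
    doubling_time_days: float,
) -> float:
    """Back-calculate a rough estimate of current cases from cumulative deaths.

    Step 1: deaths / fatality_rate approximates the number of cases that
            existed when the people who have since died were infected.
    Step 2: that number is grown exponentially up to today, assuming cases
            double every doubling_time_days.
    """
    if not 0 < fatality_rate <= 1:
        raise ValueError("fatality_rate must be in (0, 1]")
    if doubling_time_days <= 0:
        raise ValueError("doubling_time_days must be positive")
    cases_back_then = deaths / fatality_rate
    growth_factor = 2 ** (days_infection_to_death / doubling_time_days)
    return cases_back_then * growth_factor


# Illustrative parameter values only; real values vary by country and over time.
for doubling_time in (4.0, 6.0, 8.0):
    estimate = estimate_actual_cases(
        deaths=10,
        fatality_rate=0.01,
        days_infection_to_death=17.0,
        doubling_time_days=doubling_time,
    )
    print(doubling_time, round(estimate))
```

Changing only the assumed doubling time shifts the estimate by a large factor, which illustrates why such "computed" values depend so heavily on each country's context.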
