Any explanation for negative counts? #109

Closed
cpsievert opened this issue Aug 19, 2016 · 3 comments

Comments

@cpsievert

cpsievert commented Aug 19, 2016

Great job pulling this data together, and many thanks for making it public!

From what I gather, most of these counts are cumulative, and if you difference them, you get a surprising number of negative values. Anyone know why? It looks like someone is aware of this issue and is censoring them to 0:

# Set negative values to 0, from the second element on (the first is left untouched).
if (any(confirmed_codes != "")) {
  confirmed[2:length(confirmed)][confirmed[2:length(confirmed)] < 0] <- 0
}
if (any(suspected_codes != "")) {
  suspected[2:length(suspected)][suspected[2:length(suspected)] < 0] <- 0
}
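
To make the effect concrete, here is a minimal sketch of differencing a cumulative series and then censoring it the same way as the snippet above. The numbers are made up for illustration (not taken from the repository), and the variable names `cumulative`, `increments`, and `censored` are my own:

```r
# Hypothetical cumulative confirmed counts for one location (not real data).
cumulative <- c(10, 25, 40, 38, 60)

# Difference the series; the first value is kept as the initial increment.
increments <- c(cumulative[1], diff(cumulative))

# Same indexing pattern as the quoted snippet:
# negative values from the second element on are set to 0.
censored <- increments
censored[2:length(censored)][censored[2:length(censored)] < 0] <- 0

# Compare the censored total with the last reported cumulative count.
sum(censored)
tail(cumulative, 1)
```

In this toy series the censored increments sum to 62 while the last reported cumulative count is 60, so censoring to 0 hides the downward revision instead of accounting for it.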

BTW, I'm developing an R package that provides some more tools for exploring this data -- https://github.com/cpsievert/zikar

@dmrodz
Collaborator

dmrodz commented Aug 19, 2016

These counts are recorded exactly as each country reports them, and they change quite a lot over time as cases get confirmed, depending on the data field. So we'll always see these events.

On Aug 19, 2016 11:20 AM, "Carson" notifications@github.com wrote:

From what I gather, most of these counts are cumulative, and if you difference them, you get a surprising number of negative values. Anyone know why? It looks like someone is aware of this issue, and censoring them to 0 -- https://github.com/cdcepi/zika/blob/03365a5f6649bc115a62b04c2569b9bdd948123f/code/Plot_cases_by_country/Plot_all.R#L83-L87

BTW, I'm developing an R package that provides some more tools for exploring this data (give it a few days before trying the explore() function).



@cpsievert
Author

Right, but I don't understand how a cumulative number of confirmed infections could be 100 one week, then 80 the next week (I'm seeing this happen in multiple countries). Could most, if not all, of these instances be explained by misdiagnosis?
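
For anyone who wants to find these cases, here is a rough base-R sketch that flags reports where a cumulative count goes down within a location. The data frame and its column names (`location`, `week`, `cumulative`) are hypothetical and do not follow the repository's actual schema:

```r
# Hypothetical long-format data (column names are assumptions, not the repo's schema).
counts <- data.frame(
  location   = c("A", "A", "A", "B", "B", "B"),
  week       = c(1, 2, 3, 1, 2, 3),
  cumulative = c(90, 100, 80, 5, 7, 9)
)

# Sort so that each location's reports are in chronological order.
counts <- counts[order(counts$location, counts$week), ]

# Change from the previous report within each location (NA for the first report).
counts$change <- ave(counts$cumulative, counts$location,
                     FUN = function(x) c(NA, diff(x)))

# Keep only the reports where the cumulative count decreased.
drops <- counts[!is.na(counts$change) & counts$change < 0, ]
print(drops)
```

Here only location "A" in week 3 is flagged (100 down to 80). This only locates the drops; it says nothing about why a country revised its numbers.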

@dmrodz
Collaborator

dmrodz commented Aug 19, 2016

I see what you mean. Unfortunately, we can't answer that. The goal here is to accumulate the data as reported by each country.

dmrodz closed this as completed Sep 7, 2016