New York City Borough Level Data now live #3084
Open
Labels
Comments
|
There is a small discrepancy in the naming, if you guys have time to fix (created Issue #3088):
|
Lucas-Czarnecki
added a commit
to Lucas-Czarnecki/COVID-19-CLEANED-JHUCSSE
that referenced
this issue
Sep 2, 2020
Adjust data according to JHU's shift to reporting borough level data. See: CSSEGISandData/COVID-19#3084
|
Do you anticipate providing historical |
Lucas-Czarnecki
added a commit
to Lucas-Czarnecki/COVID-19-CLEANED-JHUCSSE
that referenced
this issue
Sep 3, 2020
This commit addresses past daily reports that are missing values such as FIPS codes and geographical coordinates due to outdated Admin2 names; including, JHU's new approach to reporting New York cases at the borough-level (see CSSEGISandData/COVID-19#3084) as well as counties such as Williamson, Tennessee.
This was referenced Sep 9, 2020
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment

Hello all. After receiving lots of feedback on this decision, we are moving to providing borough level data for New York City. This Git issue will explain our methods for filling in historical data in our time series.
Consistent with our approach to collecting cases, historical cases have been sourced directly from the state of New York website.
Accessing the historical deaths has been slightly trickier. Consistent with other locations and the recommendations of the CDC, our reported deaths are the sum of confirmed and probable deaths.
Confirmed deaths at the Borough level are available on the GitHub managed by the NYC Department of Health, specifically in the following file. Of note, this file appears to be often revised, so our data is current as of today (August 31) but these numbers are subject to change.
Probable deaths at the borough level are available through daily commits to this file. Within these commits, probable deaths in the “unknown” or “data pending” categories are being placed in the “unassigned, New York” entry. Unfortunately, the probable deaths at this granularity are only available beginning May 18th. To generate the probable deaths prior to this date, the time series of total deaths in all of New York City presented on their GitHub in this file was used. To calculate probable deaths prior to May 18th, the following equation was used:
Probable deaths = (Total deaths reported in New York City) - (confirmed deaths at the borough level via the NYC health Github)
These probable deaths have been placed into the “Unassigned, New York” category. Of note, on the day where the logic changes from depending on the two New York City time series files to one of the time series files and the daily commits, we perceive a rise of 462 deaths. We believe that this is due to historical revisions to the borough level time series that haven’t been made to the city total time series. The prior logic, by definition, results in total deaths being equivalent to those reported in the city time series for that date. The strongest evidence for this is that the sum of deaths reported on May 19 using the second approach is greater than the number of deaths reported in the time series file until May 26. That total deaths converge the closer one gets to the present strongly suggests that this is do to redistributing deaths back in time. As far as we can tell, this is the sole approach to obtain borough-level data for the entirety of the pandemic.
The text was updated successfully, but these errors were encountered: