Skip to content
This repository has been archived by the owner on Mar 10, 2023. It is now read-only.

Texas cleans 2020 data *major impact on Texas county-level case data* #5213

Open
CSSEGISandData opened this issue Jan 15, 2022 · 0 comments
Open

Comments

@CSSEGISandData
Copy link
Owner

Today, the US state of Texas published a cleaned dataset aligning their published case data with their final COVID-19 data of 2020 that they provided to CDC. This link contains the department's press release.

The impact of their cleaning on our dataset is two-fold:

  1. There will be impacts in case totals at the state level. We cannot at this time derive the precise impact within our dataset as we obtain case counts for Texas counties from various sources (including the state and county-level health departments as listed in our README), however the daily increase for Texas for the 1-15-2021 data product will be inflated due to these older cases.

  2. Numerous counties within Texas (65+ counties) will have anomalous data posted today, either in the form of large increases in case totals (>10% of total cumulative cases) or decreases in total cases. Any modeling that leverages our county-level case data for Texas over the next few days will likely be highly unstable if these anomalous changes aren't taken into account.

We are trying to engage with the state health department to see if there are means through which we can correct our historical data. As it is a holiday weekend, this may be delayed. We thank you in advance for your patience and will update this issue accordingly.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

1 participant