Shape of the data when the age group, dates, district, or sex are corrected #11

Closed
horazont opened this issue Nov 22, 2021 · 3 comments

horazont commented Nov 22, 2021

First of all: this repository is a very useful resource, and the cleaned-up CSV files are much more pleasant to process than the ArcGIS download. So high praise and thanks for providing them.

While processing the data, in particular while creating a "publication date" column, I ran into the following inconsistency:

The dataset of 2021-11-21 contains a case group with -1 in NeuGenesen for which 2021-11-20 has no corresponding entry:

8115,A80+,W,2021-11-14,2021-11-14,0,0,-9,-1,1,0,-1

2021-11-20 contains the following entries that match 8115,A80+,W:

grep -P '8115,A80\+,W,' 2021-11-20.csv
8115,A80+,W,2020-04-01,2020-03-06,1,0,0,-9,1,1,0
8115,A80+,W,2020-03-18,2020-03-13,1,0,-9,0,1,0,1
8115,A80+,W,2020-03-17,2020-03-14,1,0,-9,0,1,0,1
8115,A80+,W,2020-03-18,2020-03-17,1,0,-9,0,1,0,1
8115,A80+,W,2020-03-19,2020-03-18,1,0,0,-9,1,1,0
8115,A80+,W,2020-03-19,2020-03-19,0,0,-9,0,1,0,1
8115,A80+,W,2020-03-23,2020-03-20,1,0,-9,0,1,0,1
8115,A80+,W,2020-03-23,2020-03-23,0,0,-9,0,2,0,2
8115,A80+,W,2020-03-23,2020-03-23,1,0,0,-9,1,1,0
8115,A80+,W,2020-03-26,2020-03-26,0,0,-9,0,2,0,2
8115,A80+,W,2020-03-28,2020-03-26,1,0,-9,0,1,0,1
8115,A80+,W,2020-03-27,2020-03-27,0,0,-9,0,1,0,1
8115,A80+,W,2020-03-28,2020-03-28,0,0,-9,0,2,0,2
8115,A80+,W,2020-03-30,2020-03-29,1,0,0,-9,1,1,0
8115,A80+,W,2020-03-30,2020-03-30,0,0,-9,0,2,0,2
8115,A80+,W,2020-04-01,2020-03-31,1,0,0,-9,1,1,0
8115,A80+,W,2020-04-02,2020-03-31,1,0,0,-9,1,1,0
8115,A80+,W,2020-04-01,2020-04-01,0,0,-9,0,20,0,20
8115,A80+,W,2020-04-01,2020-04-01,0,0,0,-9,1,1,0
8115,A80+,W,2020-04-03,2020-04-01,1,0,0,-9,1,1,0
8115,A80+,W,2020-04-05,2020-04-01,1,0,0,-9,1,1,0
8115,A80+,W,2020-04-02,2020-04-02,0,0,-9,0,9,0,9
8115,A80+,W,2020-04-02,2020-04-02,1,0,0,-9,1,1,0
8115,A80+,W,2020-04-03,2020-04-03,0,0,-9,0,2,0,2
8115,A80+,W,2020-04-04,2020-04-04,0,0,-9,0,5,0,5
8115,A80+,W,2020-04-05,2020-04-05,0,0,-9,0,20,0,20
8115,A80+,W,2020-04-06,2020-04-06,0,0,-9,0,1,0,1
8115,A80+,W,2020-04-06,2020-04-06,0,0,0,-9,1,1,0
8115,A80+,W,2020-04-07,2020-04-07,0,0,-9,0,1,0,1
8115,A80+,W,2020-04-08,2020-04-08,0,0,-9,0,1,0,1
8115,A80+,W,2020-04-09,2020-04-09,0,0,-9,0,9,0,9
8115,A80+,W,2020-04-14,2020-04-09,1,0,-9,0,1,0,1
8115,A80+,W,2020-04-10,2020-04-10,0,0,-9,0,3,0,3
8115,A80+,W,2020-04-11,2020-04-11,0,0,-9,0,5,0,5
8115,A80+,W,2020-04-15,2020-04-15,0,0,-9,0,1,0,1
8115,A80+,W,2020-04-17,2020-04-17,0,0,-9,0,1,0,1
8115,A80+,W,2020-04-18,2020-04-18,0,0,-9,0,3,0,3
8115,A80+,W,2020-04-18,2020-04-18,0,0,0,-9,1,1,0
8115,A80+,W,2020-04-19,2020-04-19,0,0,-9,0,4,0,4
8115,A80+,W,2020-04-21,2020-04-21,0,0,-9,0,5,0,5
8115,A80+,W,2020-04-22,2020-04-22,0,0,-9,0,1,0,1
8115,A80+,W,2020-04-26,2020-04-26,0,0,-9,0,1,0,1
8115,A80+,W,2020-05-06,2020-05-06,0,0,-9,0,2,0,2
8115,A80+,W,2020-05-14,2020-05-14,0,0,-9,0,1,0,1
8115,A80+,W,2020-06-04,2020-06-04,0,0,-9,0,1,0,1
8115,A80+,W,2020-07-29,2020-07-29,0,0,-9,0,1,0,1
8115,A80+,W,2020-08-28,2020-08-28,0,0,-9,0,1,0,1
8115,A80+,W,2020-08-29,2020-08-29,0,0,-9,0,1,0,1
8115,A80+,W,2020-08-31,2020-08-31,0,0,-9,0,1,0,1
8115,A80+,W,2020-09-01,2020-09-01,0,0,-9,0,1,0,1
8115,A80+,W,2020-09-08,2020-09-08,0,0,0,-9,1,1,0
8115,A80+,W,2020-09-28,2020-09-28,0,0,-9,0,1,0,1
8115,A80+,W,2020-10-01,2020-09-29,1,0,0,-9,1,1,0
8115,A80+,W,2020-10-02,2020-10-02,0,0,-9,0,1,0,1
8115,A80+,W,2020-10-10,2020-10-10,0,0,-9,0,1,0,1
8115,A80+,W,2020-10-20,2020-10-20,0,0,-9,0,2,0,2
8115,A80+,W,2020-10-21,2020-10-21,0,0,-9,0,3,0,3
8115,A80+,W,2020-10-22,2020-10-22,0,0,-9,0,4,0,4
8115,A80+,W,2020-10-23,2020-10-23,0,0,-9,0,7,0,7
8115,A80+,W,2020-10-24,2020-10-24,0,0,-9,0,1,0,1
8115,A80+,W,2020-10-26,2020-10-26,0,0,-9,0,2,0,2
8115,A80+,W,2020-10-26,2020-10-26,0,0,0,-9,2,2,0
8115,A80+,W,2020-10-27,2020-10-27,0,0,-9,0,5,0,5
8115,A80+,W,2020-10-28,2020-10-27,1,0,0,-9,1,1,0
8115,A80+,W,2020-10-28,2020-10-28,0,0,-9,0,2,0,2
8115,A80+,W,2020-11-02,2020-10-28,1,0,0,-9,1,1,0
8115,A80+,W,2020-10-29,2020-10-29,0,0,-9,0,10,0,10
8115,A80+,W,2020-10-30,2020-10-30,0,0,-9,0,9,0,9
8115,A80+,W,2020-10-30,2020-10-30,0,0,0,-9,1,1,0
8115,A80+,W,2020-10-31,2020-10-31,0,0,-9,0,1,0,1
8115,A80+,W,2020-10-31,2020-10-31,0,0,0,-9,1,1,0
8115,A80+,W,2020-10-29,2020-10-31,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-01,2020-10-31,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-03,2020-11-01,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-04,2020-11-01,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-02,2020-11-02,0,0,-9,0,2,0,2
8115,A80+,W,2020-11-07,2020-11-02,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-03,2020-11-03,0,0,-9,0,8,0,8
8115,A80+,W,2020-11-05,2020-11-03,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-04,2020-11-04,0,0,-9,0,3,0,3
8115,A80+,W,2020-11-16,2020-11-04,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-05,2020-11-05,0,0,-9,0,22,0,22
8115,A80+,W,2020-11-06,2020-11-05,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-06,2020-11-06,0,0,-9,0,1,0,1
8115,A80+,W,2020-11-09,2020-11-06,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-12,2020-11-06,1,0,-9,0,1,0,1
8115,A80+,W,2020-10-28,2020-11-06,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-07,2020-11-07,0,0,-9,0,6,0,6
8115,A80+,W,2020-11-08,2020-11-08,0,0,-9,0,2,0,2
8115,A80+,W,2020-11-11,2020-11-09,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-11,2020-11-09,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-10,2020-11-10,0,0,-9,0,1,0,1
8115,A80+,W,2020-11-11,2020-11-11,0,0,-9,0,10,0,10
8115,A80+,W,2020-11-12,2020-11-12,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-13,2020-11-12,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-13,2020-11-13,0,0,-9,0,4,0,4
8115,A80+,W,2020-11-14,2020-11-14,0,0,-9,0,2,0,2
8115,A80+,W,2020-11-19,2020-11-14,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-15,2020-11-15,0,0,-9,0,1,0,1
8115,A80+,W,2020-11-10,2020-11-16,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-18,2020-11-16,1,0,-9,0,2,0,2
8115,A80+,W,2020-11-20,2020-11-16,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-24,2020-11-16,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-19,2020-11-16,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-17,2020-11-17,0,0,-9,0,5,0,5
8115,A80+,W,2020-11-18,2020-11-17,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-19,2020-11-17,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-18,2020-11-18,0,0,-9,0,5,0,5
8115,A80+,W,2020-11-18,2020-11-18,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-20,2020-11-18,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-19,2020-11-19,0,0,-9,0,6,0,6
8115,A80+,W,2020-11-20,2020-11-20,0,0,-9,0,1,0,1
8115,A80+,W,2020-11-20,2020-11-20,0,0,0,-9,1,1,0
8115,A80+,W,2020-11-26,2020-11-20,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-21,2020-11-21,0,0,-9,0,6,0,6
8115,A80+,W,2020-11-24,2020-11-21,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-25,2020-11-21,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-22,2020-11-22,0,0,-9,0,1,0,1
8115,A80+,W,2020-11-25,2020-11-22,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-26,2020-11-22,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-10,2020-11-22,1,0,-9,0,1,0,1
8115,A80+,W,2020-04-06,2020-11-22,1,0,0,-9,1,1,0
8115,A80+,W,2020-11-24,2020-11-23,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-26,2020-11-23,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-28,2020-11-23,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-25,2020-11-25,0,0,-9,0,1,0,1
8115,A80+,W,2020-11-25,2020-11-25,0,0,0,-9,1,1,0
8115,A80+,W,2020-11-23,2020-11-25,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-09,2020-11-25,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-26,2020-11-26,0,0,-9,0,4,0,4
8115,A80+,W,2020-11-26,2020-11-26,0,0,0,-9,1,1,0
8115,A80+,W,2020-11-28,2020-11-26,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-29,2020-11-26,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-01,2020-11-26,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-27,2020-11-27,0,0,-9,0,2,0,2
8115,A80+,W,2020-11-30,2020-11-27,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-28,2020-11-28,0,0,-9,0,2,0,2
8115,A80+,W,2020-11-10,2020-11-28,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-29,2020-11-29,0,0,-9,0,1,0,1
8115,A80+,W,2020-12-04,2020-11-29,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-17,2020-11-29,1,0,-9,0,1,0,1
8115,A80+,W,2020-11-30,2020-11-30,0,0,-9,0,1,0,1
8115,A80+,W,2020-12-05,2020-11-30,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-01,2020-12-01,0,0,-9,0,4,0,4
8115,A80+,W,2020-12-04,2020-12-01,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-02,2020-12-02,0,0,-9,0,1,0,1
8115,A80+,W,2020-12-04,2020-12-02,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-03,2020-12-03,0,0,-9,0,3,0,3
8115,A80+,W,2020-12-03,2020-12-03,0,0,0,-9,1,1,0
8115,A80+,W,2020-12-10,2020-12-03,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-04,2020-12-04,0,0,-9,0,7,0,7
8115,A80+,W,2020-12-04,2020-12-04,0,0,0,-9,2,2,0
8115,A80+,W,2020-12-05,2020-12-05,0,0,-9,0,2,0,2
8115,A80+,W,2020-12-08,2020-12-05,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-07,2020-12-05,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-06,2020-12-06,0,0,-9,0,1,0,1
8115,A80+,W,2020-12-09,2020-12-06,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-07,2020-12-07,0,0,0,-9,1,1,0
8115,A80+,W,2020-12-08,2020-12-07,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-08,2020-12-08,0,0,-9,0,1,0,1
8115,A80+,W,2020-12-09,2020-12-08,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-11,2020-12-08,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-09,2020-12-09,0,0,-9,0,7,0,7
8115,A80+,W,2020-12-16,2020-12-09,1,0,-9,0,2,0,2
8115,A80+,W,2020-12-10,2020-12-10,0,0,-9,0,5,0,5
8115,A80+,W,2020-12-12,2020-12-10,1,0,-9,0,2,0,2
8115,A80+,W,2020-12-16,2020-12-10,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-11,2020-12-11,0,0,-9,0,3,0,3
8115,A80+,W,2020-12-16,2020-12-11,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-12,2020-12-12,0,0,-9,0,20,0,20
8115,A80+,W,2020-12-12,2020-12-12,0,0,0,-9,1,1,0
8115,A80+,W,2020-12-12,2020-12-12,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-14,2020-12-12,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-17,2020-12-12,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-13,2020-12-12,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-14,2020-12-14,0,0,-9,0,2,0,2
8115,A80+,W,2020-12-18,2020-12-14,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-04,2020-12-14,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-15,2020-12-15,0,0,-9,0,5,0,5
8115,A80+,W,2020-12-15,2020-12-15,0,0,0,-9,2,2,0
8115,A80+,W,2020-12-17,2020-12-15,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-18,2020-12-15,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-19,2020-12-15,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-22,2020-12-15,1,0,-9,0,2,0,2
8115,A80+,W,2020-12-16,2020-12-15,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-16,2020-12-16,0,0,-9,0,2,0,2
8115,A80+,W,2020-12-22,2020-12-16,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-17,2020-12-17,0,0,-9,0,12,0,12
8115,A80+,W,2020-12-22,2020-12-17,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-24,2020-12-17,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-18,2020-12-18,0,0,-9,0,9,0,9
8115,A80+,W,2020-12-17,2020-12-18,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-21,2020-12-18,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-19,2020-12-19,0,0,-9,0,3,0,3
8115,A80+,W,2020-12-21,2020-12-19,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-14,2020-12-19,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-20,2020-12-20,0,0,-9,0,2,0,2
8115,A80+,W,2020-12-21,2020-12-21,0,0,-9,0,3,0,3
8115,A80+,W,2020-12-22,2020-12-21,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-22,2020-12-22,0,0,-9,0,8,0,8
8115,A80+,W,2020-12-22,2020-12-22,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-23,2020-12-23,0,0,-9,0,4,0,4
8115,A80+,W,2020-12-12,2020-12-23,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-24,2020-12-23,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-28,2020-12-23,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-31,2020-12-23,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-24,2020-12-24,0,0,-9,0,2,0,2
8115,A80+,W,2020-12-25,2020-12-25,0,0,-9,0,1,0,1
8115,A80+,W,2021-01-02,2020-12-25,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-26,2020-12-26,0,0,-9,0,3,0,3
8115,A80+,W,2020-12-30,2020-12-26,1,0,0,-9,1,1,0
8115,A80+,W,2021-01-02,2020-12-27,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-28,2020-12-28,0,0,-9,0,1,0,1
8115,A80+,W,2020-12-30,2020-12-28,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-01,2020-12-28,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-08,2020-12-28,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-13,2020-12-28,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-07,2020-12-28,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-29,2020-12-29,0,0,-9,0,5,0,5
8115,A80+,W,2021-01-02,2020-12-29,1,0,0,-9,1,1,0
8115,A80+,W,2020-12-30,2020-12-30,0,0,-9,0,3,0,3
8115,A80+,W,2021-01-01,2020-12-30,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-02,2020-12-30,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-05,2020-12-30,1,0,-9,0,1,0,1
8115,A80+,W,2020-12-31,2020-12-31,0,0,-9,0,1,0,1
8115,A80+,W,2021-01-01,2021-01-01,0,0,-9,0,4,0,4
8115,A80+,W,2021-01-14,2021-01-01,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-02,2021-01-02,0,0,-9,0,5,0,5
8115,A80+,W,2021-01-02,2021-01-02,0,0,0,-9,1,1,0
8115,A80+,W,2021-01-07,2021-01-02,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-02,2021-01-02,1,0,0,-9,1,1,0
8115,A80+,W,2021-01-03,2021-01-03,0,0,-9,0,3,0,3
8115,A80+,W,2021-01-04,2021-01-04,0,0,-9,0,1,0,1
8115,A80+,W,2021-01-01,2021-01-04,1,0,0,-9,1,1,0
8115,A80+,W,2021-01-08,2021-01-04,1,0,0,-9,1,1,0
8115,A80+,W,2021-01-05,2021-01-05,0,0,-9,0,12,0,12
8115,A80+,W,2020-12-23,2021-01-05,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-08,2021-01-05,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-07,2021-01-07,0,0,-9,0,7,0,7
8115,A80+,W,2021-01-07,2021-01-07,0,0,0,-9,1,1,0
8115,A80+,W,2021-01-09,2021-01-07,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-08,2021-01-08,0,0,-9,0,1,0,1
8115,A80+,W,2021-01-09,2021-01-09,0,0,-9,0,5,0,5
8115,A80+,W,2021-01-10,2021-01-10,0,0,-9,0,1,0,1
8115,A80+,W,2021-01-12,2021-01-10,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-11,2021-01-11,0,0,-9,0,1,0,1
8115,A80+,W,2021-01-12,2021-01-12,0,0,-9,0,3,0,3
8115,A80+,W,2021-01-12,2021-01-12,0,0,0,-9,1,1,0
8115,A80+,W,2021-01-13,2021-01-13,0,0,-9,0,2,0,2
8115,A80+,W,2021-01-14,2021-01-13,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-14,2021-01-14,0,0,-9,0,4,0,4
8115,A80+,W,2021-01-15,2021-01-15,0,0,-9,0,7,0,7
8115,A80+,W,2021-01-15,2021-01-15,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-16,2021-01-16,0,0,-9,0,3,0,3
8115,A80+,W,2021-01-18,2021-01-16,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-15,2021-01-16,1,0,0,-9,1,1,0
8115,A80+,W,2021-01-18,2021-01-18,0,0,-9,0,3,0,3
8115,A80+,W,2021-01-20,2021-01-20,0,0,-9,0,1,0,1
8115,A80+,W,2021-01-21,2021-01-20,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-22,2021-01-22,0,0,-9,0,2,0,2
8115,A80+,W,2021-01-25,2021-01-25,0,0,-9,0,1,0,1
8115,A80+,W,2020-12-29,2021-01-25,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-27,2021-01-25,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-28,2021-01-27,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-28,2021-01-28,1,0,-9,0,1,0,1
8115,A80+,W,2021-01-27,2021-02-01,1,0,-9,0,1,0,1
8115,A80+,W,2021-02-02,2021-02-02,0,0,-9,0,2,0,2
8115,A80+,W,2021-02-04,2021-02-02,1,0,0,-9,1,1,0
8115,A80+,W,2021-02-06,2021-02-06,0,0,-9,0,7,0,7
8115,A80+,W,2021-02-06,2021-02-06,0,0,0,-9,2,2,0
8115,A80+,W,2021-02-09,2021-02-07,1,0,0,-9,1,1,0
8115,A80+,W,2021-02-09,2021-02-09,0,0,-9,0,2,0,2
8115,A80+,W,2021-02-10,2021-02-09,1,0,-9,0,1,0,1
8115,A80+,W,2021-02-10,2021-02-10,0,0,-9,0,4,0,4
8115,A80+,W,2021-02-10,2021-02-10,0,0,0,-9,1,1,0
8115,A80+,W,2021-02-10,2021-02-10,1,0,0,-9,1,1,0
8115,A80+,W,2021-02-11,2021-02-11,0,0,0,-9,1,1,0
8115,A80+,W,2021-02-12,2021-02-12,0,0,-9,0,9,0,9
8115,A80+,W,2021-02-12,2021-02-12,0,0,0,-9,1,1,0
8115,A80+,W,2021-02-12,2021-02-12,1,0,0,-9,1,1,0
8115,A80+,W,2021-02-12,2021-02-13,1,0,-9,0,1,0,1
8115,A80+,W,2021-02-14,2021-02-14,0,0,-9,0,1,0,1
8115,A80+,W,2021-02-17,2021-02-14,1,0,-9,0,1,0,1
8115,A80+,W,2021-02-15,2021-02-15,0,0,-9,0,2,0,2
8115,A80+,W,2021-02-19,2021-02-15,1,0,0,-9,1,1,0
8115,A80+,W,2021-02-16,2021-02-16,0,0,-9,0,4,0,4
8115,A80+,W,2021-02-16,2021-02-16,0,0,0,-9,1,1,0
8115,A80+,W,2021-02-17,2021-02-17,0,0,-9,0,9,0,9
8115,A80+,W,2021-02-17,2021-02-17,0,0,0,-9,5,5,0
8115,A80+,W,2021-02-17,2021-02-17,1,0,0,-9,1,1,0
8115,A80+,W,2021-02-18,2021-02-18,0,0,-9,0,1,0,1
8115,A80+,W,2021-02-12,2021-02-18,1,0,0,-9,1,1,0
8115,A80+,W,2021-02-19,2021-02-19,0,0,-9,0,4,0,4
8115,A80+,W,2021-02-10,2021-02-19,1,0,0,-9,1,1,0
8115,A80+,W,2021-02-12,2021-02-19,1,0,0,-9,1,1,0
8115,A80+,W,2021-02-24,2021-02-24,0,0,-9,0,1,0,1
8115,A80+,W,2021-02-17,2021-02-24,1,0,0,-9,1,1,0
8115,A80+,W,2021-03-11,2021-03-01,1,0,-9,0,1,0,1
8115,A80+,W,2021-03-06,2021-03-06,0,0,-9,0,1,0,1
8115,A80+,W,2021-03-24,2021-03-13,1,0,-9,0,1,0,1
8115,A80+,W,2021-03-31,2021-03-17,1,0,-9,0,1,0,1
8115,A80+,W,2021-03-23,2021-03-17,1,0,0,-9,1,1,0
8115,A80+,W,2021-03-19,2021-03-18,1,0,-9,0,1,0,1
8115,A80+,W,2021-03-19,2021-03-19,0,0,-9,0,1,0,1
8115,A80+,W,2021-03-20,2021-03-20,0,0,-9,0,3,0,3
8115,A80+,W,2021-03-26,2021-03-20,1,0,-9,0,1,0,1
8115,A80+,W,2021-03-22,2021-03-22,0,0,-9,0,3,0,3
8115,A80+,W,2021-03-23,2021-03-23,0,0,-9,0,2,0,2
8115,A80+,W,2021-03-24,2021-03-24,0,0,-9,0,2,0,2
8115,A80+,W,2021-03-24,2021-03-24,0,0,0,-9,1,1,0
8115,A80+,W,2021-03-26,2021-03-26,0,0,-9,0,1,0,1
8115,A80+,W,2021-03-31,2021-03-26,1,0,0,-9,1,1,0
8115,A80+,W,2021-04-01,2021-03-29,1,0,-9,0,1,0,1
8115,A80+,W,2021-03-30,2021-03-29,1,0,0,-9,1,1,0
8115,A80+,W,2021-04-01,2021-04-01,0,0,0,-9,1,1,0
8115,A80+,W,2021-04-06,2021-04-02,1,0,0,-9,1,1,0
8115,A80+,W,2021-04-28,2021-04-05,1,0,-9,0,1,0,1
8115,A80+,W,2021-04-06,2021-04-06,0,0,-9,0,1,0,1
8115,A80+,W,2021-04-07,2021-04-06,1,0,-9,0,1,0,1
8115,A80+,W,2021-04-09,2021-04-06,1,0,-9,0,1,0,1
8115,A80+,W,2021-04-07,2021-04-07,0,0,-9,0,1,0,1
8115,A80+,W,2021-04-13,2021-04-09,1,0,-9,0,1,0,1
8115,A80+,W,2021-03-29,2021-04-09,1,0,0,-9,1,1,0
8115,A80+,W,2021-04-10,2021-04-10,0,0,-9,0,1,0,1
8115,A80+,W,2021-04-23,2021-04-11,1,0,-9,0,1,0,1
8115,A80+,W,2021-04-14,2021-04-14,0,0,-9,0,2,0,2
8115,A80+,W,2021-04-16,2021-04-16,0,0,-9,0,1,0,1
8115,A80+,W,2021-04-16,2021-04-16,1,0,-9,0,1,0,1
8115,A80+,W,2021-04-21,2021-04-16,1,0,-9,0,1,0,1
8115,A80+,W,2021-04-30,2021-04-17,1,0,-9,0,1,0,1
8115,A80+,W,2021-04-23,2021-04-23,0,0,-9,0,1,0,1
8115,A80+,W,2021-05-06,2021-04-26,1,0,-9,0,1,0,1
8115,A80+,W,2021-04-27,2021-04-27,0,0,-9,0,1,0,1
8115,A80+,W,2021-04-27,2021-04-27,0,0,0,-9,1,1,0
8115,A80+,W,2021-05-06,2021-05-01,1,0,-9,0,1,0,1
8115,A80+,W,2021-05-04,2021-05-04,0,0,-9,0,1,0,1
8115,A80+,W,2021-05-06,2021-05-06,0,0,-9,0,3,0,3
8115,A80+,W,2021-05-08,2021-05-06,1,0,-9,0,1,0,1
8115,A80+,W,2021-05-10,2021-05-07,1,0,-9,0,1,0,1
8115,A80+,W,2021-05-08,2021-05-08,0,0,-9,0,1,0,1
8115,A80+,W,2021-05-10,2021-05-10,0,0,-9,0,1,0,1
8115,A80+,W,2021-05-11,2021-05-11,0,0,-9,0,1,0,1
8115,A80+,W,2021-05-17,2021-05-17,0,0,-9,0,1,0,1
8115,A80+,W,2021-05-27,2021-05-27,0,0,-9,0,1,0,1
8115,A80+,W,2021-06-03,2021-06-03,0,0,-9,0,1,0,1
8115,A80+,W,2021-07-21,2021-07-19,1,0,-9,0,1,0,1
8115,A80+,W,2021-07-29,2021-07-29,0,0,-9,0,1,0,1
8115,A80+,W,2021-08-02,2021-08-02,0,0,-9,0,1,0,1
8115,A80+,W,2021-08-09,2021-08-09,0,0,-9,0,1,0,1
8115,A80+,W,2021-08-17,2021-08-13,1,0,-9,0,1,0,1
8115,A80+,W,2021-08-23,2021-08-15,1,0,-9,0,1,0,1
8115,A80+,W,2021-08-25,2021-08-21,1,0,0,-9,1,1,0
8115,A80+,W,2021-08-31,2021-08-29,1,0,-9,0,1,0,1
8115,A80+,W,2021-09-03,2021-08-30,1,0,-9,0,1,0,1
8115,A80+,W,2021-08-31,2021-08-31,0,0,-9,0,1,0,1
8115,A80+,W,2021-08-31,2021-08-31,0,0,0,-9,1,1,0
8115,A80+,W,2021-09-09,2021-09-04,1,0,-9,0,1,0,1
8115,A80+,W,2021-09-09,2021-09-07,1,0,-9,0,1,0,1
8115,A80+,W,2021-09-06,2021-09-08,1,0,-9,0,1,0,1
8115,A80+,W,2021-09-13,2021-09-08,1,0,-9,0,1,0,1
8115,A80+,W,2021-09-14,2021-09-14,0,0,0,-9,1,1,0
8115,A80+,W,2021-09-21,2021-09-14,1,0,-9,0,1,0,1
8115,A80+,W,2021-09-15,2021-09-15,0,0,-9,0,1,0,1
8115,A80+,W,2021-09-20,2021-09-17,1,0,-9,0,1,0,1
8115,A80+,W,2021-09-24,2021-09-21,1,0,-9,0,1,0,1
8115,A80+,W,2021-09-27,2021-09-25,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-02,2021-10-02,0,0,-9,0,1,0,1
8115,A80+,W,2021-10-02,2021-10-02,0,0,0,-9,1,1,0
8115,A80+,W,2021-10-13,2021-10-10,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-30,2021-10-10,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-22,2021-10-12,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-16,2021-10-13,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-17,2021-10-15,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-16,2021-10-15,1,0,0,-9,1,1,0
8115,A80+,W,2021-10-17,2021-10-15,1,0,0,-9,1,1,0
8115,A80+,W,2021-10-21,2021-10-17,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-22,2021-10-17,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-29,2021-10-17,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-20,2021-10-19,1,0,-9,0,2,0,2
8115,A80+,W,2021-10-20,2021-10-20,0,0,0,-9,1,1,0
8115,A80+,W,2021-10-21,2021-10-21,0,0,-9,0,2,0,2
8115,A80+,W,2021-10-23,2021-10-23,0,0,-9,0,1,0,1
8115,A80+,W,2021-10-27,2021-10-23,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-24,2021-10-24,0,0,-9,0,1,0,1
8115,A80+,W,2021-10-29,2021-10-24,1,0,-9,0,1,0,1
8115,A80+,W,2021-10-25,2021-10-25,0,0,-9,0,1,0,1
8115,A80+,W,2021-10-26,2021-10-26,0,0,-9,-9,1,0,0
8115,A80+,W,2021-10-26,2021-10-26,0,0,-9,0,3,0,3
8115,A80+,W,2021-10-26,2021-10-26,0,0,0,-9,1,1,0
8115,A80+,W,2021-10-27,2021-10-27,0,0,-9,0,1,0,1
8115,A80+,W,2021-11-02,2021-10-29,1,0,-9,0,1,0,1
8115,A80+,W,2021-11-02,2021-10-31,1,0,-9,0,1,0,1
8115,A80+,W,2021-11-03,2021-11-03,0,0,-9,-9,2,0,0
8115,A80+,W,2021-11-03,2021-11-03,0,0,-9,0,1,0,1
8115,A80+,W,2021-11-04,2021-11-04,0,0,-9,-9,2,0,0
8115,A80+,W,2021-11-05,2021-11-04,1,0,-9,0,1,0,1
8115,A80+,W,2021-11-07,2021-11-07,0,0,-9,-9,1,0,0
8115,A80+,W,2021-11-09,2021-11-09,0,0,-9,-9,4,0,0
8115,A80+,W,2021-11-10,2021-11-10,0,0,-9,-9,5,0,0
8115,A80+,W,2021-11-11,2021-11-11,0,0,-9,-9,7,0,0
8115,A80+,W,2021-11-12,2021-11-12,0,0,-9,-9,2,0,0
8115,A80+,W,2021-11-12,2021-11-12,0,1,-9,-9,1,0,0
8115,A80+,W,2021-11-13,2021-11-13,0,0,-9,-9,3,0,0
8115,A80+,W,2021-11-14,2021-11-14,0,0,-9,-9,1,0,0
8115,A80+,W,2021-11-15,2021-11-15,0,0,-9,-9,8,0,0
8115,A80+,W,2021-11-16,2021-11-16,0,0,-9,-9,1,0,0
8115,A80+,W,2021-11-17,2021-11-17,0,0,-9,-9,2,0,0
8115,A80+,W,2021-11-17,2021-11-17,0,1,-9,-9,1,0,0
8115,A80+,W,2021-11-18,2021-11-18,0,1,-9,-9,5,0,0
8115,A80+,W,2021-11-19,2021-11-19,0,1,-9,-9,2,0,0

(The CSV files were loaded directly from the repository.)

That file does contain a case group with a Meldedatum (reporting date) of 2021-11-14, but no recovery was reported for it (-9). As I understand it, for every case group that has -1 in (NeuerFall|NeuerTodesfall|NeuGenesen), the previous day's dataset should contain a case group with the same key (district + age group + sex + dates) that has a 0 or a 1 in the corresponding column (and at least as many cases as the case group with -1 retracts).
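The expectation described above can be written down as a small check. This is only a sketch under assumptions: the issue names just the NeuerFall, NeuerTodesfall and NeuGenesen columns, so the other column names, the column order, and the choice of the first five fields as the key are assumptions inferred from the rows quoted here. The function names are hypothetical.

```python
import csv
from collections import defaultdict

# Assumed column order; only the three Neu* names appear in the issue text.
COLUMNS = [
    "IdLandkreis", "Altersgruppe", "Geschlecht", "Meldedatum", "Refdatum",
    "IstErkrankungsbeginn", "NeuerFall", "NeuerTodesfall", "NeuGenesen",
    "AnzahlFall", "AnzahlTodesfall", "AnzahlGenesen",
]
# Assumed key: district, age group, sex and the two dates.
KEY = COLUMNS[:5]
FLAG_TO_COUNT = {
    "NeuerFall": "AnzahlFall",
    "NeuerTodesfall": "AnzahlTodesfall",
    "NeuGenesen": "AnzahlGenesen",
}


def parse_rows(lines):
    """Turn header-less CSV lines into dicts keyed by the assumed column names."""
    return [dict(zip(COLUMNS, row)) for row in csv.reader(lines) if row]


def unmatched_retractions(previous, current):
    """Return (row, flag) pairs from `current` whose -1 has no backing in `previous`."""
    available = defaultdict(int)
    for row in previous:
        key = tuple(row[c] for c in KEY)
        for flag, count in FLAG_TO_COUNT.items():
            if row[flag] in ("0", "1"):
                available[key, flag] += abs(int(row[count]))

    problems = []
    for row in current:
        key = tuple(row[c] for c in KEY)
        for flag, count in FLAG_TO_COUNT.items():
            if row[flag] == "-1" and available[key, flag] < abs(int(row[count])):
                problems.append((row, flag))
    return problems


# The two rows quoted in this issue: the 2021-11-20 entry and the 2021-11-21 retraction.
previous = parse_rows(["8115,A80+,W,2021-11-14,2021-11-14,0,0,-9,-9,1,0,0"])
current = parse_rows(["8115,A80+,W,2021-11-14,2021-11-14,0,0,-9,-1,1,0,-1"])
for row, flag in unmatched_retractions(previous, current):
    print(flag, row["Meldedatum"])
```

With the two quoted rows, the check flags the NeuGenesen retraction, because the only previous-day row with the same key carries -9 in that column.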

Am I missing something, or is there an error in the data?

@HannesWuensche
Contributor

Hello @horazont,

thank you very much for pointing this out!
Your explanation seems correct to me. I am passing the case on to my colleagues in the specialist IT department.

Best regards,
@HannesWuensche
for the RKI | Open Data team

horazont added a commit to horazont/covid that referenced this issue Nov 22, 2021
Retractions are tricky. The previous approach did not consider that they
may easily come from the distant past. And we don't know exactly from
when:

1. Either a case has been introduced before the "historic data
   available" cutoff (i.e. from before when we have daily full case
   files): Then it was recorded at the Meldedatum in the record.

2. Or a case has been introduced after the "historic data available"
   cutoff. In that case, it has been recorded in the dataset at the
   exact date at which the dataset was published.

Unfortunately, to resolve the second case, we lack sufficient data: We
do not know the publication date of any recorded record. We have to
guess and start working our way forward starting from the reported date
until we find a timeslot where at least as many cases have been added as
are being retracted.

This is obviously not without potential flaws. For instance, suppose a
case group is reported with 4 new cases on day X and 3 cases on day X+1,
and a later retraction aimed at the case group on day X+1 retracts all
three cases. Then we'll remove the cases on day X, because it is the
first bin with enough matching cases available. If another retraction
comes in and attempts to remove the case group from day X, it will not
find a matching bin: the one at day X only has 1 case left, and the one
on day X+1 only had 3 cases to begin with.

In such cases, we'll now log a warning; originally, I wanted to make
this panic, but it appears that at least one dataset has the issue of
retracting a case *which had never been reported* [1]! Hence, we cannot
be strict about this and need to hope that we'll not run into such a
situation too often.

(We can still detect it at a later point, because we'll see too many
cases in {cases,deaths,recovered}_pub_cum compared to the respective ref
series.)

   [1]: robert-koch-institut/SARS-CoV-2-Infektionen_in_Deutschland_Archiv#11
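
The forward search described in the commit message can be sketched roughly as follows. This is not the code from horazont/covid; it is a minimal illustration, and the function name, the bin representation (a dict from guessed publication day to remaining case count for one case group) and the placeholder date are all assumptions. It replays the 4-cases-then-3-cases scenario from the message.

```python
import logging
from datetime import date, timedelta

logger = logging.getLogger("retractions")


def apply_retraction(bins, reported, count):
    """Debit `count` cases from the first bin on or after `reported` that holds enough.

    Returns the day that was debited, or None (after logging a warning)
    if no bin holds at least `count` cases.
    """
    for day in sorted(bins):
        if day >= reported and bins[day] >= count:
            bins[day] -= count
            return day
    logger.warning("no bin on or after %s holds %d cases", reported, count)
    return None


# Scenario from the commit message: 4 cases on day X, 3 cases on day X+1.
day_x = date(2021, 11, 1)  # arbitrary placeholder for "day X"
bins = {day_x: 4, day_x + timedelta(days=1): 3}

first = apply_retraction(bins, day_x, 3)   # meant for X+1, but day X matches first
second = apply_retraction(bins, day_x, 4)  # meant for X, but no bin has 4 cases left
print(first, second, bins)
```

The first retraction is debited from day X, leaving 1 case there, so the second one finds no bin and only produces a warning, which is the situation the commit message describes.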
HannesWuensche changed the title from "Inconsistency between 2021-11-20 and 2021-11-21" to "Shape of the data when the age group, dates, district, or sex are corrected" on Nov 24, 2021
@HannesWuensche
Contributor

Hello @horazont,

my colleagues have taken a look at the data and found the following explanation:

The data pattern you describe can arise when the age group, the dates, the district, or the sex are corrected.
In such a case, matching the record to its previous case group is difficult, because the case group changes and cannot be found with the new case-group schema (e.g. grep -P '8115,A80\+,W,' 2021-11-20.csv).

In the case group 8115,A80+,W,2021-11-14,2021-11-14,0,0,-9,-1,1,0,-1, the recovery status, the age information, and the date of disease onset were all corrected.

In general, when we detect problems in the datasets, we ask the health offices to correct them. In the current situation, with the health offices completely overloaded, these corrections can take some time.

Best regards,
@HannesWuensche
for the RKI | Open Data team

@horazont
Author

Hello @HannesWuensche,

Thank you for the feedback. In plain terms, does that mean there is no way to map corrections to the previous dataset, because the correction is not based on the exact state of the previous day?

Unfortunate, but probably not something that can be changed.

Thank you!
