Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hospitalization data from South Korea #2225

Closed
WWolf opened this issue Jan 5, 2022 · 12 comments
Closed

Hospitalization data from South Korea #2225

WWolf opened this issue Jan 5, 2022 · 12 comments
Assignees
Labels
dom:hospital-icu Related to COVID-19 hospitalizations

Comments

@WWolf
Copy link
Contributor

WWolf commented Jan 5, 2022

This is related to the issue: #2219 (comment)

(a) The official English page contains the new hospital admissions stats (for past 7 days) under "Current status of new hospitalizations": http://ncov.mohw.go.kr/en/bdBoardList.do?brdId=16&brdGubun=161&dataGubun=&ncvContSeq=&contSeq=&board_id=

(b) It also contains "Current status of hospitalizations with moderate to severe symptoms" which is effectively the number of patients under ventilation+ (this is not daily admissions but the current number of patients): http://ncov.mohw.go.kr/en/bdBoardList.do?brdId=16&brdGubun=161&dataGubun=&ncvContSeq=&contSeq=&board_id=

I think (a) could be integrated to the automatic crawling system, and (b) might be relevant to the ICU patients stats.

Thank you!

@edomt
Copy link
Collaborator

edomt commented Jan 5, 2022

Hi @WWolf

Are full time series made available anywhere? It seems like these pages only include a snapshot of the latest figures, rather than a full historical time series.

@WWolf
Copy link
Contributor Author

WWolf commented Jan 6, 2022

Hi @edomt

The seriously ill and hospitalized (I think like to ICU patient stats, not admissions) are maintained daily here (from 2020-03-28 onward): https://docs.google.com/spreadsheets/d/10c9jNi8VnV0YYCfV_7AZrzBY5l18dOFHEJMIJsP4THI/edit#gid=334130338 (The same Seoul National University group that parses the official government records)

As for the daily hospital admissions, the page update came out after November, so no stats before that. If that is OK, I could backfill the stats from the individual governmental sources (they have usually much more detailed reports in Korean, see here for example: http://ncov.mohw.go.kr/tcmBoardView.do?brdId=3&brdGubun=31&dataGubun=&ncvContSeq=6243&board_id=312&contSeq=6243 , unfortunately nowhere near API-wise).

Thank you!

@edomt
Copy link
Collaborator

edomt commented Jan 7, 2022

Thanks! Do you know people from this group at Seoul National University? In order to use the data, we'd need to be able to access it as a CSV file.

(They can make the spreadsheet available by clicking on File > Share > Publish to the web.
Then select "Link", "Cases in Korea_Original", "Comma-separated values", click "Publish", and copy the link)

@edomt edomt added the dom:hospital-icu Related to COVID-19 hospitalizations label Jan 7, 2022
@WWolf
Copy link
Contributor Author

WWolf commented Jan 10, 2022

Hi @edomt,

I think one could just do one backfill, and then crawl all the relevant stats like #2217 as that has all the incremental values available. Would that work? Otherwise, I can contact them for the publication.

@WWolf
Copy link
Contributor Author

WWolf commented Jan 11, 2022

Like the pull request related to #2219 (comment),

I have scraped "new admission" and patients from moderate to severe conditions (related to ICU patients?) tallies starting from 2021-11-01 to present from official KDCA (Korean) documentations.

P.S. Exact definition of Hospitalizations with moderate to severe symptoms: patients receiving isolated treatment through high flow therapy, respirator, ECMO (extracorporeal membrane oxygenation), and CRRT (continuous renal replacement therapy)

I also checked that the ICU patient stat further goes back to 2020-03-28, and SNU ARIC dataset is faithfully represents the official stat. So I am wondering whether additional scrape code like #2217 could be done with backfills. If you could point to the file, I will make a pull request!

@edomt edomt self-assigned this Jan 11, 2022
@edomt
Copy link
Collaborator

edomt commented Jan 11, 2022

Thanks! The main issue for now is that we don't have the structure in place to accept incremental data collection for hospital & ICU data. The scripts are located in scripts/src/cowidev/hosp/sources but they are all batch scripts, i.e. they download or scrape an entire time series and return it to generate the dataset.

Potentially, if could you host the whole time series as a CSV somewhere (e.g. your own GitHub repo) and update it somewhat regularly, we could pull it directly from there.

@WWolf
Copy link
Contributor Author

WWolf commented Jan 11, 2022

Hi @edomt,

I have generated a simple repository: https://github.com/WWolf/korea-covid19-hosp-data

Hope this is OK. I will try to update it regularly (hope to do daily, but at least weekly).

@edomt
Copy link
Collaborator

edomt commented Jan 12, 2022

Thank you very much! The data is now live:

@edomt edomt closed this as completed Jan 12, 2022
@WWolf
Copy link
Contributor Author

WWolf commented Feb 28, 2022

Hi, this is a slight update:

I found that KDCA also has Monday weekly reports of weekly hospital admissions, and ICU new admissions (see screenshot and link ).

image

The weekly hospital admissions can be inferred from the daily admissions that I am keeping track on github, but ICU new admissions might be something that could be added as well. So I have added a CSV github table and have updated this retrospectively up to last October.

https://github.com/WWolf/korea-covid19-hosp-data/blob/main/weekly_icu.csv

Hope this could be potentially put into ICU admissions in OWID.

@edomt edomt reopened this Mar 7, 2022
@edomt
Copy link
Collaborator

edomt commented Mar 7, 2022

Thank you @WWolf !

@edomt edomt closed this as completed in a2a9a35 Mar 7, 2022
@WWolf
Copy link
Contributor Author

WWolf commented Sep 17, 2022

Hi @edomt, if this is of any use, I have added a daily tracker for the total beds that are utilized in South Korea (the columns indicate the severity). So in essence, the sum of all columns for a given date would be comparable to the Hospital patients metric in OWID. If you intend to incorporate this, I can scrape also older data before 2022-07-25 (by scraping).

https://github.com/WWolf/korea-covid19-hosp-data/blob/main/beds.csv

@edomt
Copy link
Collaborator

edomt commented Sep 19, 2022

Thank you @WWolf! We've just updated our script to include this data, it should go live over the next 24 hours.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dom:hospital-icu Related to COVID-19 hospitalizations
Projects
None yet
Development

No branches or pull requests

2 participants