This dataset is designed for providing COVID-19 related data. Two types of data are contained: the epidemic data and events data.
This data is applied in AMiner COVID-19 Dashboard where several applications are integrated based on this dataset, including Epidemic Map, Trend Prediction, COVID-19 Risk Index, Events Relation Analysis, Events Influence Estimation and Entity Influence Estimation.
The epidemic data and events data are integrated from several data sources. A full list of data sources is listed below.
Download Links: Epimemic Data, Events Data. To view previously published data, refer to changelog for more data links.
For epidemic data, we provide historical data for over 1,000 regions. The first date can be traced back to January 22rd, 2020 when the COVID-19 pandemic is officially reported by the China government for the first time.
For events data, two major types of data are included. The Social News is automatically collected from several authoritative sources such as XINHUANET or WHO. The Academic Progress part can be further divided into three categories: Research Paper, Expert Opinion and Academic Activity. Some of these data are manually annotated while others are similarly automatically retrieved. To see detailed statistics of each data type, refer to changelog.
The epidemic data is organized into CSV data format and the events data is saved in JSON format since it has more complex structure. The epidemic data is updated daily and the events data is updated weekly.
WHO Covid situation reports ncov2019.live ncov.dxy.cn 163.com Bing
The Lancet Science Nature JAMA Network bioRxiv medRxiv ChemRxiv scienceDirect 中科院COVID-19科研动态监测 arXiv SCI666 科学网
WHO Rolling updates on coronavirus disease xinhuanet.com Worldometers 记疫-南都传媒
- The data provided in this repository, mostly comes from sources listed above, with some processing techniques applied to align names and detech abnormal changes. Some part of the data are collected and annotated by the AMiner team.
- This data can be only used for non-profit public health, educational, and academic research purposes. The commercial use is strictly prohibited.