Skip to content

trianglegrrl/ebola

 
 

Repository files navigation

Data for the 2014 Global Ebola outbeak

Announcements

Datamarket has made these data available through their API here. Their website also has interactive visualizations, and allows export to other file formats. As you browse the data on their site, please take note of possible errors, for example when cumulative counts temporarily drop. Please be aggressive about identifying and correcting these errors through pull requests, so we can improve our data quality.

Contents

  • country_timeseries.csv contains a time series of case counts and deaths is from the World Health Organization and WHO situation reports.

  • liberia_data/ contains .csv files of data provided by the Liberia Ministry of Health. I have noticed the data are somewhat inconsistent. Cross-check the data when analyzing.

  • sl_data/ contains .csv files of data provided by the Sierra Leone Ministry of Health

  • guinea_data/ contains a mix of .csv and PDF files from the Guinea Ministry of Health. These data are not available online, so I will keep the PDFs in the repo for reference.

  • liberia_data.py converts the liberia_data csv files into a multidimensional pandas dataframe. Pandas is a requirement for this script. Optional argument allows output to .csv.

    • You can run this script with ./liberia_data.py --help to learn more.
  • line_list.csv is a line listing I manually compiled from media reports and published case series of case clusters. It is unverified and almost certainly contains errors. Use with extreme caution. The legrand compartment specifies with infectious compartment each case would originate from in the Legrand et al model. The source_id column is the case_id of the node from whom the case was infected.

  • Sierraleone_country.csv and SierraLeone_town.csv is from the Sierra Leone Ministry of Health website. Data in SierraLeone_town.csv is cumlative confirmed cases - counts do not include suspected or probable cases. These spreadsheets will no longer be updated as of Sept 12 (newer data can be found in the sl_data/* files), but pull requests will be accepted.

Epicurves

Contribute

The analysis folder and analyses/README.md file contains community-contributed analyses. If you have something to contribute, send a pull request or email.

How to use

If you are not familiar with Github, click the Download Zip button on the right, at the bottom of vertical menu.

Disclaimer

I cannot guarantee the accuracy of this data. These data are digitized by hand, so there may be data entry errors; there may also be changes and errors in the source data. I will provide updates when possible.

Pull requests welcome.

Contact

I am Caitlin Rivers, a grad student in computational epidemiology at the Network Dynamics and Simulation Science Laboratory at Virginia Tech. Also see the NDSSL website for additional Ebola data resources. You can reach me at:

Please note: I receive numerous requests every day for customized versions of these data. While I appreciate that these data are in demand and am glad they are useful to you, I simply do not have time to provide customized versions.

About

Data for the 2014 ebola outbeak in West Africa

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published