Code Monkey home page Code Monkey logo

ebola's Introduction

Data for the 2014 Global Ebola outbeak

Announcements

Datamarket has made these data available through their API here. Their website also has interactive visualizations, and allows export to other file formats. As you browse the data on their site, please take note of possible errors, for example when cumulative counts temporarily drop. Please be aggressive about identifying and correcting these errors through pull requests, so we can improve our data quality.

Contents

  • country_timeseries.csv contains a time series of case counts and deaths is from the World Health Organization and WHO situation reports.

  • liberia_data/ contains .csv files of data provided by the Liberia Ministry of Health. I have noticed the data are somewhat inconsistent. Cross-check the data when analyzing.

  • sl_data/ contains .csv files of data provided by the Sierra Leone Ministry of Health

  • guinea_data/ contains a mix of .csv and PDF files from the Guinea Ministry of Health. These data are not consistently available online, so I will keep the PDFs in the repo for reference.

  • mali_data/ contains a mix of .csv and PDF files from the Mali Ministry of Health.

  • who_data/ contains data from the WHO that compare sitrep case counts with patient database counts for select cities and countries.

  • data_products/ contains analyses, processing scripts, etc. Highlights include:

    • liberia_data.py converts the liberia_data csv files into a multidimensional pandas dataframe. Pandas is a requirement for this script. Optional argument allows output to .csv. You can run this script with ./liberia_data.py --help to learn more.
  • line_list.csv is a line listing I manually compiled from media reports and published case series of case clusters. It is unverified and almost certainly contains errors. Use with extreme caution. The legrand compartment specifies with infectious compartment each case would originate from in the Legrand et al model. The source_id column is the case_id of the node from whom the case was infected.

  • Sierraleone_country.csv and SierraLeone_town.csv is from the Sierra Leone Ministry of Health website. Data in SierraLeone_town.csv is cumlative confirmed cases - counts do not include suspected or probable cases. These spreadsheets will no longer be updated as of Sept 12 (newer data can be found in the sl_data/* files), but pull requests will be accepted.

How to use

If you are not familiar with Github, click the Download Zip button on the right, at the bottom of vertical menu.

Disclaimer

I cannot guarantee the accuracy of this data. These data are digitized by hand, so there may be data entry errors; there may also be changes and errors in the source data. I will provide updates when possible.

Pull requests welcome.

Contact

I am Caitlin Rivers, a grad student in computational epidemiology at the Network Dynamics and Simulation Science Laboratory at Virginia Tech. Also see the NDSSL website for additional Ebola data resources. You can reach me at:

Please note: I receive numerous requests every day for customized versions of these data. While I appreciate that these data are in demand and am glad they are useful to you, I simply do not have time to provide customized versions.

ebola's People

Contributors

pallih avatar chendaniely avatar samccone avatar jsoma avatar pierrepo avatar rikblok avatar chrisvoncsefalvay avatar donpdonp avatar carlosp420 avatar waleo avatar aflaxman avatar chenghlee avatar dkergl avatar elofgren avatar gfairchild avatar grlurton avatar kdodia avatar luiscape avatar reidpr avatar runarberg avatar sergestinckwich avatar shawnacscott avatar rcquan avatar seanbeatty avatar

Watchers

John Tigue avatar James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.