Code Monkey home page Code Monkey logo

covid's Introduction

Covid-19 Data Resources

This repository aims to provide a backbone of high geographic resolution administrative data to support analysis of and the policy response to the COVID-19 pandemic in India.

The current version includes estimates of hospital and clinic doctor and bed capacity (district level, and soon subdistrict), CFR predictions based on variation in local population age distribution (subdistrict level), urbanization rates and population density (subdistrict level and lower), as well as deaths and infections at the highest resolution possible. Crucially, all of these are described with common location identifiers, making it easy to link them together and to external data sources. Data are disaggregated by urban/rural where possible.

We have phone surveys on economic conditions in the field, which we will include here as they arrive. We will also harmonize and include data from as many other teams' surveys as possible, given data availability.

We are updating and adding to this repo as quickly as possible. If you are part of a team working with policymakers or researchers on the COVID-19 response in India and need administrative data not in this repository, please contact us ([email protected]) and we will add it to our list if we can obtain it.

This is an effort by Development Data Lab, led by Professors Sam Asher (Johns Hopkins SAIS) and Paul Novosad (Dartmouth College). If you use these data, please reference the source. This helps us continue to provide and develop this service. If you are interested in funding more rapid development of this data platform, please contact us at [email protected].

Data Folder Structure and Metadata

Relative paths here refer to paths in the data folder. CSV files are in a csv/ subfolder in each path.

Folder Description Data Files Metadata
covid/ Number of confirmed cases and deaths by date, district covid/covid_cases_deaths_district Link
Case-level patient data with state/district ids covid/covid_deaths_recoveries
demography/ Age pyramid of every district and subdistrict demography/age_bins_(sub)district_t Link
(Sub)district level slum populations, pop density, urbanization rates (PC) demography/pc11_demographics_(sub)district Link
estimates/ Modeled district hospital/clinic bed and doctor counts (EC,PC,DLHS) estimates/hospitals_dist Link
Modeled age-structured based fatality rate predictions estimates/(sub)district_age_dist_cfr
hospitals/ District-level public hospital/clinic bed and doctor counts from DLHS hospitals/dlhs4_hospitals_dist Link
(Sub)district-level public hospital/clinic bed and doctor counts from Pop Census hospitals/pc_hospitals_(sub)dist Link
District- and town/village-level hospital public/private hospital employment (EC) hospitals/ec_hospitals_(dist,tv) Link
keys/ Correspondences to link different datasets

Data Identified for Inclusion

Data Description Geographic level Scope
Comorbidity rates Local mortality multipliers based on rates of common conditions known to correlate with Covid-19 morbidity, such as diabetes. Source: NSS District
Gender composition Sex ratios in five year age bins State/District/Subdistrict
Lockdown policies Government-imposed restrictions/social distancing with details and dates State/District
Slums Slum populations, areas, proportions State/District/Town
Poverty Small area estimation consumption per capita and poverty rate estimates based on the Socioeconomic and Caste Census State/District/Subdistrict
Health staff Number of doctors, nurses, employees of health centers, etc. State/District/Subdistrict Total/Urban/Rural
Sectoral composition Share of employment in important sectors of the economy State/District/Subdistrict
Hospital Bed Availability by District

Repo / Code Structure

Directory Explanation
a/ Analysis file with material used in the current version of the data build.
b/ Build folder.
e/ Explore folder.
assets/ Various web assets, such as images embedded in README.md.

The root path of the folder only has one code file:

  • make_covid.do, which runs the full build and the full analysis.

Code Globals

This repository's build refers to locations of code and data using Stata global variables. You will need to set the following globals to run the code:

Global Explanation
$tmp A temporary folder for intermediate data and outputs.
$ccode Root folder for this repository.
$covidpub Root data folder for this repo.

A global can be set in Stata with e.g. global tmp temporary/directory/location.

Build Diagram

The full build, including both code and data, is diagrammatically described here.

Downloading the Data

This repository is structured such that the first half runs on DDL servers to produce datasets that serve as inputs for the COVID-related analytics, like the EC microdata file, the DLHS district-level aggregates, and a shortened VD/TD/PCA. The second half then needs to run on those files to produce the final outputs, like the hospital bed estimates.

You can download the data here:

Download Data

Bugs / Requests

If you come across bugs in the data or have a specific data request that we are likely to have, you can email [email protected] or email Paul, Sam or Toby directly.

The Team

This repo is a collaborative effort led by the Development Data Lab, co-founded by Sam Asher, Toby Lunt, and Paul Novosad. Additional contributors: Aditi Bhowmick, Ali Campion, Radhika Jain, Sam Besse, Kritarth Jha.

covid's People

Contributors

aditib738 avatar alison-campion avatar kritarth5 avatar paulnov avatar seasher04 avatar tobylunt avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.