cssegisanddata / covid-19

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE

Home Page: https://systems.jhu.edu/research/public-health/ncov/

johns-hopkins-university systems-science engineering covid-19 2019-ncov coronavirus csse jhu

covid-19's Introduction

COVID-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University

On March 10, 2023, the Johns Hopkins Coronavirus Resource Center ceased its collecting and reporting of global COVID-19 data. For updated cases, deaths, and vaccine data please visit the following sources:

Global: World Health Organization (WHO)
U.S.: U.S. Centers for Disease Control and Prevention (CDC)

For more information, visit the Johns Hopkins Coronavirus Resource Center.

This is the data repository for the 2019 Novel Coronavirus Visual Dashboard operated by the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE). It is also supported by the ESRI Living Atlas Team and the Johns Hopkins University Applied Physics Lab (JHU APL).

Visual Dashboard (desktop): https://www.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6

Visual Dashboard (mobile): http://www.arcgis.com/apps/opsdashboard/index.html#/85320e2ea5424dfaaa75ae62e5c06e61

Please cite our Lancet Article for any use of this data in a publication: An interactive web-based dashboard to track COVID-19 in real time

The Johns Hopkins University Center for Systems Science and Engineering COVID-19 Dashboard: data collection process, challenges faced, and lessons learned

Provided by Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE): https://systems.jhu.edu/

DONATE to the CSSE dashboard team: https://engineering.jhu.edu/covid-19/support-the-csse-covid-19-dashboard-team/

DATA SOURCES: This is a complete list of all sources ever used in the data set since January 21, 2020. Some sources listed here (e.g. ECDC, US CDC, BNO News) are not currently relied upon as a source of data.

Aggregated data sources:
- World Health Organization (WHO): https://www.who.int/
- European Centre for Disease Prevention and Control (ECDC): https://www.ecdc.europa.eu/en/geographical-distribution-2019-ncov-cases
- DXY.cn. Pneumonia. 2020. https://ncov.dxy.cn/ncovh5/view/pneumonia?from=dxy&source=&link=&share=
- QQ News https://news.qq.com/zt2020/page/feiyan.htm#/
- US CDC: https://www.cdc.gov/coronavirus/2019-ncov/index.html
- BNO News: https://bnonews.com/index.php/2020/02/the-latest-coronavirus-cases/
- WorldoMeters: https://www.worldometers.info/coronavirus/
- 1Point3Acres: https://coronavirus.1point3acres.com/en
- COVID Tracking Project: https://covidtracking.com/data. (US Testing and Hospitalization Data. We use the maximum reported value from "Currently" and "Cumulative" Hospitalized for our hospitalization number reported for each state.)
- Los Angeles Times: https://www.latimes.com/projects/california-coronavirus-cases-tracking-outbreak/
- The Mercury News: https://www.mercurynews.com/tag/coronavirus/
US data sources at the state (Admin1) or county/city (Admin2) level:
- Alabama: Department of Public Health
- Alaska: Department of Health and Social Services
- Arizona: Department of Health Services
- Arkansas: Department of Health
- California: Department of Public Health
  - Mariposa County
  - Alameda County
  - Fresno County
  - Humboldt County
  - Imperial County
  - Los Angeles County
  - Madera County
  - Marin County
  - Mendocino County
  - Orange County
  - Placer County
  - Riverside County
  - Sacramento County
  - San Benito County
  - San Bernardino County
  - San Diego County
  - San Francisco
  - San Joaquin County
  - San Mateo County
  - Santa Clara County
  - Santa Cruz County
  - Shasta County
  - Solano County
  - Sonoma County
  - Stanislaus County
  - Ventura County
  - Yolo County
- Colorado: Department of Public Health and Environment
  - Colorado Department of Public Health and Environment Open Data Portal
- Connecticut: Department of Public Health
- Delaware: Emergency Management Agency
- District of Columbia: Government of The District of Columbia
- Florida: Department of Health & U.S. Department of Health & Human Services
- Georgia: Department of Public Health
- Guam: Department of Public Health and Social Services
- Hawaii: Department of Health
- Idaho: State Government
- Illinois: Department of Public Health
- Indiana: Department of Health
- Iowa: State Government
- Kansas: Department Of Health And Environment
  - Douglas County
  - Finney County
  - Riley County
  - ~~Reno County~~
- Kentucky: Department of Public Health
- Louisiana: Department of Health
- Maine: Department of Health and Human Services
- Maryland: Department of Health
  - State GitHub
- Massachusetts: Department of Public Health
- Michigan: Michigan.gov
- Minnesota: Department of Health
- Mississippi: Department of Health
- Missouri: Department of Health
  - Nodaway County
  - St. Louis City
  - St. Louis County
- Montana: Department of Public Health and Human Services
- Nebraska
  - Nebraska Department of Health and Human Services
  - Centers for Disease Control and Prevention
- Nevada: Department of Health and Human Services
  - Reno County Health Department
- New Hampshire: Department of Health and Human Services
  - Department of Health and Human Services Press Releases
- New Jersey: Department of Health
- New Mexico: Department of Health
- New York: State Department of Health
  - New York City Health Department
  - NYC Department of Health and Mental Hygiene & Github Repo
- North Carolina: Department of Health and Human Services
- North Dakota: Department of Health
- Northern Mariana Islands: Northern Mariana Islands Commonwealth Dept of Public Health
- Ohio: Department of Health
- Oklahoma: Department of Health
- Oregon: Health Authority
- Pennsylvania: Department of Health
  - Philadelphia
  - Lancaster County
  - Chester County
- Puerto Rico: Departamento de Salud
- Rhode Island: Department of Health
- South Carolina: Department of Health and Environmental Control
- South Dakota: Department of Health
- Tennessee: Department of Health
- Texas: Department of State Health Services
  - Amarillo County
  - Brazoria County
  - Brazos County
  - Cameron County
  - Collin County
  - Corpus Christi
  - Denton County
    - Note: The dashboard includes cases identified via at-home antigen tests in the case total. We exclude these cases from our reported total.
  - Ector County
  - City of El Paso
  - Fayette County
  - Fort Bend County
  - Galveston County Health District
  - Harris County
  - Hays County
  - Hidalgo County
  - Laredo
  - Midland County
  - Mount Pleasant
  - Montgomery County
  - Potter County
    - We are aware that the totals reported on our dashboard do not match the headline numbers on the dashboard frontend. We can confirm that our numbers match the dashboard backend and the totals within the graphs on the site. We do not know why there is a discrepancy and have reached out to the county for further information.
  - San Angelo 1
  - San Angelo 2
  - San Antonio
  - Tarrant County
  - Travis County
  - Williamson County
- Utah: Department of Health
- Vermont: Department of Health
- Virgin Islands: Department of Health and COVID-19 Report
- Virginia: Department of Health
- Washington: Department of Health
- West Virginia: Department of Health & Human Resources
- Wisconsin: Department of Health Services (https://data.dhsgis.wi.gov/datasets/wi-dhs::covid-19-data-by-county-v2/about)
- Wyoming: Department of Health
Non-US data sources at the country/region (Admin0) or state/province (Admin1) level:
- Albania:
  - National Agency for Information Society: https://coronavirus.al/statistika/
- Argentina:
  - Ministry of Health: https://www.argentina.gob.ar/salud/coronavirus-COVID-19/sala-situacion
- Australia:
  - Government Department of Health: https://www.health.gov.au/news/coronavirus-update-at-a-glance
  - COVID Live: https://www.covidlive.com.au/
- Azerbaijan:
  - Azerbaijan Operational Headquarters under the Cabinet of Ministers: https://koronavirusinfo.az/az
- Belarus:
  - Ministry of Health: https://stopcovid.belta.by/
- Belgium:
  - Sciensano: https://datastudio.google.com/embed/reporting/c14a5cfc-cab7-4812-848c-0369173148ab/page/giyUB
- Brazil:
  - Ministry of Health: https://covid.saude.gov.br/
  - Federal University of Viçosa - Brazil: https://github.com/wcota/covid19br (Data described in DOI: 10.1590/SciELOPreprints.362)
- Burma (Myanmar):
  - Myanmar Ministry of Health and Sports: https://doph.maps.arcgis.com/apps/dashboards/f8fb4ccc3d2d42c7ab0590dbb3fc26b8
- Canada:
  - Government of Alberta: https://www.alberta.ca/covid-19-alberta-data.aspx
  - Government of British Columbia Centre for Disease Control: https://experience.arcgis.com/experience/a6f23959a8b14bfa989e3cda29297ded
  - Government of Canada: https://www.canada.ca/en/public-health/services/diseases/coronavirus.html
  - Government of Manitoba: https://www.gov.mb.ca/covid19/updates/cases.html
  - Government of New Brunswick: https://experience.arcgis.com/experience/8eeb9a2052d641c996dba5de8f25a8aa
  - Government of Northwest Territories: https://www.gov.nt.ca/covid-19/
  - Government of Ontario: https://covid-19.ontario.ca/data
    - Ottawa Public Health: https://www.ottawapublichealth.ca/en/reports-research-and-statistics/daily-covid19-dashboard.aspx
    - Toronto Public Health: https://www.toronto.ca/home/covid-19/covid-19-latest-city-of-toronto-news/covid-19-status-of-cases-in-toronto/
    - Region of Peel: https://peelregion.ca/coronavirus/case-status/
    - Region of Halton: https://www.halton.ca/For-Residents/Immunizations-Preventable-Disease/Diseases-Infections/New-Coronavirus
  - Government of Prince Edward Island: https://www.princeedwardisland.ca/en/information/health-and-wellness/pei-covid-19-case-data
  - Government of Quebec: https://www.quebec.ca/en/health/health-issues/a-z/2019-coronavirus/situation-coronavirus-in-quebec/
  - Nunavut Department of Health: https://www.gov.nu.ca/health/information/covid-19-novel-coronavirus
- Chile:
  - Ministry of Health: https://www.minsal.cl/nuevo-coronavirus-2019-ncov/casos-confirmados-en-chile-covid-19/
  - Ministry of Communications: https://www.gob.cl/coronavirus/cifrasoficiales/
- China:
  - qq.com: https://news.qq.com/zt2020/page/feiyan.htm#/
  - ~~National Health Commission of the People’s Republic of China (NHC): http://www.nhc.gov.cn/xcs/yqtb/list_gzbd.shtml~~
  - ~~China CDC (CCDC): http://weekly.chinacdc.cn/news/TrackingtheEpidemic.htm~~
- Colombia:
  - National Institute of Health: http://www.ins.gov.co/Noticias/Paginas/Coronavirus.aspx
- Czech Republic (Czechia):
  - National Health Information System, Regional Hygiene Stations, Ministry of Health of the Czech Republic: https://onemocneni-aktualne.mzcr.cz/covid-19
- Denmark:
  - Statens Serum Institute: https://experience.arcgis.com/experience/aa41b29149f24e20a4007a0c4e13db1d
- Ecuador:
  - Ministry of Public Health: https://www.salud.gob.ec/actualizacion-de-casos-de-coronavirus-en-ecuador/
- El Salvador:
  - Government of El Salvador: https://covid19.gob.sv/
- Finland:
  - THL/National Infectious Disease Register: https://experience.arcgis.com/experience/92e9bb33fac744c9a084381fc35aa3c7
- France:
  - French Ministry of Solidarity and Health and Public Health Dashboard: https://dashboard.covid19.data.gouv.fr/ (retired)
  - French Ministry of Solidarity and Health and Public Health Data: https://www.data.gouv.fr/en/datasets/donnees-relatives-a-lepidemie-de-covid-19-en-france-vue-densemble/ (stopped on May 17, 2022)
  - French Ministry of Solidarity and Health and Public Health Data: https://www.data.gouv.fr/fr/datasets/synthese-des-indicateurs-de-suivi-de-lepidemie-covid-19/
  - OpenCOVID19: https://github.com/opencovid19-fr (retired)
- Georgia:
  - Government of Georgia Ministry of Health: https://stopcov.ge/en
- Germany:
  - Berliner Morgenpost: https://interaktiv.morgenpost.de/corona-virus-karte-infektionen-deutschland-weltweit/ (retired March 31, 2022)
  - Robert Koch Institute: https://www.rki.de/EN/Content/infections/epidemiology/outbreaks/COVID-19/COVID19.html
- Greece:
  - National Public Health Organization: https://covid19.gov.gr/covid19-live-analytics
- Guatemala:
  - Ministerio de Salud Publica Y Asistencia Social: https://tablerocovid.mspas.gob.gt/
- Hong Kong SAR:
  - The Government of The Hong Kong Special Administrative Region Website: https://www.chp.gov.hk/en/features/102465.html
  - The Government of The Hong Kong Special Administrative Region Dashboard: https://chp-dashboard.geodata.gov.hk/covid-19/en.html
- Hungary:
  - ~~Government of Hungary: https://koronavirus.gov.hu/~~ Discontinued 01/03/2023
- Iceland:
  - Directorate of Health and Department of Civil Protection and Emergency Management: https://www.covid.is/data
- India:
  - Government of India: https://www.mygov.in/covid-19
- Indonesia:
  - National Board for Disaster Management: https://covid19.go.id/peta-sebaran
- Ireland:
  - Government of Ireland: https://covid19ireland-geohive.hub.arcgis.com/
- Israel:
  - Ministry of Health Website: https://govextra.gov.il/ministry-of-health/corona/corona-virus/
  - Ministry of Health Dashboard: https://datadashboard.health.gov.il/COVID-19/general
- Italy:
  - Civil Protection Department: https://github.com/pcm-dpc/COVID-19/tree/master/
  - Ministry of Health: http://www.salute.gov.it/nuovocoronavirus
- Japan:
  - MHLW: https://covid19.mhlw.go.jp/extensions/public/en/index.html
- Jordan:
  - Ministry of Health: https://corona.moh.gov.jo/en
- Kazakhstan:
  - Kazinform: https://www.coronavirus2020.kz/
- Kosovo:
  - National Institute of Health of Kosovo Dashboard: https://corona-ks.info/?lang=en
  - National Institute of Health of Kosovo JSON: https://raw.githubusercontent.com/bgeVam/Kosovo-Coronatracker-Data/master/data.json
  - National Institute of Health of Kosovo Data Studio Dashboard: https://datastudio.google.com/embed/reporting/2e546d77-8f7b-4c35-8502-38533aa0e9e8/page/MT0qB
- Lebanon:
  - Lebanese Ministry Of Information: https://corona.ministryinfo.gov.lb/
- Lithuania:
  - Government of Lithuania: https://experience.arcgis.com/experience/cab84dcfe0464c2a8050a78f817924ca
- Luxembourg:
  - Government of Luxembourg: https://data.public.lu/fr/datasets/covid-19-rapports-journaliers/#_
- Macau SAR:
  - Health Services of the Government of the Macau Special Administrative Region: https://www.ssm.gov.mo/portal/
- Malaysia:
  - Ministry of Health Website: https://covid-19.moh.gov.my/
  - Ministry of Health Dashboard: https://covidnow.moh.gov.my/bm/cases
  - Official data on the COVID-19 epidemic in Malaysia. Powered by CPRC, CPRC Hospital System, MKAK, and MySejahtera: https://github.com/MoH-Malaysia/covid19-public
- Mexico:
  - Government of Mexico: https://datos.covid-19.conacyt.mx/#DOView
- Monaco:
  - Gouvernement Princier Principaute de Monaco: https://www.gouv.mc/Action-Gouvernementale/Coronavirus-Covid-19/Actualites
- Netherlands:
  - National Institute for Health and Environment: https://experience.arcgis.com/experience/ea064047519040469acb8da05c0f100d
- New Zealand:
  - Ministry of Health: https://www.health.govt.nz/our-work/diseases-and-conditions/covid-19-novel-coronavirus/covid-19-data-and-statistics/covid-19-current-cases
  - Government of Cook Islands: https://covid19.gov.ck/
- Palau:
  - Ministry of Health & Human Services: http://www.palauhealth.org/2019nCoV_SitRep/MOH-COVID-19%20Situation%20Report.pdf
- Paraguay:
  - Ministerio de Salud Publica Y Bienestar Social: https://www.mspbs.gov.py/reporte-covid19.html
- Pakistan:
  - Government of Pakistan: http://covid.gov.pk/stats/pakistan
- Peru:
  - Ministry of Health Dashboard: https://covid19.minsa.gob.pe/sala_situacional.asp
  - Ministry of Health Press Releases: https://www.gob.pe/busquedas?categoria[]=6-salud&contenido[]=noticias&institucion[]=minsa&sheet=1&sort_by=recent&tipo_noticia[]=3-comunicado
- Philippines:
  - Republic of Philippines Department of Health: https://doh.gov.ph/covid19tracker
- Poland:
  - Service of the Republic of Poland: https://www.gov.pl/web/koronawirus/wykaz-zarazen-koronawirusem-sars-cov-2
- Portugal:
  - General Directorate of Health: https://esriportugal.maps.arcgis.com/apps/dashboards/acf023da9a0b4f9dbb2332c13f635829 (expired on March 13, 2022. Transferred to WHO.)
- Romania:
  - Government of Romania: https://datelazi.ro/
- Russia:
  - Government of The Russian Federation: https://xn--80aesfpebagmfblc0a.xn--p1ai/information/
- Saudi Arabia:
  - Saudi Arabia Ministry of Health: https://covid19.moh.gov.sa/
- Serbia:
  - Ministry of Health of the Republic of Serbia: https://covid19.rs/homepage-english/
- Singapore:
  - Singapore Ministry of Health: https://www.moh.gov.sg/covid-19
- Slovakia:
  - Ministry of Investment, Regional Development and Information: https://korona.gov.sk/koronavirus-na-slovensku-v-cislach/#covid-aut-nasledujuci-pondelok
- Slovenia:
  - Sledilnik: https://covid-19.sledilnik.org/en/stats
- South Africa:
  - South Africa Department of Health: https://sacoronavirus.co.za/
- South Korea:
  - Ministry of Health and Welfare: http://ncov.mohw.go.kr/
- Spain:
  - RTVE: https://www.rtve.es/noticias/20200514/mapa-del-coronavirus-espana/2004681.shtml
- Sweden:
  - The Swedish Public Health Agency: https://experience.arcgis.com/experience/09f821667ce64bf7be6f9f87457ed9aa
- Switzerland:
  - Federal Office Of Public Health: https://www.bag.admin.ch/bag/en/home/krankheiten/ausbrueche-epidemien-pandemien/aktuelle-ausbrueche-epidemien/novel-cov/situation-schweiz-und-international.html
  - Open Government Data Reported By The Swiss Cantons: https://github.com/openZH/covid_19
- Taiwan*:
  - CDC: https://sites.google.com/cdc.gov.tw/2019ncov/taiwan?authuser=0
- Thailand:
  - Ministry of Public Health, Department of Disease Control Dashboard: https://ddc.moph.go.th/viralpneumonia/eng/index.php
  - Ministry of Public Health, Department of Disease Control Situational Reports: https://covid19.ddc.moph.go.th/en
- Turkey:
  - ~~Republic of Turkey Ministry of Health: https://covid19.saglik.gov.tr/TR-66935/genel-koronavirus-tablosu.html~~
  - ~~Digital Transformation Office of The Presidency of The Republic of Turkey: https://corona.cbddo.gov.tr/Home/GetLastDayDifference~~
- Ukraine:
  - Office of the National Security and Defense Council of Ukraine: https://covid19.rnbo.gov.ua/
- United Arab Emirates:
  - The Supreme Council For National Security, National Emergency Crisis and Disasters Management Authority: https://covid19.ncema.gov.ae/en
- United Kingdom:
  - Government of the United Kingdom: https://coronavirus.data.gov.uk/#category=nations&map=rate
  - Scottish Government: https://www.gov.scot/publications/coronavirus-covid-19-trends-in-daily-data/

Embed our dashboard into your webpage:

<style>.embed-container {position: relative; padding-bottom: 80%; height: 0; max-width: 100%;} .embed-container iframe, .embed-container object, .embed-container iframe{position: absolute; top: 0; left: 0; width: 100%; height: 100%;} small{position: absolute; z-index: 40; bottom: 0; margin-bottom: -15px;}</style><div class="embed-container"><iframe width="500" height="400" frameborder="0" scrolling="no" marginheight="0" marginwidth="0" title="COVID-19" src="https://www.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6"></iframe></div>

Acknowledgements: We are grateful to the following organizations for supporting our Center’s COVID-19 mapping and modeling efforts: Financial Support: Johns Hopkins University, National Science Foundation (NSF), Bloomberg Philanthropies, Stavros Niarchos Foundation; Resource support: AWS, Slack, Github; Technical support: Johns Hopkins Applied Physics Lab (APL), Esri Living Atlas team

Additional Information about the Visual Dashboard: https://systems.jhu.edu/research/public-health/ncov/

Contact Us:

Email: [email protected]

Terms of Use:

This data set is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) by the Johns Hopkins University on behalf of its Center for Systems Science and Engineering. Copyright Johns Hopkins University 2020.
Attribute the data as the "COVID-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University" or "JHU CSSE COVID-19 Data" for short, and the url: https://github.com/CSSEGISandData/COVID-19.
For publications that use the data, please cite the following publication: "Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Inf Dis. 20(5):533-534. doi: 10.1016/S1473-3099(20)30120-1"


covid-19's Issues

Data error in 02-13-2020_2115.csv

See the screenshot below: for Hubei, the numbers for Deaths and Recovered are wrong.

See below for screenshot from qq.com at 6:51AM EDT 02-14-2020.

See below for screenshot from DXY at 6:56AM EDT 02-14-2020.

Date format issue

When the data is pulled into Google Sheets, the date column displays as 37276.4166666667 for the first entry of Feb 2020?
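
The number shown is a spreadsheet serial date: a count of days since 30 December 1899, with the fraction giving the time of day. A minimal sketch of decoding it, assuming that epoch applies here; the helper name `serial_to_datetime` is hypothetical:

```python
from datetime import datetime, timedelta

# Spreadsheet serial dates count days from this epoch (assumed here).
SHEETS_EPOCH = datetime(1899, 12, 30)

def serial_to_datetime(serial: float) -> datetime:
    """Convert a spreadsheet serial date to a datetime, rounded to the nearest second."""
    exact = SHEETS_EPOCH + timedelta(days=serial)
    # Add half a second, then truncate, to round away float noise in the fraction.
    rounded = exact + timedelta(microseconds=500_000)
    return rounded.replace(microsecond=0)

decoded = serial_to_datetime(37276.4166666667)
print(decoded)  # 2002-01-20 10:00:00
```

The value decodes to a date in January 2002 rather than February 2020, which suggests the sheet read the date text with a different field order than intended. Importing the column as plain text and parsing it with an explicit format may avoid that.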

Time series data is gone?

The below path is empty

https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_time_series

The below file is clearly broken with rows where province='Confirmed'

https://github.com/CSSEGISandData/COVID-19/blob/master/who_covid_19_situation_reports/who_covid_19_sit_rep_time_series/who_covid_19_sit_rep_time_series.csv

What's going on? What's supposed to be the definitive data source for this now?

Also it would be super nice if you folk didn't randomly make breaking changes with 0 warning. There is a lot of analysis and information tooling downstream of your dataset now. If you need assistance in data management/stewardship open an issue and I or any of the other dozens of SMEs active on this repo can assist with that.

Small problem: calculating death rates

It's tempting to try and calculate how deadly a virus is by just dividing the number of people who have died by the number of people who have died plus the number who have recovered, but there's a problem with the number you get: it's too high.

The calculation needs to compare people who were infected at the same time, because it takes longer to recover than it does to die.

Eventually the simple calculation and the complicated calculation will converge, but we're not there yet. :-/
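
A toy sketch of why the simple ratio runs high, under loudly hypothetical assumptions: 100 new cases every day, a true fatality of 2%, deaths occurring 10 days after infection and recoveries 20 days after. None of these numbers come from the data set.

```python
def naive_ratio(day, daily_cases=100, fatality=0.02,
                days_to_death=10, days_to_recovery=20):
    """Deaths / (deaths + recovered) on a given day of the toy outbreak."""
    deaths = 0.0
    recovered = 0.0
    for infected_on in range(day + 1):
        age = day - infected_on  # days since this cohort was infected
        if age >= days_to_death:
            deaths += daily_cases * fatality
        if age >= days_to_recovery:
            recovered += daily_cases * (1 - fatality)
    if deaths + recovered == 0:
        return None  # nobody has died or recovered yet
    return deaths / (deaths + recovered)

print(round(naive_ratio(30), 4))    # 0.0375, well above the true 0.02
print(round(naive_ratio(1000), 4))  # 0.0202, close to the true value
```

Early on, more cohorts have reached the death delay than the recovery delay, so deaths are over-represented. In this constant-case toy the ratio approaches the true value as the delay gap becomes small relative to the elapsed time.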

Table Column has changed

As of today, Feb 23 (and also yesterday), I noticed that the tables' column headers have changed. I am not sure, but I also noticed that the class names for the table headers do not correspond to the column values.

Also I have an open repository here - https://github.com/igoralves1/scrap2019-ncov

I am using puppeteer for async scraping. I just updated it today with the latest changes.

add data dictionary to README.md

What does “confirmed” mean? Recently there were discussions on defining confirmed as “tested positive/exhibit symptoms” as opposed to “tested positive/asymptomatic”. Are we looking at daily or cumulative readings? The latter may be obvious, but it would be nice to add a full data description to the repo.

Which time series are correct now? Total figures from earlier days are different

Which daily dates are still correct at all?

Referred to "confirmed" in the "time series".

At Google Spreadsheet and the last days here at Github the data was usually entered twice a day. Mostly in the morning and late in the evening. To be seen in the first line with the date and the given time.

Since you've changed it now (1 time a sum for one day) here at GitHub; how can it be that the daily sum is less compared to the old data (Spreadsheet, GitHub) for the day instead of more?

For example, February 9th. I was comparing saved records. These include the last (February 11th) from Google Spreadsheet and also February 11th from GitHub, as well as February 12th GitHub, yesterday and today GitHub.

If I compare all of them, except for today, the result is the same total sum for the day. 40,536 confirmed. I took the last entry for this on February 9th at 23:20.

If I look at today's data that is available in GitHub and take February 9th, I get only 40,151 confirmed for the calculation.

As you could see, the other data sets had the time at 23.20 on the second measurement on February 9th. So 20 minutes before midnight.

How can it be that the new data sets, with a single figure per day, now show a lower total? Twenty minutes before the start of a new day is not long, so it is quite strange that the old figure for February 9th is 385 confirmed cases higher than the new one.

Differences (even big ones) also occur on all other days when I compare them with the data available in GitHub today to previous data sets.

So what is true?

Unfortunately, visualizations from other users who relied on the last measurement of each day are no longer correct. Even if the last measurement was taken many hours before the end of the day, a new one-per-day figure should not come out lower than it, should it?
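
A minimal sketch of how two saved snapshots of the time series could be compared, using the February 9th totals quoted above. The function name and the date-keyed dict layout are hypothetical:

```python
def snapshot_diff(old: dict, new: dict) -> dict:
    """Return {date: new - old} for dates whose totals differ between two snapshots."""
    return {
        day: new[day] - old[day]
        for day in sorted(old.keys() & new.keys())
        if new[day] != old[day]
    }

# Totals for February 9th as quoted in this issue.
old_snapshot = {"2020-02-09": 40536}
new_snapshot = {"2020-02-09": 40151}
print(snapshot_diff(old_snapshot, new_snapshot))  # {'2020-02-09': -385}
```

A negative entry means the newer snapshot reports fewer cumulative cases for that date than the older one did.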

China vs WHO confirmed cases

Will WHO statistics be used for your dataset versus those published from Chinese sources?

China changed its confirmed case statistic to include clinically diagnosed cases (CT scans of patients' lungs to see if they're infected) in addition to laboratory confirmed cases. The WHO doesn't agree with this change and is publishing statistics based on laboratory confirmed cases.

Thanks

Use ISO 3166-1 alpha-3 country codes

https://unstats.un.org/unsd/tradekb/knowledgebase/country-code

It will make data overlay into other systems much easier, especially when correlating with travel document information.
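
A small sketch of what attaching alpha-3 codes could look like. The lookup table is a hand-written subset for illustration, and the `country` and `iso3` field names are hypothetical rather than the repository's column names:

```python
# Hand-written subset of ISO 3166-1 alpha-3 codes (illustrative, not complete).
ALPHA3 = {
    "China": "CHN",
    "Japan": "JPN",
    "Singapore": "SGP",
    "Thailand": "THA",
    "South Korea": "KOR",
}

def add_alpha3(rows):
    """Return copies of rows with an 'iso3' field; None when the name is not in the table."""
    return [{**row, "iso3": ALPHA3.get(row["country"])} for row in rows]

rows = [{"country": "Thailand", "confirmed": 14}, {"country": "Atlantis", "confirmed": 0}]
print(add_alpha3(rows))
```

Names that do not match the table exactly come back as `None`, which makes spelling variants easy to spot before joining with other systems.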

Github CSV to Google Sheets

I have a pretty intensive set of calculations I have already setup on my old google sheets file. Is there a way to continue to update my google sheets with the new file released twice a day from this repository?
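
One possible approach is to point the sheet (for example via Google Sheets' `IMPORTDATA`) or a script at the raw file URL instead of the GitHub page. A sketch of deriving the raw URL from a `blob` page URL, assuming the usual `owner/repo/blob/branch/path` layout and a branch name without slashes; `blob_to_raw` is a hypothetical helper:

```python
from urllib.parse import urlsplit

def blob_to_raw(url: str) -> str:
    """Turn a github.com 'blob' page URL into its raw.githubusercontent.com equivalent."""
    parts = urlsplit(url)
    if parts.netloc != "github.com":
        raise ValueError("expected a github.com URL")
    segments = parts.path.strip("/").split("/")
    # Expected layout: owner / repo / "blob" / branch / path...
    if len(segments) < 5 or segments[2] != "blob":
        raise ValueError("expected a /blob/ file URL")
    owner, repo, _, branch, *path = segments
    return "https://raw.githubusercontent.com/" + "/".join([owner, repo, branch, *path])

page = ("https://github.com/CSSEGISandData/COVID-19/blob/master/"
        "csse_covid_19_data/csse_covid_19_daily_reports/02-13-2020.csv")
print(blob_to_raw(page))
```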

Add deaths to the graphics

Hi,

Could you add deaths by day to the graphics?

For a better overview: "deaths / recovered / infected".

thanks

Alternative languages

Are there any plans to support multiple languages, especially Chinese and Japanese?

How are transferred individuals counted?

There are special flights out of Wuhan and Japan will let some individuals off of the cruise ship. So how are those transferred individuals counted in the data?

Let's say a cruise ship passenger has the virus and is allowed off the ship, but they die after. Is that a cruise ship death or a Japan death?

Thailand data issue

From files "01-28-2020_1300.csv" to "01-31-2020_1400.csv", Thailand is listed as having 5 "Recovered" cases.

Then in file "02-01-2020_1000.csv", Thailand is listed as having 7 "Recovered" cases, with an update time of "2/1/20 10:00".

Then from files "02-01-2020_1800.csv" to "02-01-07-2020_20204.csv", Thailand is again listed as having 5 "Recovered" cases.

Finally, in file "02-08-2020_1024.csv", Thailand is listed as having 10 "Recovered" cases, with an update time of "2/8/20 12:53".

It appears that either a) file "02-01-2020_1000.csv" should be edited to 5 "Recovered" cases, or b) files "02-01-2020_1800.csv" to "02-01-07-2020_20204.csv" should be edited to 7 "Recovered" cases.
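
Inconsistencies like this can be flagged automatically, since a cumulative count should normally not go down. A minimal sketch using the "Recovered" values described above, in the order they were reported; `find_decreases` is a hypothetical helper:

```python
def find_decreases(series):
    """Return (index, previous, current) for each point where a cumulative series drops."""
    return [
        (i, series[i - 1], series[i])
        for i in range(1, len(series))
        if series[i] < series[i - 1]
    ]

# Thailand "Recovered": 5, then 7, then 5 again, then 10.
thailand_recovered = [5, 7, 5, 10]
print(find_decreases(thailand_recovered))  # [(2, 7, 5)]
```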

(￢︿̫̿￢☆)

Thank you.

Thank you so much for providing us with this data.

I don't see what the issue was with Google Sheets, though. I'm sure anyone who used this data could convert, import, or export it in any way they wanted.

Every time you stop one method of delivery and choose a new one, it brings a new challenge, which is very good for learning new skills. I had never imported and integrity-checked CSV files from GitHub into Google Sheets before.

For the future, though, I would appreciate it if you either kept supporting the previously offered delivery formats or stuck with the current one. Maybe start a Reddit or other "user support" page where people who are unable to get what they need can ask others how to achieve what they want?

If they wanted to be able to download a CSV from your published sheet, it was as easy as creating an empty Google Sheets file, importing your 3 tabs into it, then saving it as a CSV file from there.
Someone would have taken the time to explain to them how to do it, I'm sure.

Again, thank you so much for taking the time to centralize all this data and keeping it up to date for us.

Friendly regards.
A

Add Singapore's Ministry of Health (MoH) as one of the official sources

The official MoH webpage is: https://www.moh.gov.sg/covid-19

Contents of csv files are overwritten. Why?

How can it be that in, for example, the last data (daily_case_updates) of 12 February 2020, 22.00 (02-12-2020_2200.csv), there are update data with the date 13 February 2020?

Then the data is no longer correct? It's like when I name a file January 10th, but keep updating the contents. Then nothing at all matches anymore.
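
A sketch of how such rows could be detected by comparing a row's update time with the timestamp in the file name. The file-name pattern is taken from the example above; the ISO-formatted "Last Update" value and both helper names are assumptions, and time zones are ignored because the source does not state them:

```python
from datetime import datetime

def filename_timestamp(name: str) -> datetime:
    """Parse names like '02-12-2020_2200.csv' into a datetime."""
    stem = name.rsplit(".", 1)[0]
    return datetime.strptime(stem, "%m-%d-%Y_%H%M")

def updated_after_snapshot(name: str, last_update: str) -> bool:
    """True when a row's update time is later than the file name's timestamp."""
    return datetime.fromisoformat(last_update) > filename_timestamp(name)

# Hypothetical 'Last Update' value dated the day after the file name.
print(updated_after_snapshot("02-12-2020_2200.csv", "2020-02-13T01:30:00"))  # True
```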

currently sick

Currently displayed are 1) "Total Confirmed", 2) "Total Deaths", and 3) "Total Recovered".

Is it possible to show, under "Total Confirmed" (1), the number of people who are currently under treatment?

(For example, "Actual cases" would be Total Confirmed minus Total Deaths minus Total Recovered.)

Please.
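
The subtraction proposed above, as a small sketch; the function name and the sample numbers are hypothetical:

```python
def active_cases(confirmed: int, deaths: int, recovered: int) -> int:
    """Confirmed cases that have neither died nor recovered."""
    active = confirmed - deaths - recovered
    if active < 0:
        raise ValueError("deaths + recovered exceed confirmed; check the inputs")
    return active

print(active_cases(confirmed=1000, deaths=20, recovered=300))  # 680
```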

Problem with Singapore data

Singapore's confirmed-case data for the first few days, from 23 Jan to 30 Jan, were changed to 0. From 31 Jan the figure is correct at 13 cases. Can you fix this? Thanks! In previous days the file was OK; the problem only appeared yesterday or today.

Please release data twice a day

I read through the new README and the csv file for 02-14 and I have some concerns: under the new plan, you make the release before 8AM Beijing time, when most provinces in China have not updated their data yet, and the next release will not come until 24 hours later. (China normally releases data between 8 and 10am.) Please make another daily release at around 4am UTC if possible. Thanks.

Covid2019 | API

https://covid2019-api.herokuapp.com/
https://github.com/nat236919/Covid2019API

This API contains the current data, it gives the calculated sets of data to developers or those who need to analyse or present data swiftly.

** MIT - Please feel free to contribute and comment
** All credits go to those who work behind this amazing repository (CSSEGISandData/COVID-19)

Yokohama Lng

Is the Diamond Cruise Lng right? I think it should be 139.638 (Yokohama) instead of 129.638.

Data source for global cases?

I was using this feed, but it stopped working today.

https://coronavirus-tracker-api.herokuapp.com/all

I have no idea who runs it.

Missing data in `daily updates`

Comparing the Google sheets and the data in the daily update directory of this repository, quite a few datasets appear to be missing. In particular, all of these:

Was this data faulty or is this simply a synchronization error?

Please add README for new files

Looks like
https://github.com/CSSEGISandData/COVID-19/blob/master/csse_covid_19_data/csse_covid_19_daily_reports/02-13-2020.csv
is the same as
https://github.com/CSSEGISandData/COVID-19/blob/master/archived_data/daily_case_updates/02-13-2020_1000.csv
with some minor updates, e.g., a 'T' added between the date and time in the 'Last Update' column. So does T mean UTC?

See below screenshots:

More importantly, why did you choose 02-13-2020_1000.csv as the source to create 02-13-2020.csv???

If you need the community's help to clean up the data and make the file names, last-update times, time zones, etc. consistent, please let us know! I proposed something in this reply, but you chose to ignore it.

If you can do it yourselves, that's great, and we as the community appreciate it VERY much. But I'm afraid you did not do it right and, more importantly, you did it in a rush without giving any warning. I know this is a free source and you are helping the community; I understand and appreciate that very much, but choices like this really hurt. @CSSEGISandData

Created a JSON-based API.

Hello and thank you so much for maintaining this! I have created an API that reads your data and returns it in a way that's more friendly to use in programs. It also supports history. It's still a W.I.P and I would love contributions! It is open-sourced here: https://github.com/ExpDev07/coronavirus-tracker-api. Feel free to use it in your projects!

It is very fast due to caching.

The current endpoints are (more will be added):
https://coronavirus-tracker-api.herokuapp.com/confirmed
https://coronavirus-tracker-api.herokuapp.com/deaths
https://coronavirus-tracker-api.herokuapp.com/recovered

For all of them combined:
https://coronavirus-tracker-api.herokuapp.com/all

Time Series - Confirmed

The data point for confirmed cases in Hubei does not change for 6 Feb (and maybe the other provinces). My history from dxy.cn shows Hubei had 22112 confirmed cases in their 6 Feb (CST) update.

Great work putting this together. I've just linked to your tables rather than manually pulling from dxy.

Suspected cases

Hi.. is there any "suspected cases time series"?

JHU Data Backend Updates (NOTICE)

It causes a lot of downtime on the developers' end if the file structure keeps changing without warning. Maybe a notice could allow developers to make changes before the edits go live. :D

An alternate visualization

Thank you very much for your effort and making curated data available to the world! Just wanted to bring to your attention that we have used some of the data that you have generously made available (along with data curated by us) to build a dashboard. This dashboard provides an alternate way of examining surveillance data. In particular:

County-level statistics for the United States (click on a State to view), and state/province-level statistics for Canada, Chile, India and Germany;
A time slider to view all the historical data;
An interactive chart for cumulative and daily number;
A visualization of all reported Coronavirus incidence data, filtered by date;
A heatmap of selected attributes on an interactive map;
A Query tool that allows users to focus on regions of interest;
The ability to select regions by clicking on the map; to select multiple regions at once, hold the “command” key on the Mac or the “ctrl” key on Windows while clicking;
Users can export subsets of the data for analysis on external tools.

Please see: https://nssac.bii.virginia.edu/covid-19/dashboard/

Documentation:
https://nssac.github.io/covid-19/dashboard/

Time zone

A suggestion: indicate the time zone (Eastern Standard Time, if I remember correctly) or, even better, use UTC?

Keep up the good work!

Getting more information (ages)?

Where can we get more detailed data?
With information about the patient's age, the date when they were infected...

keep the shape of the data consistent

A column in time_series_2019-ncov-Confirmed.csv used to be named 'First confirmed date in country (est.)' but is now 'First confirmed date in country'. This small change breaks all the downstream analytics.

Besides, the column name is misleading since it contains dates of first confirmed cases in either state/province or in country - depending on which is the smaller administrative unit.

There could be 2 columns:

First confirmed date in province/country <-- with data from current 'First confirmed date in country' column
First confirmed date in country <-- optional, preserves compatibility

The 1st one showing data from former/current column 'First confirmed date in country (est.)'/'First confirmed date in country'.

The 2nd one showing actual first date for the country as a whole. The column is not strictly necessary since people who need it, will add it on their side, but it would preserve backward compatibility with existing analytical solutions.

Either way, please keep the names and data consistent, because changes like this cause errors and confusion in the analytic pipeline down the line.
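
Until the names settle, a downstream script can shield itself by mapping every header variant it has seen to one stable name. A minimal sketch, assuming the two column names quoted above are the only variants; `COLUMN_ALIASES`, `normalize_header`, `read_rows` and the sample rows are made up for illustration.

```python
import csv
import io

# Hypothetical alias table: each known header variant maps to one stable name.
COLUMN_ALIASES = {
    "First confirmed date in country (est.)": "First confirmed date in country",
}


def normalize_header(header):
    """Replace known header variants with their canonical names."""
    return [COLUMN_ALIASES.get(name.strip(), name.strip()) for name in header]


def read_rows(csv_text):
    """Parse CSV text into dicts keyed by the canonical column names."""
    reader = csv.reader(io.StringIO(csv_text, newline=""))
    header = normalize_header(next(reader))
    return [dict(zip(header, row)) for row in reader]


# Made-up sample rows, one per header variant.
old_style = "Province/State,First confirmed date in country (est.)\nExample,1/1/2020\n"
new_style = "Province/State,First confirmed date in country\nExample,1/1/2020\n"
print(read_rows(old_style) == read_rows(new_style))  # → True
```

This only hides renames that are already known; a brand-new name would still need an entry in the alias table.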

Feature Request- Change red-only color of map bubbles to green -> red spectrum based on confirmed case rate by region.

I think the red bubbles on the map, which currently represent the sheer number of confirmed cases via bubble size, would be even more useful if their color represented the rate of change of confirmed cases by region. So a large green bubble in an area would signify a high total number of confirmed cases but zero growth, and a tiny red bubble would mean few total cases but a high rate of growth. A small legend showing the full color range and the min and max growth rates would be useful as well.

Using Semicolon as Delimiter

Dear Contributors,

Because some province/state values contain commas, I think it would be much better to use a semicolon as the delimiter (as in the format you used before the current one).

Thank you.
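
For what it's worth, a comma inside a value is not a problem as long as the field is quoted, and Python's standard `csv` module handles both variants. A small sketch, assuming the files follow the usual quoting convention; the sample row is illustrative, not copied from the repository.

```python
import csv
import io

rows = [
    ["Province/State", "Country/Region", "Confirmed"],
    ["Chicago, IL", "US", "2"],  # illustrative row; the first value contains a comma
]

# Writing with the default comma delimiter quotes the field that contains a comma.
buffer = io.StringIO(newline="")
csv.writer(buffer).writerows(rows)
comma_text = buffer.getvalue()
print(comma_text.splitlines()[1])  # → "Chicago, IL",US,2

# Reading it back recovers the original values.
parsed = list(csv.reader(io.StringIO(comma_text, newline="")))
print(parsed == rows)  # → True

# A semicolon-delimited file is read by passing delimiter=";".
semicolon_text = "Province/State;Country/Region;Confirmed\nChicago, IL;US;2\n"
parsed_semicolon = list(csv.reader(io.StringIO(semicolon_text, newline=""), delimiter=";"))
print(parsed_semicolon == rows)  # → True
```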

Data Problems

The data field "Province/States" has rows showing statuses (i.e. confirmed, deaths, severe etc.) along with actual provinces. This is clearly a mistake. Can you fix this?

Also, why did you do away with longitude/latitude? Can you bring it back?

Please commit to a format so as to avoid introducing human error into the process.

Infected amount data type (Double/Int)

The .csv files for Confirmed, Dead and Recovered state the numbers as double.
Files in daily_case_updates/ have integer values.
What is the reason behind not having integers everywhere?
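
As a stopgap on the consumer side, both representations can be read into integers. A minimal sketch, assuming every count is a whole number written either as "45" or "45.0"; `parse_count` is a made-up helper name.

```python
def parse_count(text: str) -> int:
    """Parse a case count written as an integer ("45") or a double ("45.0")."""
    value = float(text)
    if not value.is_integer():
        # A fractional count would point to a data problem, so fail loudly.
        raise ValueError(f"not a whole number: {text!r}")
    return int(value)


print(parse_count("45.0"))  # → 45
print(parse_count("45"))    # → 45
```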

Not Csv Files

The files read from https://github.com/CSSEGISandData/COVID-19/tree/master/archived_data/daily_case_updates
seem to be HTML files, not CSV files.

Data inconsistencies between dashboard and time_series files

It appears there are some inconsistencies in terms of data between what dashboard shows and what timeseries csv files record. If you look at:

2/13/2020 the graph says 59.8k cases in Mainland China while who_covid_19_sit_rep_time_series.csv indicates 46550 and time_series_2019-ncov-Confirmed.csv indicates 63841?

2/11/2020 the graph says 44.3k cases in Mainland China while who_covid_19_sit_rep_time_series.csv indicates 42708 and time_series_2019-ncov-Confirmed.csv indicates 44641?

2/10/2020 the graph says 42.3k cases in Mainland China while who_covid_19_sit_rep_time_series.csv indicates 40235 and time_series_2019-ncov-Confirmed.csv indicates 42310?

2/7/2020 the graph says 34.1k cases in Mainland China while who_covid_19_sit_rep_time_series.csv indicates 31211 and time_series_2019-ncov-Confirmed.csv indicates 34569?

SQL Version of this dataset

We are publishing a SQL version of this dataset in Dolt, a SQL database with Git-style versioning, in case anyone is interested.

The Dolt repository can be found here:

https://www.dolthub.com/repositories/Liquidata/corona-virus

We wrote a blog post about SQL Views which also describes how the dataset can be used:

https://www.dolthub.com/blog/2020-02-10-introducing-sql-view-support-in-dolt/

The import job is open source and can be found here:

https://github.com/liquidata-inc/liquidata-etl-jobs/blob/master/airflow_dags/corona-virus/import-data.pl

The import job runs on the hour.

Created a python package to extract data and generate reports 📈

Had this working for the google sheets, but then decided to update for the github version

https://github.com/AaronWard/coronavirus-analysis

What does it do?

Extracts latest entry for each date from 2019-nCoV
creates an aggregated time series dataframe
creates summary report csv with information such as currently_infected for a given date
report diagrams to visualize the growth in confirmed cases, deaths and recoveries

If you just want to see visualizations, I update this repo daily, so star the repo and check the readme👍

wrong data for Japan

The data for Japan in time_series_2019-ncov-Confirmed.csv looks wrong for 2/5/20 23:00, 2/6/20 9:00 and 2/6/20 14:20 (45 confirmed), because the next value is 25.
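
Since the confirmed counts are cumulative, a value that drops from one timestamp to the next is a cheap signal that something is off. A minimal sketch of such a check, using the values quoted above (three readings of 45 followed by 25); `find_decreases` is a made-up helper name.

```python
def find_decreases(series):
    """Return (index, previous, current) for every point where a cumulative count drops."""
    return [
        (i, series[i - 1], series[i])
        for i in range(1, len(series))
        if series[i] < series[i - 1]
    ]


# Values quoted in this report: 45 at three timestamps, then 25.
japan_confirmed = [45, 45, 45, 25]
print(find_decreases(japan_confirmed))  # → [(3, 45, 25)]
```

A drop only shows that two neighbouring values disagree; it does not say which of them is the wrong one.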

Wrong time in time_series_2019-ncov-Confirmed.csv

The file time_series_2019-ncov-Confirmed.csv has columns for 2/8/20 22:04 and 2/8/20 23:04, but the other files have 2/8/20 10:24 and 2/8/20 23:04.

Add reported rate per day information chart for country selected

Appreciate the great work done by this team to provide this useful resource. Is it possible to add a feature to show the reported rate per day to the chart for the country selected? thank you

Keep the date time format consistent

In all files in time_series, all times follow a specific format (2/5/2020 9:00 AM): %m/%d/%Y %I:%M %p. The last datetime, however, has a different format: the time is now in 24-hour format, and the year has been shortened from 2020 to 20 (2/8/20 23:04). I believe this format is %m/%d/%y %H:%M.
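
Until the headers are consistent, a parser can simply try both formats in turn. A minimal sketch using `datetime.strptime`; `FORMATS` and `parse_timestamp` are made-up names. Note that the 24-hour variant is parsed with `%H` rather than `%I`, because 23 is outside the 1-12 range that `%I` accepts.

```python
from datetime import datetime

# The two timestamp styles seen in the time series headers.
FORMATS = (
    "%m/%d/%Y %I:%M %p",  # e.g. 2/5/2020 9:00 AM
    "%m/%d/%y %H:%M",     # e.g. 2/8/20 23:04
)


def parse_timestamp(text: str) -> datetime:
    """Try each known format and return the first successful parse."""
    for fmt in FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt)
        except ValueError:
            continue
    raise ValueError(f"unrecognized timestamp: {text!r}")


print(parse_timestamp("2/5/2020 9:00 AM"))  # → 2020-02-05 09:00:00
print(parse_timestamp("2/8/20 23:04"))      # → 2020-02-08 23:04:00
```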

Missing column in `time_series_2019-ncov - Recovered.csv`

Thank you so much for putting up the raw data!

I've noticed that in the file time_series_2019-ncov - Recovered.csv, the column with time stamp 1/31/2020 7:00 PM is missing while it's present in the corresponding Death- and Confirmed-files. Would you be able to comment on why that is?

I've noticed that the time stamp 1/31/2020 7:00 PM is missing in the daily update data, as well.

Thanks again!

Daily updates with no changes in Hubei data

Dear @CSSEGISandData, thank you for your work once again.

If I understand correctly, the problem of identical Hubei data in different daily updates (see the image) is probably related to when you publish your updates and when China publishes the Hubei updates (they sometimes delay the updates by a few hours). Will you fix this problem, perhaps by publishing your daily update only after the Hubei update is available, or just a few hours later?

Or did I miss something, and is there an easy way to extract the missing Hubei data from your files? There were a lot of changes in the file structure, so maybe I have missed something...

But I see the same issue in the daily reports and the time series: different dates, same Hubei data.

Thousands of people are already using my reports based on your data http://avatorl.org/covid-19/ and I hope

please add the time to the dates in the time series

Last I checked, the time series files had both date AND time in the timestamps, but now it's down to only the dates. Can you please add the times again?

Data Connection

I had connected to this data originally when it was stored in the google doc via Microsoft PowerBI, just using the built in "Get Data - Web Connection."

When the data feed moved to GitHub I just started pointing PowerBI to the most recently updated csv file.

It seems that method is no longer working, either, and instead of a table of data PowerBI is pulling in just some snippets of html code, and not the data table. If anyone has some suggestions where I've gone wrong I would appreciate it.
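
One likely cause, assuming the connection points at a github.com "blob" page: that URL returns the HTML file viewer, while the plain file contents are served from raw.githubusercontent.com. A minimal sketch of the URL rewrite; `to_raw_url` is a made-up helper, and whether PowerBI then accepts the result is untested here.

```python
from urllib.parse import urlsplit, urlunsplit


def to_raw_url(blob_url: str) -> str:
    """Turn github.com/<owner>/<repo>/blob/<branch>/<path> into its raw-content URL."""
    parts = urlsplit(blob_url)
    segments = parts.path.strip("/").split("/")
    if parts.netloc != "github.com" or len(segments) < 5 or segments[2] != "blob":
        raise ValueError(f"not a GitHub blob URL: {blob_url!r}")
    owner, repo, _blob, *rest = segments  # rest = [branch, path components...]
    raw_path = "/" + "/".join([owner, repo, *rest])
    return urlunsplit(("https", "raw.githubusercontent.com", raw_path, "", ""))


page = (
    "https://github.com/CSSEGISandData/COVID-19/blob/master/"
    "csse_covid_19_data/csse_covid_19_daily_reports/02-13-2020.csv"
)
print(to_raw_url(page))
```

The same address is what the "Raw" button on a GitHub file page leads to. Directory ("tree") pages have no raw equivalent, so each file has to be addressed individually.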