COVID-19 dataset clearinghouse: Difference between revisions

From Polymath Wiki
 
(37 intermediate revisions by 2 users not shown)
Line 12: Line 12:
* [https://ourworldindata.org/coronavirus Coronavirus Disease (COVID-19) – Statistics and Research], Our World in Data, by Max Roser, Hannah Ritchie and Esteban Ortiz-Ospina
** [https://ourworldindata.org/coronavirus?fbclid=IwAR0d1z_t-W9OQVDE08wMbR3-RX-XPCaogFM5hUmFMqL2QZDgY9x-R2CRErE Cases + mortality rate daily]
** [https://ourworldindata.org/covid-testing How many tests for COVID-19 are being performed around the world?]
* [https://github.com/CSSEGISandData/COVID-19 Novel Coronavirus (COVID-19) Cases], Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE)
** [https://github.com/datasets/covid-19 Novel Coronavirus 2019 time series data on cases], sourced and cleaned from the above data set
** [https://github.com/summyfeb12/COVID-19-JHU-Data-API JSON Wrapper API for the JHU CSSE data]
** [http://shiny.science.ku.dk/pbm/COVID19/ A visualization of the JHU CSSE data]
* [https://github.com/covid19-data/covid19-data 2019-nCoV Data Processing Pipelines and datasets]   
** Country and state names are normalized using ISO 3166-1 codes.
Line 25: Line 27:
* [https://datarepository.wolframcloud.com/resources/Epidemic-Data-for-Novel-Coronavirus-COVID-19 Epidemic Data for Novel Coronavirus COVID-19], Wolfram  
* [https://coronavirus-disasterresponse.hub.arcgis.com/datasets/bbb2e4f589ba40d692fab712ae37b9ac Coronavirus COVID-19 Cases], ESRI
* [https://www.europeandataportal.eu/data/datasets/covid-19-coronavirus-data?locale=en COVID-19 Coronavirus data], European Union Open Data Portal
* [https://github.com/midas-network/COVID-19/tree/master/data/cases/who/novel_coronavirus_situation_reports/ Novel Coronavirus (2019 nCoV) situation reports], World Health Organization
* [https://github.com/midas-network/COVID-19/tree/master/data/cases/global/line_listings_oxford_github/ nCoV line listings from various sources and data processing], Oxford University
* [https://healthweather.us/?mode=Atypical US Health Weather Map], Kinsa Insights
** Cumulative amount of atypical illnesses observed since March 1


==== North America ====
Line 39: Line 46:
** [https://covidtracking.com/api/ API]
* [https://github.com/kgjenkins/covid-19-ny Covid-19 coronovirus cases in New York State]
* [https://covid19tracker.health.ny.gov/ COVID-19 Tracker], New York State Department of Health
* [https://a816-hrt.nyc.gov/DataCatalog/Pages/DataView.cshtml?id=5ff1078d-8e34-43de-b339-7556dd09b5b2 Communicable Disease Surveillance Data], NYC Health Department Data Resources
* [https://www.nytimes.com/article/coronavirus-county-data-us.html Coronavirus Case Data for Every U.S. County], New York Times
** [https://github.com/nytimes/covid-19-data Github repository]
** [https://www.nytimes.com/interactive/2020/us/coronavirus-us-cases.html Interactive visualization]
* [https://drive.google.com/file/d/1Ec-PDaqthCOPjWNXqun1yopX45xeSc0k/view COVID-19 Coronavirus US Case Density over Time by State], using JHU CSSE data
* [http://coronavirusapi.com/ Coronavirus API Public Health Initiative]
** Record of official data from US government websites for the 50 states and DC
* [https://dph.georgia.gov/covid-19-daily-status-report COVID-19 Daily Status Report], Georgia Department of Public Health
* [https://www150.statcan.gc.ca/n1/en/type/data?text=COVID COVID statistics] (Canada), Statistics Canada
* [https://fr.flatten.ca/ FLATTEN] (Canada)
** Online screening tool to provide information on COVID-19
* [https://github.com/midas-network/COVID-19/tree/master/data/cases/canada/ontario_situation_updates/ The 2019 Novel Coronavirus (2019 nCoV), Status of cases in Ontario], Ontario Ministry of Health
* [https://github.com/reichlab/covid19-forecast-hub Projections of COVID-19, in standardized format], The Reich Lab at UMass-Amherst
==== Europe ====


* [http://www.influenzanet.info/ Influenzanet]
* [https://leoss.net/ Studying SARS-CoV-2 in European patients], Lean European Open Survey on SARS-CoV-2 Infected Patients (LEOSS)
* [https://github.com/pcm-dpc/COVID-19 COVID-19 Italia - Monitoraggio situazione]
Line 48: Line 69:
* [https://www.kaggle.com/sudalairajkumar/covid19-in-italy COVID-19 in Italy], Kaggle
* [https://npgeo-corona-npgeo-de.hub.arcgis.com/search?groupIds=b28109b18022405bb965c602b13e1bbc RKI COVID19] (Germany), NPGEO Corona  
* [https://coronavirus.digitaler-harz.de/ Coronavirus API Deutschland] (Germany), Digitaler Harz
* [https://www.kaggle.com/headsortails/covid19-tracking-germany COVID-19 Tracking Germany], Heads or Tails
* [https://www.bag.admin.ch/bag/en/home/krankheiten/ausbrueche-epidemien-pandemien/aktuelle-ausbrueche-epidemien/novel-cov/situation-schweiz-und-international.html New coronavirus: Current situation – Switzerland and international], Bundesamt für Gesundheit.
** [https://www.bag.admin.ch/dam/bag/de/dokumente/mt/k-und-i/aktuelle-ausbrueche-pandemien/2019-nCoV/covid-19-datengrundlage-lagebericht.xlsx.download.xlsx/200325_Datengrundlage_Grafiken_COVID-19-Bericht.xlsx data set]
* [https://www.mscbs.gob.es/profesionales/saludPublica/ccayes/alertasActual/nCov-China/situacionActual.htm Situacion Actual] (Spain), Ministerio de Sanidad, Consumo y Bienestar
* [https://atlo.team/koronamonitor/ Koronamonitor] (Hungary), atlatszo.hu, atlo.team
* [https://www.europeandataportal.eu/data/datasets/fr-sars-cov-2?locale=en Fr-SARS-CoV-2] (France), European Data Portal
* [https://github.com/opencovid19-fr/data Consolidation des données de sources officielles concernant l'épidémie de COVID19] (France)
* [https://www.europeandataportal.eu/data/datasets/coronavirus?locale=en Coronavirus] (Netherlands), Rijksinstituut voor Volksgezondheid en Milieu (RIVM)
* [https://www.europeandataportal.eu/data/datasets/covid-19-zarazheni?locale=en Covid 19 - заражени] (Serbia), European Data Portal
* [https://www.europeandataportal.eu/data/datasets/covid-19-samoizolatsija?locale=en Covid 19 - самоизолација] (Serbia), European Data Portal
* [https://www.europeandataportal.eu/data/datasets/covidcountystatisticshpscireland?locale=en CovidCountyStatisticsHPSCIreland] (Ireland), Health Protection Surveillance Centre


==== Asia ====
Line 62: Line 92:
** [https://www.kaggle.com/kimjihoo/coronavirusdataset The data set on Kaggle]
* [https://www.cdc.go.kr/board/board.es?mid=a30402000000&bid=0030 Press releases], Korea Centers for Disease Control and Prevention
* [https://github.com/midas-network/COVID-19/tree/master/data/cases/south%20korea/line_list_park_github/ COVID 19 South Korea], Sang Woo Park
* [https://github.com/ThisIsIsaac/Data-Science-for-COVID-19 COVID-19 Korea Dataset & Comprehensive Medical Dataset & visualizer], DS4C (Data Science for COVID-19) Project
* [https://hira-covid19.net/ #OpenData4Covid19], Ministry of Health and Welfare of Korea and Health Insurance Review and Assessment Service of Korea
** Medical history of COVID19 patients based on their insurance claims of the last five years.
* [https://github.com/midas-network/COVID-19/tree/master/data/cases/south%20korea/confirmed_cases_movement/ Confirmed patient movement route], Korea Centers for Disease Control and Prevention
* [https://covid19ph.com/ COVID-19 Philippines], Negros Island
* [https://github.com/midas-network/COVID-19/tree/master/data/cases/china/death_count_imperial_college/ Hubei early deaths 2020 07 02], Imperial College
* [https://github.com/midas-network/COVID-19/tree/master/data/cases/china/daily_cases_chinacdc_EN/ Distribution of new coronavirus pneumonia], China CDC
* [https://github.com/midas-network/COVID-19/tree/master/data/cases/hong%20kong/doh_situation_updates/ Latest local situation of Severe Respiratory Disease associated with a Novel Infectious Agent] (Hong Kong), Hong Kong Center for Health Protection
* [https://github.com/midas-network/COVID-19/tree/master/data/cases/thailand/moh_situation_updates/ Novel Coronavirus 2019 Pneumonia Situation] (Thailand), Emergency Operation Center, Department of Disease Control


==== Other regional data ====


* [https://www.kaggle.com/unanimad/corona-virus-brazil Coronavirus (COVID-19) - Brazil Dataset], Kaggle
* [https://brasil.io/dataset/covid19/caso COVID-19] (Brazil)
** Boletins informativos e casos do coronavírus por município por dia
* [https://www.geopoll.com/blog/coronavirus-africa/ Coronavirus In Sub-Saharan Africa], Geopoll
* [https://www.health.nsw.gov.au/Infectious/diseases/Pages/covid-19-latest.aspx Latest updates on COVID-19], New South Wales


Line 73: Line 116:
** [https://www.gisaid.org/registration/register/ Registration] is required.
** [https://github.com/nextstrain/ncov Nextstrain build for novel coronavirus (nCoV)], based on GISAID data
*** A [https://nextstrain.org/ncov Genomic epidemiology of novel coronavirus]
* [https://www.kaggle.com/paultimothymooney/coronavirus-genome-sequence Coronavirus Genome Sequence], Kaggle
* [https://www.kaggle.com/paultimothymooney/repository-of-coronavirus-genomes Repository of Coronavirus Genomes], Kaggle
Line 86: Line 129:
* [https://www.ncbi.nlm.nih.gov/research/coronavirus/ LitCovid] - a curated literature hub for tracking up-to-date scientific information about the 2019 novel Coronavirus
* [https://connect.biorxiv.org/relate/content/181 COVID-19 SARS-CoV-2 preprints from medRxiv and bioRxiv]
* [http://biomed-sanity.com/ BioMed Sanity], karpathy
** Indexing bioRxiv papers on COVID-19
* [https://pages.semanticscholar.org/coronavirus-research COVID-19 Open Research Dataset (CORD-19)], Allen Institute for AI, Microsoft, NLM, CZI, Georgetown University  
** Over 44,000 scholarly articles, including over 29,000 with full text, about COVID-19 and the coronavirus family of viruses for use by the global research community
** requested by the White House Office of Science and Technology Policy, and part of the [https://www.whitehouse.gov/briefings-statements/call-action-tech-community-new-machine-readable-covid-19-dataset/ Call to Action to the Tech Community on New Machine Readable COVID-19 Dataset]
* [http://eppi.ioe.ac.uk/cms/Projects/DepartmentofHealthandSocialCare/Publishedreviews/COVID-19Livingsystematicmapoftheevidence/tabid/3765/Default.aspx COVID-19: a living systematic map of the evidence], EPPI
=== Experts ===
* [https://www.science.org.au/covid19/experts COVID-19 Expert Database], Australian Academy of Science
** A mechanism for governments, the business sector, the research sector, and other decision-makers to easily access the expertise they need to inform their decision making.


=== Medical imagery and records ===
Line 95: Line 146:
* [https://www.kaggle.com/darshan1504/covid19-detection-xray-dataset COVID-19 Detection X-Ray Dataset], Kaggle
* [https://www.sirm.org/category/senza-categoria/covid-19/ COVID-19: casistica radiologica Italiana], Società Italiana di Radiologia Medica e Interventistica
* [https://www.covid19challenge.eu/ Fighting Covid-19 Challenge]
** A platform for open research on large Covid-19 imaging datasets


=== Healthcare, vaccine development and equipment ===


* [https://coronavirus-disasterresponse.hub.arcgis.com/datasets/definitivehc::definitive-healthcare-usa-hospital-beds Definitive Healthcare: USA Hospital Beds], ESRI
* [https://www.arcgis.com/home/webmap/viewer.html?webmap=6afcaeb7549f4390b07224a0be01b3a6 COVID-19 Provider Practice Locations], ArcGIS.
* [https://github.com/EdisonDesign/HealthOS Prototype of the HealthOS ventilation system], Edison Design
* [https://e-vent.mit.edu/ MIT Emergency Ventilator], MIT
* [https://www.covidcaremap.org COVID Care Map]
** Open geospatial work to support health systems' capacity (providers, supplies, ventilators, beds, meds) to effectively care for rapidly growing COVID19 patient needs
** [https://www.covidcaremap.org/maps/us-healthcare-system-capacity/#6.07/40.085/-75.195 Open map data on US health system capacity to care for COVID-19 patients]
* [https://milkeninstitute.org/covid-19-tracker COVID-19 Treatment and Vaccine tracker], Milken Institute


=== Social and traffic data ===


* [https://docs.google.com/forms/d/e/1FAIpQLSc501xfAzEPADOwRmsdHmu-v8aN14jnKHBmEmdJJcTgRLddqw/viewform Aggregated foot traffic data], Safegraph
** Access requires executing a non-commercial agreement.
** [https://www.safegraph.com/dashboard/covid19-commerce-patterns?is=5e7a3815f20d617a17a33173 Sample visualization of Safegraph data]
* [http://www.panacealab.org/covid19/ Covid-19 Twitter chatter dataset for scientific use], Panacea Lab, Georgia State University
* [https://covid19obs.fbk.eu/assets/files/last_info.csv Infodemics data from Twitter], CoMuNe lab
** [https://covid19obs.fbk.eu/ Infodemics Observatory]
* [https://app.powerbi.com/view?r=eyJrIjoiODZjNDhmYjAtZGQ3Zi00MDRlLTllNzctYTRjMmI4MTU5YWUyIiwidCI6IjZmZmEyMmY0LTQ1NjgtNDEwNS1hZDQzLTJlM2FkNDcyNjk1NyIsImMiOjN9 COVID-19 Online Survey], Harvard Humanitarian Initiative
** Analyzes social-behavioral aspects of outbreak control
* [https://covid19-civiclytics.citibeats.com/#/ ¿De qué está hablando la ciudadanía durante la pandemia COVID-19?], Grupo BID
** Twitter data relating to COVID in Latin America
* [https://dataforgood.fb.com/tools/disease-prevention-maps/ Disease prevention maps], Facebook
* [https://www.unacast.com/covid19 Social Distancing Scoreboard], Unacast
=== Economic and Policy ===
* [https://docs.google.com/spreadsheets/d/19wJZekxpewDQmApULkvZRBpBwcnd5gZlZF2SEU2WQD8/htmlview?usp=sharing&pru=AAABcO6Xep8*i6rvRQXGMf3qhTlWDRbaSw# Colleges and universities closed/migrating online for COVID-19], Bryan Alexander
* [https://www.notion.so/Schools-affected-by-COVID-19-a28139cb40814869a2cd64cc9453d82c Schools affected by COVID-19]
* [https://www.marinetraffic.com/research/measuring-the-coronavirus-impact-on-trade/ Measuring the Coronavirus’ impact on trade], Marine Traffic Research
* [https://www.top10vpn.com/news/surveillance/covid-19-digital-rights-tracker/ COVID-19 Digital Rights Tracker], Top10VPN
* [https://rexdouglass.github.io/TIGR/TIGR_landing_page.nb.html Crowd-sourced COVID-19 Dataset Tracking Involuntary Government Restrictions (TIGR)], Rex W. Douglass


=== Data scrapers and aggregators ===
Line 119: Line 190:


=== Visualizations, projections, summaries ===
* [https://www.worldometers.info/coronavirus/ COVID-19 Coronavirus Pandemic], Worldometer
* [https://bnonews.com/index.php/2020/03/the-latest-coronavirus-cases/ Tracking coronavirus: Map, data and timeline], BNO News
Line 125: Line 195:
* [https://infection2020.com/ Infection2020]
* [https://covy.app/ covy.app]
* [https://www.coronatracker.com/ CoronaTracker]
* [https://ncov.dxy.cn/ncovh5/view/pneumonia?from=dxy&source=&link=&share= COVID-19 Global Pandemic Real-Time report], dxy.cn ([https://ncov.dxy.cn/ncovh5/view/en_pneumonia?from=dxy&source=&link=&share= English version])
* [https://www.ft.com/coronavirus-latest Coronavirus tracked: the latest figures as the pandemic spreads], Financial Times
Line 131: Line 202:
* [https://covidactnow.org/ COVID Act Now] - predictions of COVID cases in the US by state
** [https://covidactnow.org/model The model used]
* [https://covid19.healthdata.org/projections COVID-19 Projections], IHME
* [https://91-divoc.com/pages/covid-visualization/?fbclid=IwAR3vdyvNKRRvfw1t_xEXMwfEO4WMA-sOEoiSF_-w5lH8aDRJMR28vcOm2J8 An interactive visualization of the exponential spread of COVID-19]
* [http://nrg.cs.ucl.ac.uk/mjh/covid19/ CoVID 19 Worldwide Growth Rates], Mike Handley, UCL
* [https://corona.help/ Corona.help], Alex Dumitru
* [https://aatishb.com/covidtrends/ Covid trends], Aatish Bhatia, Minute Physics
* [https://hidden-fjord-23808.herokuapp.com/ COVID-19 Time Exploration]


=== Other lists, hubs, and groups ===


* [https://www.kaggle.com/datasets?search=covid-19 COVID-19 data sets], Kaggle
Line 147: Line 221:
** This is a collection of data that is available from the Esri Living Atlas as well as data from authoritative sources.
* [http://www.firemountain.net/covid19.html COVID-19 information]
* [https://coronavirustechhandbook.com/data Coronavirus Tech Handbook]
* [https://www.europeandataportal.eu/en/highlights/covid-19 European Data Portal for COVID-19]
* [https://docs.google.com/document/d/1JWeD1AaIGKMPry_EN8GjIqwX4J4KLQIAqP09exZ-ENI/edit#heading=h.ozu9c5nu5x43 Call for Action: COVID-19 Data Collaboratives]
* [https://docs.google.com/spreadsheets/d/1w4czUYSQ1AfOxw-H3Zei5LZAvNkQrSAGeIu60aeXkZs/edit#gid=0 Possible Covid datasets]
* [https://docs.google.com/spreadsheets/d/1N_aVyjQWBzPT_MHiAWyoEqNgX4nKkoU7FznVZpvFTu0/edit#gid=1936591204 COVID-19 Data Providers], Amass Insights
* [https://alan-turing-institute.github.io/COVID-19_PSTC/ COVID-19 Pandemic Symptom Trackers], Alan Turing Institute
* [https://community.wuhan2020.org.cn/en-us/ Wuhan2020]
** A real-time, synchronous data service for hospitals, factories, procurement and other information
* [https://www.covid19-dataexchange.org/initiative COVID-19 Data Exchange], Dawex
* [https://www.endcoronavirus.org/ EndCoronavirus]
* [https://midasnetwork.us/covid-19/ Online Portal for COVID-19 Modeling Research], MIDAS
* [https://console.cloud.google.com/marketplace/details/bigquery-public-datasets/covid19-dataset-list?preview=bigquery-public-datasets COVID-19 Public Datasets], Google Cloud


== Data or Data cleaning requests ==
Line 165: Line 251:


Contact: c.strohmeier@math.ucla.edu
 
=== From Juan José Piñero de Armas (U. Católica de Murcia), Mar 27 ===


Line 178: Line 264:


Contact: jjpinero@ucam.edu
== Miscellaneous ==
* [https://epcced.github.io/ramp/ Rapid assistance in modelling the pandemic: RAMP]
** A call for assistance addressed to the scientific modelling community, coordinated by the Royal Society
* [https://www.nsf.gov/pubs/2020/nsf20052/nsf20052.jsp Letter on the Coronavirus Disease 2019 (COVID-19)], National Science Foundation
** A solicitation for RAPID funding requests relating to COVID-19
* [https://coronacheck.eurecom.fr/en CoronaCheck: Computational Fact Checking for Statistical Coronavirus Claims], Paolo Papotti (EURECOM), Immanuel Trummer (Cornell)
* [https://en.wikipedia.org/wiki/Wikipedia:WikiProject_COVID-19 COVID-19 Wikiproject]
* [https://helpwithcovid.com/ Help with COVID]
** New or established projects helping with the COVID-19 crisis that need help
* [https://airtable.com/shrPm5L5I76Djdu9B/tbl6pY6HtSZvSE6rJ/viwbIjyehBIoKYYt1?blocks=hide COVID-19 Solutions], Airtable

Latest revision as of 19:03, 26 April 2020

This is a repository for public data sets relating to the COVID-19 pandemic. It was initially also envisioned as a clearinghouse for matching requests for data cleaning of such data sets with volunteers willing to perform this cleaning. However, the existing clearinghouse at United Against COVID-19 is already up and running for this purpose, so we are redirecting such requests to that site in order not to fragment the pools of requests and volunteers.

For discussion of this project, see this blog post.

Data sets

Further contributions are very welcome, and can be made either directly to this wiki page (after requesting an account), or placed in the comments to this blog post, or by email to tao@math.ucla.edu.

Epidemiology

North America

Europe

Asia

Other regional data

Genomics and homology

Literature

Experts

  • COVID-19 Expert Database, Australian Academy of Science
    • A mechanism for governments, the business sector, the research sector, and other decision-makers to easily access the expertise they need to inform their decision making.

Medical imagery and records

Healthcare, vaccine development and equipment

Social and traffic data

Economic and Policy

Data scrapers and aggregators

Visualizations, projections, summaries

Other lists, hubs, and groups

Data or Data cleaning requests

As mentioned at the top of this page, future requests for data or data cleaning should be directed to this data discourse page at United Against COVID-19. Below are the legacy requests of this project prior to this redirect.

From Chris Strohmeier (UCLA), Mar 25

The biorxiv_medrxiv folder at https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge contains another folder titled biorxiv_medrxiv, which in turn contains hundreds of json files. Each file corresponds to a research article that is at least tangentially related to COVID-19.

We are requesting:

  • A tf-idf matrix for the subset of the above collection that contains full-text articles (some files appear to have only abstracts).
  • The rows should correspond to the (e.g. 5000) most commonly used words.
  • The columns should correspond to the individual json files.
  • The clean data should be stored as an npy or mat file (or both).
  • Finally, there should be a csv or text document (or both) explaining the meaning of the individual rows and columns of the matrix (which word each row corresponds to, and which file each column corresponds to).
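
The request above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the requester's method: it assumes each json file stores its full text as a list of paragraphs under a "body_text" key (each with a "text" field), it uses raw term counts multiplied by log(N/df) as the tf-idf weight (one of several common variants), and the function names and output file names are hypothetical.

```python
import csv
import json
import math
import re
from collections import Counter
from pathlib import Path

TOKEN = re.compile(r"[a-z]+")  # crude tokenizer: lowercase alphabetic runs


def load_full_texts(folder):
    """Return {file name: body text} for json files with a non-empty body."""
    texts = {}
    for path in sorted(Path(folder).glob("*.json")):
        with open(path, encoding="utf-8") as fh:
            paper = json.load(fh)
        # Assumed schema: "body_text" is a list of {"text": ...} paragraphs.
        body = " ".join(p.get("text", "") for p in paper.get("body_text", []))
        if body.strip():  # skip abstract-only entries
            texts[path.name] = body
    return texts


def tfidf_matrix(texts, vocab_size=5000):
    """Return (words, files, matrix) with rows = words and columns = files."""
    files = list(texts)
    counts = [Counter(TOKEN.findall(texts[f].lower())) for f in files]
    total = Counter()
    for c in counts:
        total.update(c)
    # Most frequent words first; ties broken alphabetically for determinism.
    ranked = sorted(total.items(), key=lambda kv: (-kv[1], kv[0]))
    words = [w for w, _ in ranked[:vocab_size]]
    n_docs = len(files)
    matrix = []
    for w in words:
        df = sum(1 for c in counts if w in c)  # document frequency, always >= 1
        idf = math.log(n_docs / df)
        matrix.append([c[w] * idf for c in counts])
    return words, files, matrix


def save_outputs(words, files, matrix, stem="tfidf"):
    """Write the matrix as .npy plus two csv files describing rows and columns."""
    import numpy as np  # third-party; only needed for the .npy output

    np.save(f"{stem}.npy", np.array(matrix, dtype=float))
    with open(f"{stem}_rows.csv", "w", newline="", encoding="utf-8") as fh:
        csv.writer(fh).writerows(enumerate(words))   # row index, word
    with open(f"{stem}_columns.csv", "w", newline="", encoding="utf-8") as fh:
        csv.writer(fh).writerows(enumerate(files))   # column index, file name
```

A typical call would be `save_outputs(*tfidf_matrix(load_full_texts("biorxiv_medrxiv/biorxiv_medrxiv")))`. For a collection of this size a dense list-of-lists is workable; a sparse format may be preferable for larger corpora.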

Contact: c.strohmeier@math.ucla.edu

From Juan José Piñero de Armas (U. Católica de Murcia), Mar 27

We request per-person information in order to perform survival analyses, regressions with random effects, etc. Some data exists, for instance, at

https://www.kaggle.com/sudalairajkumar/novel-corona-virus-2019-dataset/data
https://www.kaggle.com/kimjihoo/coronavirusdataset
https://www.kaggle.com/imdevskp/covid-19-analysis-visualization-comparisons/data
https://www.sirm.org/category/senza-categoria/covid-19/

but we need much more detail (date when each person was diagnosed, date of infection for the same person, discharge date, date of death, gender, age, treatments, temperatures...), not just summaries or country-aggregated data.
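
To illustrate why per-person dates matter, here is a minimal sketch of the kind of survival analysis such records would allow. Everything in it is an assumption for illustration: the `PatientRecord` fields are a hypothetical subset of those listed above, discharge is treated as simple censoring (a simplification; a real analysis might model it as a competing outcome), and the estimator is a plain Kaplan-Meier product-limit computation.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class PatientRecord:
    """Hypothetical per-person record; field names are illustrative only."""
    diagnosed: date
    discharged: Optional[date] = None
    died: Optional[date] = None
    age: Optional[int] = None
    gender: Optional[str] = None


def duration_and_event(rec, study_end):
    """Days from diagnosis to death (event) or to discharge/study end (censored)."""
    if rec.died is not None:
        return (rec.died - rec.diagnosed).days, True
    end = rec.discharged or study_end
    return (end - rec.diagnosed).days, False


def kaplan_meier(observations):
    """Given (duration, event) pairs, return [(t, S(t))] at each death time."""
    curve = []
    survival = 1.0
    for t in sorted({d for d, event in observations if event}):
        at_risk = sum(1 for d, _ in observations if d >= t)
        deaths = sum(1 for d, event in observations if event and d == t)
        survival *= 1 - deaths / at_risk
        curve.append((t, survival))
    return curve
```

With only summaries or country-aggregated counts, neither the durations nor the censoring indicators in this sketch can be reconstructed, which is why the individual dates are requested.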

Contact: jjpinero@ucam.edu

Miscellaneous