We are using this page to compile a list of available datasets and tools. If you are intersted in compute resources, please visit the Compute Resources page. Please bear in mind that this list comes from the community and given the pace things are moving, it could become out-of-date quite easy. If you know of any dataset or tool not listed here, please add it. And always remember to double check the usage policies and licenses before using someone else's data or tool. Topics/groups might include a subset of the lists included here as some will be more relevant than others depending the topic/group.
- Johns Hopkins repo
- European Centre for Disease Prevention and Control
- Automated Data Collection: COVID-19/SARS-COV-2 Cases in EU by Country, State/Province/Local Authorities, and Date
- EBI Data
- SARS-CoV-2 sequences GenBank
- nCoV sequences GISAID
- Please be aware of the licenses
- Kaggle (all COVID-19 Related challenges)
- Kaggle COVID-19 Open Research Dataset Challenge (CORD-19)
- COVID Epidemiology
- NY Times data
- NHS Covid19 symptom tracker
- Coronavirus Tracker API
- R package for the data colated by Johns Hopkins
- Penn Medicine - COVID19 Hospital Impact Model for Epidemics
- Epidemic Calculator by Gabriel Goh
- Error estimates in SIR model predictions, ETH Zurich
- COVID-19 Scenario simulator, University of Basel
- Pandemic Preparedness Planning for COVID-19, by Markus Schwehm and Martin Eichner together with the Landesgesundheitsamt Baden-Württemberg/Germany
- Worldometers COVID19 real-time stats
- Bioinformatics resources for SARS-CoV-2 from Clinical Bioinformatics Area from FPS (Junta de Andalucía, Spain)
- Cheminformatics from quarantine: some interactive COVID-19 resources
- Galaxy workflows. A lot is here already! Check for overlap.
- Nexstrain workflows. Here too: look for gaps, no reinventing wheels