NIH COMmunity: COVID-19 Ongoing Monitoring - Phase 1

NIH-supported effort to rapidly integrate data from multiple sources, including surveys and wearable sensors, to identify individuals who may have undiagnosed COVID-19.

This repository provides ML models for detecting the onset of COVID-19-like symptoms from wearable data and a survey data-based model with the potential to differentiate flu-like symptoms from COVID-19.

Content

Installation - see package requirements (install via pip or conda)
See example notebook which includes a walk-through of training and evaluating a COVID-19 onset detection model using synthetic data
- Invokes functions from utils.py
Trained wearable model for reapplication to new datasets -- this can be subsituted in the example notebook above and reapplied to actual data

Example

Synthetic data is used to illustrate the model training and evaluation pipeline. Using the default generation function and parameters provided, discriminating signal around an Influenza-like Illness (ILI e.g. Flu or COVID-19) event onset can be seen in the normalized heart-rate derived features.

An XGBoost model with these 4 features and 5 days on lagging data is trained to disciminate between Healthy (Class-0) and COVID-19 (Class-1) days. Missing data is also handled by the model (potentially informative missingness) and performance decay (towards the end of the x-axis) with all-missing inputs can be seen.

Validation AUROC is used to select a model after hyperparameter tuning and a threshold (for prediction of COVID-19 events) is selected based on a 95% specificify cut-off

95% Specificity cutoff = 0.8778
              precision    recall  f1-score   support

           0       0.85      0.95      0.90       385
           1       0.83      0.57      0.68       154

    accuracy                           0.84       539
   macro avg       0.84      0.76      0.79       539
weighted avg       0.84      0.84      0.83       539

Cumulative recall of individual participants around event onset is used to evaluate performance of the COVID-19 detection model. However, a model trained without explicitly accounting for other ILI events is suseptible to confusing non-COVID-19 ILI events (red and blue lines in plot below) as COVID-19 (orange line)

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
images		images
trained_models		trained_models
0__example_notebook.ipynb		0__example_notebook.ipynb
LICENSE		LICENSE
PRIVACY.md		PRIVACY.md
README.md		README.md
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NIH COMmunity: COVID-19 Ongoing Monitoring - Phase 1

Content

Example

About

Contributors 2

Languages

License

evidation-opensource/nih-community

Folders and files

Latest commit

History

Repository files navigation

NIH COMmunity: COVID-19 Ongoing Monitoring - Phase 1

Content

Example

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 2

Languages