COVID-19 Disease spread forecast

This repository implements a recurrent neural network that predicts the spread of COVID-19 across the world. To accomplish this, a separate model is trained for each country, taking into account its nearest neighbours in terms of growth (explained later).

Getting Started

Installation

Clone this repo by running the following commands:

$ git clone https://github.com/rdbch/COVID-19-Forecast/
$ cd COVID-19-Forecast

Install PyTorch 1.0+ and other dependencies (pandas, numpy, seaborn, jupyter, etc).

pip $ pip install -r requirements.txt
conda $ conda create --name COVID-19-Forecast --file conda_requirements.txt

Note: development was done using the GPU-accelerated version of PyTorch, so the requirements files include CUDA dependencies.

Fetching the latest data

To get the latest data, run:

$ python scripts/fetch_new_data.py

This downloads the latest global data from the Johns Hopkins University GitHub repository and converts it to a more convenient format (the one used in Kaggle's COVID-19 spread). Johns Hopkins updates the data daily.
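As a rough illustration of what such a conversion involves, the sketch below reshapes a wide table (one column per day) into a long one (one row per region and date) using only the standard library. It is not the code in scripts/fetch_new_data.py: the function name wide_to_long, the column names, and the sample values are assumptions made for this example.

```python
import csv
import io
from datetime import datetime

# Made-up sample in a wide layout (one column per day). Values are illustrative only.
WIDE_CSV = """Province/State,Country/Region,Lat,Long,1/22/20,1/23/20,1/24/20
,Romania,45.9,24.9,0,1,3
,Italy,41.8,12.5,2,5,9
"""

# Columns that describe the region rather than a day (assumed names).
META_COLUMNS = ("Province/State", "Country/Region", "Lat", "Long")


def wide_to_long(wide_text, value_name):
    """Turn one row per region into one row per (region, date)."""
    rows = []
    for record in csv.DictReader(io.StringIO(wide_text)):
        for column, value in record.items():
            if column in META_COLUMNS:
                continue
            # Every remaining column header is a date such as 1/22/20.
            date = datetime.strptime(column, "%m/%d/%y").date().isoformat()
            rows.append({
                "Province_State": record["Province/State"],
                "Country_Region": record["Country/Region"],
                "Date": date,
                value_name: int(value),
            })
    return rows


long_rows = wide_to_long(WIDE_CSV, "ConfirmedCases")
for row in long_rows[:3]:
    print(row)
```

The long layout makes it straightforward to select a single country and treat its rows as a time series.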

Running the notebooks

To run the notebooks, start the Jupyter server in ./COVID-19-Forecast (the parent of ./notebooks):

$ jupyter notebook

Approach

Country nearest neighbour

Notebook: link

Rather than training one model for all countries, it is better to train a model for each individual country, using only its nearest neighbours in terms of growth. Please check this notebook for more details. Doing this improves the predictions for the majority of countries.

The nearest neighbours of a source country S are obtained as follows:

First, for every country (S included), we discard the entries (days) that fall below a specified alignment threshold T_a, that is, days with fewer than a specified number of cases. Then we take a candidate country C_n, which must be further along than S (it reached T_a earlier). We slide S over C_n, starting from the first day on which the threshold was reached and continuing until C_n ends. An error is computed at each step, and the smallest one becomes the error associated with C_n. We do this for every country in the dataset, taking one feature f in {confirmedCases, fatalities} at a time. During training, the neighbours are filtered by applying an error threshold T_error(f).
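The procedure above can be sketched in plain Python for a single feature. This is a minimal illustration, not the notebook's implementation: the function names, the sample series, and the threshold values are hypothetical, and two details the text leaves open are filled with labelled assumptions (the error is a mean squared difference of log counts, and only offsets where S fully overlaps C_n are tried).

```python
import math


def trim_to_threshold(series, threshold):
    """Drop the leading days on which the count is below the alignment threshold."""
    for day, value in enumerate(series):
        if value >= threshold:
            return series[day:]
    return []


def alignment_error(source, candidate):
    """Slide source over candidate and return (smallest error, offset of that step)."""
    best_error, best_offset = float("inf"), None
    # Assumption: only offsets where source fits entirely inside candidate are tried.
    for offset in range(len(candidate) - len(source) + 1):
        window = candidate[offset:offset + len(source)]
        # Assumption: mean squared difference of log counts; the text does not name the metric.
        error = sum(
            (math.log1p(s) - math.log1p(c)) ** 2 for s, c in zip(source, window)
        ) / len(source)
        if error < best_error:
            best_error, best_offset = error, offset
    return best_error, best_offset


def nearest_neighbours(source_name, data, threshold, max_error):
    """Return (error, country, offset) tuples for one feature, smallest error first."""
    source = trim_to_threshold(data[source_name], threshold)
    if not source:
        return []  # the source country never reached the threshold
    neighbours = []
    for name, series in data.items():
        if name == source_name:
            continue
        candidate = trim_to_threshold(series, threshold)
        # With all series ending on the same date, a country that reached the
        # threshold earlier has a longer trimmed series.
        if len(candidate) <= len(source):
            continue
        error, offset = alignment_error(source, candidate)
        if error <= max_error:  # plays the role of T_error(f)
            neighbours.append((error, name, offset))
    return sorted(neighbours)


# Hypothetical daily counts, all ending on the same date.
data = {
    "S": [0, 0, 0, 0, 10, 20, 40],
    "A": [0, 10, 20, 40, 80, 160, 320],
    "B": [0, 0, 0, 0, 0, 12, 15],
    "C": [5, 50, 100, 200, 400, 800, 1600],
}
neighbours = nearest_neighbours("S", data, threshold=10, max_error=0.5)
print(neighbours)
```

In this toy data, B is skipped because it reached the threshold later than S, and C is dropped by the error threshold because its counts are far larger at every offset. Running the function once per feature gives a separate neighbour list for confirmedCases and for fatalities.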

Below is a sample of the first 3 neighbours for Romania. The data used for this was last updated on 03 May 2020.

Recurrent predictor

Notebook: link

A naive model based on recurrent cells is employed. The predictor is trained only on the nearest neighbours. To limit the growth, an unsupervised loss is used to smooth out the long-term prediction. Please check this notebook for more details.
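To make the idea of a label-free smoothing term concrete, here is a small sketch in plain Python. The README does not specify the form of the loss, so the second-difference penalty, the function names, and the smooth_weight value below are all assumptions chosen for illustration. In the actual model the same expression would be written with PyTorch tensors so that gradients can flow through it.

```python
def smoothness_penalty(predictions):
    """Mean squared second difference of a predicted sequence (needs no labels)."""
    if len(predictions) < 3:
        return 0.0
    second_diffs = [
        predictions[t + 1] - 2 * predictions[t] + predictions[t - 1]
        for t in range(1, len(predictions) - 1)
    ]
    return sum(d * d for d in second_diffs) / len(second_diffs)


def total_loss(predictions, targets, smooth_weight=0.1):
    """Supervised error on the observed days plus the smoothing term on the whole horizon."""
    # zip stops at the shorter sequence, so days without ground truth are
    # covered only by the smoothing term.
    mse = sum((p - y) ** 2 for p, y in zip(predictions, targets)) / len(targets)
    return mse + smooth_weight * smoothness_penalty(predictions)


steady = smoothness_penalty([1.0, 2.0, 3.0, 4.0])       # constant slope
spiky = smoothness_penalty([0.0, 0.0, 1.0, 0.0, 0.0])   # sudden jump and drop
print(steady, spiky)
```

A sequence that changes at a constant rate receives no penalty, while abrupt changes in slope are penalised. This is one way to discourage erratic long-term forecasts beyond the days for which ground truth exists.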

Results

Below are the results obtained for confirmed cases (left) and fatalities (right) for a country with an advanced disease spread and for another with an average one. The predicted output covers a period of 60 days. The data used for this task was last updated on 26.04.2020.

References

PyTorch example - Time Sequence Prediction
