Project 2 (Working Title)

Overview

Distributing scarce resources is a common problem. In particular, test kits for the recent COVID-19 pandemic are in very short supply, yet it remains crucial to understand how many people are infected in each county. This project abstracts the problem as a classical multi-armed bandit problem. The goal is to determine whether an intelligent agent using an epsilon-greedy algorithm can outperform a naive approach when distributing test kits to counties as infections grow.
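
As a rough illustration of the epsilon-greedy idea, the sketch below picks one county ("arm") at a time: with probability `epsilon` it explores a random county, and otherwise it exploits the county with the highest running-average reward. The names `choose_arm` and `update_estimate` and the parameter `epsilon` are assumptions made for this example. They are not taken from the project's `EpsilonGreedyAgent`, which may work differently.

```python
import random
from typing import List, Optional


def choose_arm(estimates: List[float], epsilon: float,
               rng: Optional[random.Random] = None) -> int:
    """Pick an arm (county index) using the epsilon-greedy rule."""
    rng = rng or random
    if rng.random() < epsilon:
        # Explore: pick any county uniformly at random.
        return rng.randrange(len(estimates))
    # Exploit: pick the county with the highest estimated reward.
    return max(range(len(estimates)), key=lambda i: estimates[i])


def update_estimate(old_estimate: float, pulls: int, reward: float) -> float:
    """Update a running-average reward after the `pulls`-th observation."""
    return old_estimate + (reward - old_estimate) / pulls
```

With `epsilon=0.0` the rule is purely greedy, and with `epsilon=1.0` it is purely random, so values in between trade exploration against exploitation.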

For this simulation there will be a fixed number of "arms" (i.e., counties). The simulation is run in T episodes, each lasting a simulated day.

Each day the number of infected patients in a county increases at a fixed rate.

Each day a fixed number of test kits are manufactured and ready to distribute.

The mathematical formula for infection growth will be adjusted for the sake of the simulation, but it should produce an exponential increase to remain accurate. The rate may also be extrapolated from real-world data if possible.
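
One way to picture fixed-rate growth that compounds into an exponential curve is the rule "tomorrow's count equals today's count times (1 + rate)". The sketch below assumes that rule for illustration only. The function name `project_infections` and its parameters are hypothetical, and the project's actual formula may differ.

```python
from typing import List


def project_infections(initial: float, daily_rate: float, days: int) -> List[float]:
    """Return infection counts for day 0 through `days` at a fixed daily rate."""
    counts = [initial]
    for _ in range(days):
        # Each day adds a fixed fraction of the previous day's count, which
        # compounds to initial * (1 + daily_rate) ** t after t days.
        counts.append(counts[-1] * (1 + daily_rate))
    return counts
```

For example, `project_infections(100, 0.5, 3)` returns `[100, 150.0, 225.0, 337.5]`.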

A stretch goal could be to contribute an agent implementation that improves upon the epsilon-greedy approach.

Running the Project

Configuration is done inside simulation.py. Specify the counties you want to simulate, along with their populations, as the counties variable. Adjust start_date and end_date as desired.

| Option | Type | Description |
| --- | --- | --- |
| `counties` | `List[County]` | The list of counties to simulate. Each county must have its population specified in its constructor. |
| `start_date` | `datetime` | The date from which to start the simulation (e.g., `datetime(2020, 1, 1)`). |
| `end_date` | `datetime` | The date at which to end the simulation (e.g., `datetime(2020, 3, 23)`). |
| `num_test_kits_per_day` | `int` | The number of test kits available for distribution per day. |
| `agent` | `Agent` | The type of agent to use in the simulation. Use `NaiveAgent` or `EpsilonGreedyAgent`. |
| `test_kit_evaluator` | `TestKitEvaluator` | The implementation of the class used to evaluate the test kits. Use `RandomTestKitEvaluator` or `PandasTestKitEvaluator`. |
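
A configuration along these lines might look like the sketch below. The `County` dataclass here is a hypothetical stand-in so that the example runs on its own; the project's real `County` constructor may take different arguments. The county names, populations, and kit count are placeholders, not recommended settings.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List


@dataclass
class County:
    """Hypothetical stand-in for the project's County class."""
    name: str
    population: int


# Placeholder values chosen purely for illustration.
counties: List[County] = [
    County("Example County A", population=100_000),
    County("Example County B", population=250_000),
]
start_date = datetime(2020, 1, 1)
end_date = datetime(2020, 3, 23)
num_test_kits_per_day = 500

# `agent` and `test_kit_evaluator` would also be set in simulation.py.
# Their constructor arguments are not documented here, so they are omitted.

# Number of simulated days between the two dates.
simulated_days = (end_date - start_date).days
```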

Requirements are expressed using standard Python setuptools. To install the dependencies, run:

python3 setup.py install --user

To run the simulation, run:

python3 simulation.py

Daily reported infections are printed to STDOUT. At the end of the simulation, a graph is generated as a PNG file stamped with the current timestamp. Agents are evaluated on how well they minimize the difference between the total positive results measured across all counties and the total actual infections across all counties.
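
The evaluation criterion can be sketched as a single number, where smaller is better. This is a minimal illustration that assumes the difference is taken as an absolute value over per-county totals. The function name `detection_gap` and the dictionary inputs are hypothetical and do not reflect the project's scoring code.

```python
from typing import Dict


def detection_gap(measured_positives: Dict[str, int],
                  actual_infections: Dict[str, int]) -> int:
    """Absolute gap between total measured positives and total actual infections."""
    total_measured = sum(measured_positives.values())
    total_actual = sum(actual_infections.values())
    return abs(total_actual - total_measured)
```

For instance, measuring 3 and 5 positives in two counties that actually have 10 and 8 infections gives a gap of 10.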

Requirements

This project is built with Python 3.8 and has also been tested with Python 3.6.

Task Environment (PEAS)

Performance Measure - The reward function is 1 point for a negative test and 10 points for a positive test. The agent's reward is the sum of these points over the test results from each county.
Environment - The environment will be:
- partially observable - we can only measure the infection rate of counties that tests are distributed to
- stochastic - the next state of the environment is determined by the previous state, the current action, and a random component in the growth of infections
- episodic - each round of the simulation represents a day
- sequential - the agent is aware of the previous "arm" reward, which feeds into the exploration vs. exploitation trade-off
- dynamic - the number of infections in each county may increase regardless of the chosen "arm"
- discrete - a fixed number of test kits is distributed among a fixed number of counties in the simulation
- single-agent - we only simulate the distribution of test kits, which is controlled by our single agent
Actuators - The agent interacts with the environment by distributing test kits
Sensors - The results of the distributed test kits from the previous round are reported back to the agent.
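
The performance measure above (1 point per negative test, 10 points per positive test) can be sketched as follows. The constant and function names are assumptions made for this example, and representing a test result as a boolean is likewise an illustrative choice.

```python
from typing import Iterable

NEGATIVE_REWARD = 1   # points for a negative test
POSITIVE_REWARD = 10  # points for a positive test


def reward(test_results: Iterable[bool]) -> int:
    """Sum the reward over test results, where True means a positive test."""
    return sum(POSITIVE_REWARD if positive else NEGATIVE_REWARD
               for positive in test_results)
```

One positive and two negative results therefore score 10 + 1 + 1 = 12 points.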

Data

Data is pulled from https://www.nytimes.com/article/coronavirus-county-data-us.html, which includes county-by-county historical infection data.
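
As a hedged sketch of how such county-level data could be read, the example below parses CSV text with the standard library. It assumes the column layout `date, county, state, fips, cases, deaths` with cumulative case counts. The sample rows are made up for illustration and are not real figures, and the function name `cumulative_cases_by_county` is hypothetical.

```python
import csv
import io
from collections import defaultdict
from typing import Dict, Tuple

# Made-up sample rows in the assumed column layout; not real data.
SAMPLE_CSV = """date,county,state,fips,cases,deaths
2020-03-01,Example,Florida,00001,2,0
2020-03-02,Example,Florida,00001,5,0
2020-03-02,Sample,Florida,00002,1,0
"""


def cumulative_cases_by_county(csv_text: str) -> Dict[Tuple[str, str], Dict[str, int]]:
    """Map each (county, state) pair to {date: cumulative case count}."""
    cases: Dict[Tuple[str, str], Dict[str, int]] = defaultdict(dict)
    for row in csv.DictReader(io.StringIO(csv_text)):
        # County names repeat across states, so key on both fields.
        key = (row["county"], row["state"])
        cases[key][row["date"]] = int(row["cases"])
    return dict(cases)
```

Keying on both county and state avoids merging counties that share a name in different states.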


joemccall86/cap5600-project2
