The COVID pandemic has been an event unlike anything faced in recent history. For a year much of the world came to a crawl as people theorized when we might be able to return to "normal". Finally as vaccines became available it seemed like that might actually be a reality. However, a new, somewhat unforeseen, challenge soon emerged: getting the necessary amount of people to get fully vaccinated to reach herd immunity. So why, and where, are we having so much trouble getting the vaccine rolled out in the United States? Who is hesitant to get vaccinated? In this project we will look at the demographics of counties all across the country along with vaccine hesitancy and political voting data to try and answer these questions.
This repository contains code cleaning and aggregating vaccination and demographic data from the CDC, vaccination data from Texas Health and Human Services, population data from the US Census, and 2016 election voting data from the MIT Election Lab. We will also look for correlations between vaccination rate and these variables as well as map them looking for trends geographically.
The packages used in this analysis are listed in the requirements file in this repository and are: pandas, numpy, matplotlib, seaborn, geopandas, swifter, and shapely.
- Install Python 3 via Anaconda or other method.
- In command line navigate to the directory containing the requirements text file and run
pip install -r requirements_vacc.txt
to install the required packages.
- Clone the repo for this project to your computer ([instructions])(https://docs.github.com/en/github/creating-cloning-and-archiving-repositories/cloning-a-repository-from-github/cloning-a-repository)
- Run the python file
COVID_demo_geo_analysis.py
from the repo
I've uploaded my Jupyter Notebook to the repo, so if you just want to look at the code with more detailed explainations take a look at that file (COVID_Analysis-May23_Data.ipynb
)
CDC 'Vaccine Hesitancy for COVID-19: County and local estimates' dataset found here US Census county population data found here 2010-2016 Presidential Election voting records from the MIT Election Lab found here Texas county vaccination data from the Texas Health and Human Services found here.
All of these datasets are also availabilty in this repository, accurate as of May 23, 2021.