The purpose of this project is to generate descriptive statistics and look into trends of movie and TV show releases on Netflix over the years. This dataset is from Kaggle
release_year | |
---|---|
count | 8807.000000 |
mean | 2014.180198 |
std | 8.819312 |
min | 1925.000000 |
25% | 2013.000000 |
50% | 2017.000000 |
75% | 2019.000000 |
max | 2021.000000 |
This project contains:
- requirements.txt detailing the requirements needed for this project
- Makefile to install requirements; lint with Ruff; format with black; and test your notebook, script, and lib files
- github actions with separate YAML files for Install, Lint, Format, and Test in the github workflows folder
- DockerFile and devcontainer for environment set up
- Jupyter notebook performing descriptive statistics and tested with nbval
- script.py Python script for statistics and generating data visualizations
- test_script.py to test the script
- lib.py Python script file that shares code between the script and notebook
- test_lib.py to test library script
- Summary pdf containing the walkthrough and conclusions found in the data analysis
- Open codespaces
- Load repo to code spaces
- Wait for installation of all requirements in requirements.txt
(alternatively, do all with make all
)
Here is the link to my demo video