Ella Kennedy, Manjary Muruganandan, Kiranjit Nagra, Aryan Chopra, Achchala Deepan
Steps to get started:
- run pip install -r requirements.txt to install dependencies in requirements.txt
- Download the rideshare_kaggle.csv, cleaned_data.csv, and the data_feature_engineering.csv file from the zipped data file uploaded to Learn
- Add the rideshare_kaggle.csv (which is uploaded on Learn) to this folder for the data preprocessing steps in uber_predictive_pricing.ipynb to the data folder in this repository
- Add the cleaned_data.csv file (which is generated with the data preprocessing steps and also uploaded on Learn) into the data folder for the feature engineering steps in uber_predictive_pricing.ipynb
- Add the data_feature_engineering.csv (which is generated at the end of the feature engineering steps and also uploaded on Learn) into the data folder, to be used the rest of the project (test/train split, modelling) in uber_predictive_pricing.ipynb
Empirical Evaluation: collecting data and proposing ML solutions to an area of real-life application
The general focus of this project is to create an optimal predictive pricing model to forecast Uber rideshare services, with a goal of enhancing current dynamic pricing strategies. This stems from the observation that traditional pricing models often do not fully account for the plethora of factors influencing ride costs, which can lead to lessened user satisfaction. By leveraging the comprehensive Boston, MA Uber and Lyft Kaggle dataset, our project seeks to incorporate a wider range of variables into the pricing algorithm.