Using decision tree for New Yorks taxi tip prediction.
-
Updated
Sep 2, 2023 - Jupyter Notebook
Using decision tree for New Yorks taxi tip prediction.
Developed a machine learning model to predict whether taxi passengers π will give a generous tip (β₯ 20%) πΈ based on trip and payment data. Utilized Python π, Pandas πΌ, NumPy π’, Scikit-learn π€, and XGBoost π for data preprocessing, feature engineering, modeling, and evaluation.
A decision tree regressor from SnapML by IBM was used to predict taxi tips on a dataset from the NYC TL Commission. The model was trained on over 3 million data samples in 0.636 seconds using multithreaded CPU/GPU acceleration. Achieved mean squared error = 1.62 on test data.
Add a description, image, and links to the taxi-tip-prediction topic page so that developers can more easily learn about it.
To associate your repository with the taxi-tip-prediction topic, visit your repo's landing page and select "manage topics."