Optimisation and Deep Learning course project
Kaggle Dataset: https://www.kaggle.com/geomack/spotifyclassification
- Data Understanding
- EDA
- Data Transformation: Feature Scaling using RobustScalar
- Feature Importance: Tree models - Decision Tree, Random Forest and Gradient Boosting
- Modelling: Logistic Regression
- Tuning: GridSearchCV & RandomizedSearchCV
- There is no strong correlation between likeability of the songs vs. the songs attributes present in the dataset chosen
- The data was then tuned using the aforementioned tuning methods including data splitting (80 - 20) and choosing the features shown during Feature importance.