The goal is to predict whether an applicant will be able to repay a loan, based on their payment behavior and demographics.
Part 1 - Define
- Why is borrowers' repayment ability a problem?
Part 2 - Discovery
- Load the data
- Data Quality Check
- Explore the data (EDA)
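The data quality check could look something like the sketch below. It is only an illustration: the toy DataFrame, its column names (`income`, `age`, `defaulted`), and the `quality_report` helper are all hypothetical and are not taken from the project's dataset.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data; the column names are illustrative only
df = pd.DataFrame({
    "income": [52000.0, 61000.0, np.nan, 48000.0, 61000.0],
    "age": [34, 45, 29, 51, 45],
    "defaulted": [0, 1, 0, 0, 1],
})


def quality_report(frame):
    """Summarize dtype, missing values, and distinct values per column."""
    return pd.DataFrame({
        "dtype": frame.dtypes.astype(str),
        "n_missing": frame.isna().sum(),
        "n_unique": frame.nunique(),  # NaN is not counted as a value
    })


report = quality_report(df)

# Count rows that exactly repeat an earlier row
n_duplicates = int(df.duplicated().sum())

print(report)
print("duplicate rows:", n_duplicates)
```

A per-column summary like this makes it easier to decide which columns need imputation or removal before the exploration step.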
Part 3 - Develop
- Engineer features
- Encode features
- Split data into train and test sets
- Logistic Regression
- Decision Tree
- Random Forest
- Gradient Boosting
- ROC Graph
- Confusion Matrix
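The steps in Part 3 can be sketched roughly as follows with scikit-learn. This is a minimal illustration under stated assumptions, not the project's actual code: the data is synthetic, the column names (`income`, `late_payments`, `housing`, `default`) and the rule that generates the target are hypothetical, and the hyperparameters are mostly library defaults.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data; column names and the target rule are hypothetical
rng = np.random.default_rng(0)
n = 500
df = pd.DataFrame({
    "income": rng.normal(50_000, 15_000, n),
    "late_payments": rng.poisson(1.0, n),
    "housing": rng.choice(["own", "rent", "family"], n),
})
# Assumed relationship: more late payments raise the chance of default
p_default = 1 / (1 + np.exp(-(df["late_payments"] - 1.5)))
df["default"] = (rng.random(n) < p_default).astype(int)

# Encode the categorical column as one-hot indicator columns
X = pd.get_dummies(df.drop(columns="default"), columns=["housing"], dtype=float)
y = df["default"]

# Hold out 20% of the rows, keeping the class balance in both splits
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

models = {
    # Scaling helps the logistic regression solver converge
    "Logistic Regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "Decision Tree": DecisionTreeClassifier(max_depth=4, random_state=0),
    "Random Forest": RandomForestClassifier(random_state=0),
    "Gradient Boosting": GradientBoostingClassifier(random_state=0),
}

results = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores = model.predict_proba(X_test)[:, 1]  # probability of default
    results[name] = {
        "auc": roc_auc_score(y_test, scores),
        "confusion": confusion_matrix(y_test, model.predict(X_test)),
    }

for name, res in results.items():
    print(f"{name}: AUC = {res['auc']:.3f}")
    print(res["confusion"])
```

To draw the ROC graph itself, `sklearn.metrics.roc_curve(y_test, scores)` returns the false positive rates and true positive rates, which can be plotted with Matplotlib. The area under that curve is the AUC value reported above.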
-
Python (Jupyter Notebook); packages: NumPy, Pandas, Matplotlib, Seaborn, scikit-learn
-
Tableau
Kaggle