This is an exploratory project in which I apply and compare different ML models and techniques (minimal sketches of a few of them follow the list below), including:
- Feature Engineering:
  - One-hot encoding for categorical features
  - Normalization/standardization
  - Imputation
  - Feature expansion
  - Feature reduction:
    - Feature hashing
    - Feature selection
    - Principal component analysis (PCA)
    - Feature discretization with Decision Trees or Random Forests
- Machine Learning Models:
  - Logistic Regression
  - Decision Trees, Random Forests, Gradient-Boosted Decision Trees
  - k-Nearest Neighbors
  - Support Vector Machines
  - Neural Networks
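As a taste of the preprocessing-plus-model pattern, here is a minimal sketch using scikit-learn's `ColumnTransformer` and `Pipeline`. The column names `f1`, `f2`, `f3` are hypothetical stand-ins, not the competition's actual columns:

```python
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

numeric_cols = ["f1", "f2"]   # hypothetical numeric features
categorical_cols = ["f3"]     # hypothetical categorical feature

preprocess = ColumnTransformer([
    # Impute missing numeric values with the median, then standardize.
    ("num", Pipeline([
        ("impute", SimpleImputer(strategy="median")),
        ("scale", StandardScaler()),
    ]), numeric_cols),
    # One-hot encode categorical features, ignoring categories unseen at fit time.
    ("cat", OneHotEncoder(handle_unknown="ignore"), categorical_cols),
])

model = Pipeline([
    ("preprocess", preprocess),
    ("clf", LogisticRegression(max_iter=1000)),
])
# model.fit(X_train, y_train) once the data is loaded.
```

Keeping every step in a single `Pipeline` means the imputation and scaling statistics are fitted on the training folds only during cross-validation, avoiding leakage.

For feature reduction, a minimal PCA sketch on synthetic data; PCA is scale-sensitive, so this assumes the features have already been standardized:

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 50))   # synthetic stand-in for the processed features

pca = PCA(n_components=0.95)      # keep enough components for 95% of the variance
X_reduced = pca.fit_transform(X)
print(X_reduced.shape, pca.explained_variance_ratio_.sum())
```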
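And a minimal sketch of tree-based feature discretization, again on synthetic data: each sample is mapped to the leaf it reaches in every tree of a Random Forest, and the leaf indices are one-hot encoded into new binary features.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.preprocessing import OneHotEncoder

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 10))          # synthetic features
y = rng.integers(0, 2, size=500)        # synthetic binary target

forest = RandomForestClassifier(n_estimators=20, max_depth=4, random_state=0)
forest.fit(X, y)

leaves = forest.apply(X)                # (n_samples, n_trees) leaf indices
X_discrete = OneHotEncoder().fit_transform(leaves)
print(X_discrete.shape)
```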
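The discretized features can then be fed to a linear model such as Logistic Regression, a common way to give a linear classifier access to the nonlinear splits the trees discovered.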
The data comes from the Kaggle competition Loan Default Prediction.
Built with Python 3, numpy, pandas, scikit-learn, matplotlib, xgboost, tensorflow, and keras.
Download `train_v2.csv` from https://www.kaggle.com/c/loan-default-prediction/data and put it in the `loan-default-prediction` directory. Running the first Jupyter notebook, `LDP 01 - Data Preprocessing.ipynb`, produces a few processed CSV files that the subsequent notebooks use as their training data. Beyond that, the notebooks have no strong dependencies on one another.
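A minimal sketch of the first step, assuming you run from the repository root so that `train_v2.csv` sits under `loan-default-prediction/`:

```python
import pandas as pd

# Load the raw competition data and get a quick sense of its size and
# missingness (which motivates the imputation step above).
df = pd.read_csv("loan-default-prediction/train_v2.csv")
print(df.shape)
print(df.isna().mean().sort_values(ascending=False).head())
```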