🛡️🏦 BankShield: Safeguarding Customer Relationships

BankShield is a bank customer churn prediction project. It compares several machine learning algorithms to find the model that best predicts customer churn, so that valuable customer relationships can be safeguarded.

🏁 Overview

Customer churn is a critical issue in the banking sector. BankShield predicts which customers are likely to leave the bank, so that proactive measures can be taken to retain them. Multiple machine learning algorithms are trained and compared to find the most effective model for churn prediction.

⚙️ Installation

To run this project locally, follow these steps:

Clone the repository:

```
git clone https://github.com/mayurd8862/Bank-Customer-Churn-Prediction.git
```

Navigate to the project directory:
```
cd Bank-Customer-Churn-Prediction
```
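Create a virtual environment and activate it (the activation command below is for Windows; on macOS/Linux use `source env/bin/activate` instead):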
```
python -m venv env
env\Scripts\activate
```
Install the required packages:
```
pip install -r requirements.txt
```
Train and save the best model:
```
python test.py
```
Run the web app user interface:
```
python app.py
```

🔬 Methodology

5.1. Problem Statement

The problem is to develop a machine learning model that predicts bank customer churn based on various customer attributes and transaction history.

5.2. Data

The dataset consists of more than 10,000 rows, one per customer, with 14 columns. The columns include:

CustomerId: Unique identifier for each customer.
Surname: Customer's last name.
CreditScore: Customer's credit score.
Geography: Country of the customer.
Gender: Gender of the customer.
Age: Age of the customer.
Tenure: Number of years the customer has been with the bank.
Balance: Account balance of the customer.
NumOfProducts: Number of bank products the customer is using.
HasCrCard: Whether the customer has a credit card (1: Yes, 0: No).
IsActiveMember: Whether the customer is an active member (1: Yes, 0: No).
EstimatedSalary: Estimated salary of the customer.
Exited: Whether the customer has churned (1: Yes, 0: No).
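
As an illustration of how these columns might be turned into model inputs, here is a minimal sketch using pandas. It is not taken from the project's code: the `prepare_features` helper is hypothetical, the four sample rows are made up, and dropping the identifier columns and one-hot encoding `Geography` and `Gender` are assumed preprocessing choices.

```python
import pandas as pd

# Tiny in-memory stand-in for the real dataset; all values are made up.
df = pd.DataFrame({
    "CustomerId": [1, 2, 3, 4],
    "Surname": ["Ito", "Diaz", "Berg", "Khan"],
    "CreditScore": [619, 608, 502, 699],
    "Geography": ["France", "Spain", "France", "Germany"],
    "Gender": ["Female", "Female", "Male", "Male"],
    "Age": [42, 41, 42, 39],
    "Tenure": [2, 1, 8, 1],
    "Balance": [0.0, 83807.86, 159660.80, 0.0],
    "NumOfProducts": [1, 1, 3, 2],
    "HasCrCard": [1, 0, 1, 0],
    "IsActiveMember": [1, 1, 0, 0],
    "EstimatedSalary": [101348.88, 112542.58, 113931.57, 93826.63],
    "Exited": [1, 0, 1, 0],
})


def prepare_features(frame):
    """Split a churn table into a feature matrix X and a target y (hypothetical helper)."""
    # Assumption: identifier columns are unlikely to help prediction, so drop them.
    frame = frame.drop(columns=["CustomerId", "Surname"], errors="ignore")
    y = frame["Exited"]
    X = frame.drop(columns=["Exited"])
    # One-hot encode the two categorical columns so every feature is numeric.
    X = pd.get_dummies(X, columns=["Geography", "Gender"])
    return X, y


X, y = prepare_features(df)
print(list(X.columns))
```

With the real data, the same helper would be applied to the frame returned by `pd.read_csv` on the project's dataset file.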

5.3. Algorithms Used

We implemented and compared the following algorithms to determine the best performer:

Logistic Regression
K-Neighbors Classifier
Random Forest Classifier
AdaBoost Classifier
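
The comparison of these four algorithms could be sketched with scikit-learn as follows. This is an assumption-laden illustration rather than the project's training script: the data is synthetic (generated with `make_classification`, with an arbitrary class ratio), the hyperparameters are library defaults or arbitrary, and ranking by F1 is just one possible selection rule.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic, imbalanced stand-in for the churn data (not the real dataset).
X, y = make_classification(
    n_samples=1000, n_features=10, weights=[0.8, 0.2], random_state=42
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# Scale features for the distance- and gradient-based models.
models = {
    "Logistic Regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "K-Neighbors Classifier": make_pipeline(
        StandardScaler(), KNeighborsClassifier()
    ),
    "Random Forest Classifier": RandomForestClassifier(
        n_estimators=100, random_state=42
    ),
    "AdaBoost Classifier": AdaBoostClassifier(random_state=42),
}

# Fit each model on the same split and record its test-set scores.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    preds = model.predict(X_test)
    scores[name] = {
        "accuracy": accuracy_score(y_test, preds),
        "f1": f1_score(y_test, preds),
    }

# Assumed selection rule: pick the model with the highest F1 on the test split.
best_name = max(scores, key=lambda n: scores[n]["f1"])
for name, s in scores.items():
    print(f"{name}: accuracy={s['accuracy']:.3f} f1={s['f1']:.3f}")
print("Best by F1:", best_name)
```

Evaluating every model on the same held-out split keeps the comparison fair; on imbalanced churn data, accuracy alone can look high even when few churners are caught, which is why a second metric is tracked here.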

5.4. Model Comparison

Each algorithm's performance was evaluated on accuracy, precision, recall, F1-score, and the area under the ROC curve (AUC-ROC). The comparison helped identify the most effective model for predicting customer churn.
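
To make the threshold-based metrics concrete, the sketch below computes accuracy, precision, recall, and F1 from the confusion-matrix counts for a small set of made-up labels. The `classification_metrics` helper and the example labels are hypothetical; AUC-ROC is left out because it needs predicted probabilities rather than hard labels.

```python
def classification_metrics(y_true, y_pred):
    """Compute accuracy, precision, recall, and F1 for binary labels (1 = churned)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # churners caught
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # stayers left alone
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # churners missed

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Made-up example: 3 actual churners among 10 customers.
y_true = [1, 0, 0, 1, 0, 0, 1, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 1, 1, 0, 0, 0]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

In this example the model is right on 8 of 10 customers (accuracy 0.8), but it catches only 2 of the 3 churners (recall 2/3) and 2 of its 3 churn predictions are correct (precision 2/3), which shows why accuracy is not reported alone.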

🛠️ Pipeline

The MLOps (Machine Learning Operations) pipeline is an end-to-end workflow for developing and deploying a web application that performs data preprocessing, model training, model evaluation, and prediction. The pipeline uses Docker containers to encapsulate the code, artifacts, and frontend of the application. The application is deployed on AWS for cloud hosting.

🏆 Results

The results of the different algorithms are compared in terms of their performance metrics. The algorithm with the highest accuracy and best overall metrics is chosen as the final model.

🤝 Contributing

Contributions are welcome! Please fork the repository and create a pull request with your changes.

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.
