Skip to content

Customer Segmentation and Purchase Prediction project using clustering and classification techniques on e-commerce data. Built by Pratik Bokade using Python, Pandas, Scikit-learn, and Seaborn.

Notifications You must be signed in to change notification settings

Pratik-coder-arch/E-commerce-Customer-Segmentation-and-Prediction-BIA-Project

Repository files navigation

πŸ›οΈ E-commerce Customer Segmentation & Purchase Prediction

This project explores customer segmentation and prediction of purchase behavior using machine learning techniques on e-commerce customer data. It was developed by Pratik Bokade as part of a data analytics learning journey.


πŸ“Œ Objective

To analyze e-commerce customer behavior, segment users based on purchasing patterns, and predict the likelihood of future purchases using clustering and classification models.


🧠 What This Project Covers

  • 🧹 Data Cleaning and Preprocessing
  • πŸ“Š Exploratory Data Analysis (EDA)
  • πŸ” Customer Segmentation using KMeans Clustering
  • πŸ“ˆ Purchase Prediction using Logistic Regression, Random Forest, and XGBoost
  • πŸ“‰ Model Evaluation using Accuracy, Precision, Recall, and Confusion Matrix
  • πŸ“Œ Insights and Actionable Recommendations for marketers

πŸ› οΈ Tools & Technologies Used

  • Python 3.9+
  • Pandas – Data manipulation
  • NumPy – Numerical operations
  • Matplotlib & Seaborn – Data visualization
  • Scikit-learn – ML models and evaluation
  • XGBoost – Gradient boosting classification
  • Jupyter Notebook – Development environment

πŸ“‚ Project Structure


πŸ“Š Key Visualizations

  • RFM (Recency, Frequency, Monetary) Analysis
  • Clustered customer segments
  • Correlation heatmap
  • Feature importance for prediction model

βœ… Results

  • Identified 3–5 customer segments using KMeans clustering.
  • Best performing model: Random Forest with over 85% accuracy in predicting purchases.
  • Derived actionable insights for targeted marketing and personalization.

πŸ“ˆ Future Improvements

  • Integrate with real-time customer databases or dashboards (Power BI/Tableau).
  • Implement deep learning models for advanced predictions.
  • Create a recommendation engine for product suggestions.

πŸ“š Learnings

This project helped in strengthening my knowledge of:

  • Customer behavior modeling
  • Supervised vs. unsupervised learning
  • Real-world implementation of ML models in business contexts

πŸ™‹β€β™‚οΈ Author

Pratik Bokade
πŸ“§ Email Me
πŸ”— LinkedIn


⭐ If you found this project useful or inspiring, please give it a star!

About

Customer Segmentation and Purchase Prediction project using clustering and classification techniques on e-commerce data. Built by Pratik Bokade using Python, Pandas, Scikit-learn, and Seaborn.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published