Skip to content

Latest commit

 

History

History
31 lines (16 loc) · 965 Bytes

File metadata and controls

31 lines (16 loc) · 965 Bytes

Project Overview

This project aims to segment customers based on their spending behavior and demographic information using clustering techniques. The workflow consists of data preprocessing, exploratory data analysis (EDA), and implementing K-Means and Hierarchical clustering.

  1. Data Preprocessing

Handle missing values and format date fields.

Encode categorical variables (Education, Marital_Status).

Standardize numerical features to ensure optimal clustering performance.

  1. Exploratory Data Analysis (EDA)

Analyze spending behavior using boxplots.

Generate a correlation heatmap to explore feature relationships.

  1. Clustering Techniques

A. K-Means Clustering

Use the Elbow Method to determine the optimal number of clusters.

Fit the K-Means algorithm (k=4) and visualize clusters using pair plots.

B. Hierarchical Clustering

Apply Ward’s method to form clusters.

Generate a Dendrogram (50 samples) for better readability and interpretation.