This is an analysis to predict customer purchase behavior based on an analysis in customer purchase intention. Slide presentation about this project available to read here. Dataset in this analysis is part of the research by Sakar, Polat, and Katircioglu (2018). Here's the detailed information about it.
Understanding consumer behavior is one of the most crucial components to increase the growth of revenue. Based on data, consumers who finish the transaction are less than 20%. For this reason, understanding consumers' purchase intention will be a foundational information to understand users and source to develop a strategy to maximizing revenue.
Understanding how customers move down the funnel to identify what to fix in the process to increase the number of transactions by answering several questions, such as:
- Trend of Revenue by Month
- Customers Engagement (Page View and Page Duration) by Month
- Customers Engagement (Page View and Page Duration) by Visitor’s Type
- Is Engagement Performance Related to the Operating System?
The analysis started with an understanding of variables in the dataset through Exploratory Data Analysis. After that, consumers will be clustered based on similar behavior using k-means. All of the code related to analysis are available in the following link:
- Descriptive analysis
- Graph analysis
- Handling data imbalance
- Handling data outliers
Three is the best number for clustering consumers yielded by silhouette analysis.
Data has many outliers and noise
Analysis Conclusion
- Based on several variables used in clustering, three variables have the same value. Therefore, they're ineffective to be used as a basis of behavioral analysis. These variables are Informational, Informational Duration, and Page Values.
- Exit Rates and Bounce Rates have a high (0.91) correlation value based on heatmap analysis. It also happens in clustering analysis, the higher the bounce rates, the higher the exit rates.
- Page View and Page Duration in each cluster show a correlation because the more consumers access a page, the more time they spend.
Business Recomendation
Based on the analysis, the website's performance influences the amount of earned revenue. It can be improved by redesigning the user interface to create a better user experience.
- Customer Intention
- Bounce Rates
- Data Cleansing
- K-means
- PCA
- Output Analysis