This repository represents a set of query and my analysis of eCommerce database in pdf file. I've stored all queries in one file, there are end-to-end query processes of this project. The purpose of this project is analyzing eCommerce business performance which the output is just calculation of various important KPIs/metrics and visualization also the interpretation of each calculation.
The database has 8 datasets which contain information of orders, order_items, order_payments, order_reviews, customers, product, seller and geolocation. However, I didn't use all of it's datasets; depends on metrics that I look for.
If any of you curious about the database or wanna to try by yourself, feel free to access the database from here.
I've used various tools on this project. Since the objective of this project was analyzing with SQL so I've used postgreSQL as my RDBMS platform. Then for visualization I've used Jupyter Notebook with python programming language.
Including the processes of generate tables, import it's data/attributes/values, define primary and foreign key and generate ERD (Entity Relationship Diagram).
Including the calculation processes of MAU (Monthly Active User), new customers and repeat order customers.
Including the calculation processes of top product category, top product revenue, most canceled product, most canceled product's order numbers, and total canceled customers. All of revenue's currency is set as '$'.
Including the calculation processes of customer's payment type favorite.
I presented the result of each progress in Bahasa Indonesia.