Olist is Brazil's largest department store, connecting small businesses from all over Brazil. The data includes 100K orders placed between October 2016 and October 2018. The database covers orders and their associated customers, sellers, payment methods, products, and locations.

In this project, we first construct the database from scratch, then use PostgreSQL to perform an in-depth analysis of customers, sellers, and their orders, gaining insights into order volume, delivery time, valuable customer markets, important sellers by revenue, and more.

We perform analysis on four major categories of data provided by Olist, along with some miscellaneous analysis:

Analysis | Markdown | Notebook |
---|---|---|
Frequency of Orders | md | notebook |
Customers | md | notebook |
Sellers | md | notebook |
Products | md | notebook |
Miscellaneous | md | notebook |
- Install PostgreSQL and pgAdmin4.
- Clone the repo.
- Download the CSVs from [here](https://wagon-public-datasets.s3.amazonaws.com/olist/olist.zip) and unzip them inside `olist_data/`.
- Open pgAdmin4, go to Server > LOCAL, right-click, choose CREATE, and enter the database name `olist`.
- Go to the `olist` database, right-click again, and open the Query Tool.
- Click the folder icon, go to the downloaded folder, and open `create_table.sql`. Select the entire file and execute it; this creates all required tables.
- Go to the folder again and open `import_data.sql`. Replace every path with the path to the downloaded data on your computer.
- Select all the code and execute it; this imports all the data.
- Plug and play with the queries or do your own analysis.
You can get some inspiration by looking at `Frequency_analysis_of_orders.ipynb` to begin with!
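
If you prefer to script the download and path-replacement steps from the setup list instead of doing them by hand, the following is a minimal Python sketch. It is not part of the repo: the helper names `download_and_extract`, `extract_zip`, and `rewrite_paths` are hypothetical, and it assumes that `import_data.sql` contains literal file paths sharing a common prefix that you can swap for your own folder.

```python
import urllib.request
import zipfile
from pathlib import Path

# URL taken from the download step in the setup list.
DATASET_URL = "https://wagon-public-datasets.s3.amazonaws.com/olist/olist.zip"


def extract_zip(zip_path, dest_dir):
    """Unpack zip_path into dest_dir and return the archive's member names."""
    dest_dir = Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest_dir)
        return archive.namelist()


def download_and_extract(url, dest_dir):
    """Download the archive at url into dest_dir, then unpack it there."""
    dest_dir = Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    zip_path = dest_dir / "olist.zip"
    urllib.request.urlretrieve(url, str(zip_path))
    return extract_zip(zip_path, dest_dir)


def rewrite_paths(sql_text, old_prefix, new_prefix):
    """Replace every occurrence of old_prefix in the SQL text with new_prefix.

    Assumption: the paths in import_data.sql share one common prefix.
    """
    return sql_text.replace(old_prefix, new_prefix)


# Example usage (performs a network download, so it is left commented out):
# download_and_extract(DATASET_URL, "olist_data")
#
# The old prefix below is a placeholder; use whatever path appears in your
# copy of import_data.sql.
# sql = Path("import_data.sql").read_text()
# sql = rewrite_paths(sql, "/old/prefix", str(Path("olist_data").resolve()))
# Path("import_data_local.sql").write_text(sql)
```

Writing the result to a separate file (here the hypothetical `import_data_local.sql`) keeps the original script untouched, so you can open the rewritten copy in the pgAdmin4 Query Tool and execute it as described above.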