This project is the culmination of a Database Bootcamp at Altia. It uses Snowflake and the TPCH sample database to simulate a delivery business database. The project employs DBT Cloud for data modeling.
To build the project, the following steps were undertaken:
- Analysis and Understanding: Thorough analysis and comprehension of the TPCH database structure and data.
- Objective Setting: Establishing the project's objectives.
- RAW Data in Snowflake: Setting up raw data in Snowflake, designed to receive data from TPCH incrementally, simulating ongoing order receipt.
- Dimensional Modeling: Building and implementing a dimensional model using DBT.
- Aggregate Tables: Creating aggregate tables to facilitate analytics.
- Snowflake Dashboard: Developing a dashboard in Snowflake.
- User Access: Creating a customer user and granting access to the analytics and dashboard.
In addition to the TPCH database, custom tables for Shops and Events were created. These tables simulate different real-world scenarios such as varying currency exchange rates and different time zones. This allows for more robust testing and analysis of the data under diverse conditions.
The dashboard includes graphics and tables for data analysis. It features several filters for easy data manipulation.
This project demonstrates the integration of Snowflake and DBT Cloud to manage and analyze delivery business data effectively.