DE Project - Simple ELT pipeline that gets data from NY Taxi Trips, transforms it, and makes the information available for further analysis.

warzinnn/ny-taxi-data

NY Taxi Data - DE Project

Simple ELT pipeline that gets data from NY Taxi Trips, transforms it, and makes the information available for further analysis.

Overview

In this project, cloud resources are created and managed with Terraform. Workflow orchestration is handled by Prefect, which coordinates the Python ETL and dbt (data transformation) along with the Google Cloud Platform integrations used to communicate with cloud services (GCS, BigQuery). The project also includes a Discord integration that sends a notification every time the deployment is run. The Docker images that containerize the Prefect server and the Prefect agent were pushed to Google Artifact Registry and then used with Google Compute Engine to set up the compute instance that runs the Prefect server and the Prefect agent. Finally, the data is served in Looker Studio.
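The stage ordering described above (extract, load, transform, then notify) can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the project's actual Prefect flow: the function names, the sample rows, and the notification text are all hypothetical, and plain functions stand in for Prefect tasks, GCS/BigQuery loads, and the dbt run. It also assumes the Discord integration posts a JSON body with a `content` field, as Discord webhooks accept; the README does not say how the notification is actually sent.

```python
import json
from typing import Callable, Dict, List

State = Dict[str, object]


def extract(state: State) -> State:
    # Hypothetical stand-in for downloading NY Taxi trip data.
    rows = [{"trip_id": 1, "fare": 12.5}, {"trip_id": 2, "fare": 7.0}]
    return {**state, "rows": rows}


def load(state: State) -> State:
    # Hypothetical stand-in for writing raw files to GCS and loading them into BigQuery.
    return {**state, "loaded_rows": len(state["rows"])}


def transform(state: State) -> State:
    # Hypothetical stand-in for the dbt run that builds analysis-ready models.
    total_fare = sum(row["fare"] for row in state["rows"])
    return {**state, "total_fare": total_fare}


def build_discord_payload(state: State) -> str:
    # Assumption: the notification is a JSON body with a "content" field.
    message = f"Deployment run finished: {state['loaded_rows']} rows loaded."
    return json.dumps({"content": message})


def run_pipeline(stages: List[Callable[[State], State]]) -> State:
    # Run each stage in order, passing the accumulated state along.
    state: State = {}
    for stage in stages:
        state = stage(state)
    return state


# ELT order: the raw data is loaded before it is transformed.
result = run_pipeline([extract, load, transform])
payload = build_discord_payload(result)
```

In the real project each stage would presumably be a Prefect task inside a flow, and the payload would be sent over the network rather than only built in memory.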

Pipeline Flow

pipeline_final

Tools and Technologies Used

Looker Studio Report example

pipeline_flow3
