Skip to content

This project implements an ETL (Extract - Transform - Load) data pipeline with the Seattle public datasets on Fire 911 calls and Crime, using Apache Airflow (orchestration), PySpark/GCP Dataproc Serverless (transformation) and GCP Cloud Storage/BIgQuery (storage)

Notifications You must be signed in to change notification settings

spark0698/seattle-fire-and-crime

About

This project implements an ETL (Extract - Transform - Load) data pipeline with the Seattle public datasets on Fire 911 calls and Crime, using Apache Airflow (orchestration), PySpark/GCP Dataproc Serverless (transformation) and GCP Cloud Storage/BIgQuery (storage)

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published