lake House Architecture for data transformation and migration using various Azure services
-
Updated
Jul 8, 2024 - Jupyter Notebook
lake House Architecture for data transformation and migration using various Azure services
This project demonstrates a complete ETL pipeline for Formula 1 racing data using Azure Databricks, Delta Lake, and Azure Data Factory. It covers data ingestion, transformation with PySpark and Spark SQL, data governance with Unity Catalog, and visualization through Power BI. Designed to showcase real-world data engineering workflows in Azure.
Add a description, image, and links to the azure-data-lake-storage-gen2 topic page so that developers can more easily learn about it.
To associate your repository with the azure-data-lake-storage-gen2 topic, visit your repo's landing page and select "manage topics."