Tokyo Olympic Azure Data Engineering Project

Overview

This project builds a data engineering pipeline for Tokyo Olympic Games data on Azure, using Data Lake Gen2, Data Factory, Databricks, and Synapse Analytics. The pipeline ingests, transforms, and analyzes the data so that it can be queried for insights about the Olympic events.

Technologies Used

Azure Data Lake Gen2: Storage for raw and processed data.
Azure Data Factory: Orchestration and automation of data workflows.
Azure Databricks: Advanced analytics and data transformation.
Azure Synapse Analytics: Data warehousing and analytics.
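
As a small illustration of how the storage layer is typically addressed from Databricks or Synapse, the sketch below builds `abfss://` URIs for a raw area and a processed area in Data Lake Gen2. The storage account, container, and folder names are placeholders invented for this example and are not taken from this repository.

```python
def adls_uri(account: str, container: str, path: str = "") -> str:
    """Build an abfss:// URI for a path inside an ADLS Gen2 container."""
    base = f"abfss://{container}@{account}.dfs.core.windows.net"
    clean = path.strip("/")  # tolerate leading or trailing slashes
    return f"{base}/{clean}" if clean else base


# Hypothetical names; replace with your own storage account and container.
ACCOUNT = "examplestorageacct"
CONTAINER = "tokyo-olympic-data"

RAW_URI = adls_uri(ACCOUNT, CONTAINER, "raw-data")
PROCESSED_URI = adls_uri(ACCOUNT, CONTAINER, "transformed-data")

if __name__ == "__main__":
    print(RAW_URI)
    print(PROCESSED_URI)
```

Keeping raw and processed data under separate folders of the same container is one common layout; separate containers would work equally well with the same helper.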

Project Structure

Data Ingestion: Raw data from various sources is ingested into Data Lake Gen2.
ETL Pipeline: Azure Data Factory processes and transforms the data, producing curated datasets.
Advanced Analytics: More complex analytics and transformations run in Azure Databricks.
Data Warehousing: Synapse Analytics provides scalable data warehousing and efficient querying.
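
The raw-to-curated step described above can be sketched in a few lines. In the project this work runs in Data Factory and Databricks; the example below uses only the Python standard library so that it runs anywhere, and it is an illustration of the idea rather than the project's actual transformation. The column names and the sample rows are made up for the example.

```python
import csv
import io

# Made-up sample standing in for a raw CSV file in the data lake.
RAW_CSV = """Team,Gold,Silver,Bronze
Team A,3,1,2
Team B,0,4,1
Team C,2,2,
"""


def to_int(value: str) -> int:
    """Cast a raw CSV field to int, treating blank fields as zero."""
    value = value.strip()
    return int(value) if value else 0


def curate(raw_text: str) -> list[dict]:
    """Cast medal columns to integers, add a Total column, and sort."""
    rows = []
    for record in csv.DictReader(io.StringIO(raw_text)):
        gold = to_int(record["Gold"])
        silver = to_int(record["Silver"])
        bronze = to_int(record["Bronze"])
        rows.append({
            "Team": record["Team"].strip(),
            "Gold": gold,
            "Silver": silver,
            "Bronze": bronze,
            "Total": gold + silver + bronze,
        })
    # Order by gold medals first, then by total medals, highest first.
    rows.sort(key=lambda r: (r["Gold"], r["Total"]), reverse=True)
    return rows


if __name__ == "__main__":
    for row in curate(RAW_CSV):
        print(row)
```

Explicit type casting and handling of blank fields are the kind of cleanup that usually separates a raw file from a curated dataset that a warehouse can load directly.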

Setup Instructions

Azure Account: Ensure you have an active Azure account.
Azure Resources: Create the necessary Azure resources: Data Lake Gen2, Data Factory, Databricks, and Synapse Analytics.
Configuration: Update the configuration files with your Azure credentials and project-specific details.
Run Pipelines: Execute the Data Factory pipelines for ETL, monitor the Databricks jobs, and query the results in Synapse Analytics.
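
For the configuration step, one option is to read credentials from environment variables instead of writing them into files that might be committed. The following is a minimal sketch under that assumption. The `AzureSettings` class, the `load_settings` function, and the environment variable names are all hypothetical and chosen for this example; they are not defined anywhere in this repository.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class AzureSettings:
    tenant_id: str
    client_id: str
    client_secret: str
    storage_account: str


# Hypothetical variable names chosen for this example.
ENV_NAMES = {
    "tenant_id": "AZURE_TENANT_ID",
    "client_id": "AZURE_CLIENT_ID",
    "client_secret": "AZURE_CLIENT_SECRET",
    "storage_account": "STORAGE_ACCOUNT_NAME",
}


def load_settings(env=None) -> AzureSettings:
    """Read settings from a mapping (default: os.environ), failing early."""
    env = os.environ if env is None else env
    missing = [name for name in ENV_NAMES.values() if not env.get(name)]
    if missing:
        raise ValueError("Missing settings: " + ", ".join(sorted(missing)))
    return AzureSettings(**{field: env[name] for field, name in ENV_NAMES.items()})


if __name__ == "__main__":
    try:
        settings = load_settings()
        print("Loaded settings for storage account:", settings.storage_account)
    except ValueError as exc:
        print(exc)
```

Failing early with a list of every missing variable is a deliberate choice here: it surfaces an incomplete setup before any pipeline is started.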

Usage

Follow the documentation provided in the 'docs' directory for detailed instructions on setting up, running, and maintaining the project.
For any issues or inquiries, refer to the 'issues' section in this repository.

Contribution

Contributions are welcome! Please follow the guidelines in the 'CONTRIBUTING.md' file.

License

This project is licensed under the MIT License.

Feel free to reach out for any questions or clarifications.

Happy coding!


ayanhussain81/Olympics-Data-ETL