This repository contains a production-grade ETL (Extract, Transform, Load) pipeline built with AWS Glue and Amazon Redshift. The pipeline reads a raw IMDb movie dataset stored in Amazon S3, runs data quality validation, routes each record according to its validation result, and loads the data into Amazon Redshift for advanced analytics.
Updated Jan 24, 2025 - Python
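
The validate-then-route step in the description can be illustrated with a minimal plain-Python sketch. It is not the repository's Glue job: the column names (`title`, `year`, `rating`), the thresholds, and the helpers `is_valid` and `route` are all hypothetical, chosen only to show how records might be split into a "passed" set bound for the warehouse and a "failed" set held back for review.

```python
from typing import Any, Dict, Iterable, List, Tuple

Record = Dict[str, Any]

# Hypothetical required columns; the real dataset schema is not shown in the source.
REQUIRED_FIELDS = ("title", "year", "rating")


def is_valid(record: Record) -> bool:
    """Return True if the record passes these illustrative quality checks."""
    # Completeness check: every required field must be present and non-empty.
    if any(record.get(field) in (None, "") for field in REQUIRED_FIELDS):
        return False
    # Type check: year and rating must parse as numbers.
    try:
        year = int(record["year"])
        rating = float(record["rating"])
    except (TypeError, ValueError):
        return False
    # Range check: assumed bounds, not taken from the repository.
    return year >= 1888 and 0.0 <= rating <= 10.0


def route(records: Iterable[Record]) -> Tuple[List[Record], List[Record]]:
    """Split records into (passed, failed) according to is_valid."""
    passed: List[Record] = []
    failed: List[Record] = []
    for record in records:
        (passed if is_valid(record) else failed).append(record)
    return passed, failed


if __name__ == "__main__":
    sample = [
        {"title": "Example A", "year": "1999", "rating": "8.7"},
        {"title": "", "year": "2001", "rating": "7.0"},
        {"title": "Example B", "year": "2010", "rating": "eleven"},
    ]
    passed, failed = route(sample)
    print(len(passed), "passed;", len(failed), "failed")
```

In a pipeline like the one described, the passed records would continue to the Redshift load while the failed ones would typically be written to a separate location for inspection; where each set actually goes in this repository is not stated in the description.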