3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow
airflow cloud sql spark nosql amazon-emr s3-bucket data-warehouse amazon-redshift data-pipeline normalization yelp-dataset 3nf dimensional-tables data-marts etl-process
-
Updated
Aug 17, 2019 - Jupyter Notebook