Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
-
Updated
Sep 1, 2022 - Python
Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
Apache Hudi examples designed to be run on AWS Elastic Map Reduce (EMR) via. EMR Studio or EMR Notebooks
Apache Icebery examples designed to be run on AWS Elastic Map Reduce (EMR) via. EMR Studio or EMR Notebooks
Deltalake examples designed to be run on AWS Elastic Map Reduce (EMR) via. EMR Studio or EMR Notebooks
EMR Notebooks and SageMaker using Terraform.
PennBook is a highly scalable implementation of the core functionalities of facebook.com. It uses a Node.js server, React.js for the frontend, and Hadoop libraries such Apache Spark along with AWS Elastic MapReduce for the Big Data functionalities.
Hive Workshop using CloudFormation.
Pig Workshop using CloudFormation.
Cluster Creation using Terraform.
Hudi Workshop using Terraform.
Hudi Workshop using CloudFormation.
EMR Notebooks and SageMaker using CloudFormation.
Spark-based ETL using Terraform.
Orchestrating Amazon EMR with AWS StepFunctions using Terraform.
EMR Managed Scaling using CloudFormation.
Orchestrating Amazon EMR with AWS StepFunctions using CloudFormation.
Presto Workshop using Terraform.
EMR Managed Scaling using Terraform.
Pig Workshop using Terraform.
Presto Workshop using CloudFormation.
Add a description, image, and links to the elastic-map-reduce topic page so that developers can more easily learn about it.
To associate your repository with the elastic-map-reduce topic, visit your repo's landing page and select "manage topics."