Complete source code for AWS based Spark Batch workflow and Spark Structured Streaming workflow with Kafka Sources and Sinks.

markcberman/CS598_CCC_SRC_CODE

CS598_CCC_SRC_CODE

Complete source code for a Spark batch workflow (task 1) and a Spark Structured Streaming workflow with Kafka sources and sinks (task 2). The repository includes shell scripts that run the code under YARN and shell scripts that create the Kafka topics. Both workflows are designed to run on a 3-node cluster in AWS.
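
As a rough illustration of what the YARN launch scripts and the Kafka topic scripts do, the following minimal sketch assembles the two kinds of command lines. It is not taken from the repository: the function names, the topic name, the broker address, the file name `streaming_job.py`, and the default partition and replication counts are all hypothetical placeholders. Only the `spark-submit` and `kafka-topics.sh` flags shown are standard options of those tools.

```python
import shlex
from typing import List, Optional


def kafka_create_topic_cmd(topic: str, bootstrap_server: str,
                           partitions: int = 3,
                           replication_factor: int = 3) -> List[str]:
    """Build a kafka-topics.sh command that creates one topic.

    The defaults of 3 are an assumption chosen to match a 3-node cluster.
    """
    return [
        "kafka-topics.sh", "--create",
        "--topic", topic,
        "--bootstrap-server", bootstrap_server,
        "--partitions", str(partitions),
        "--replication-factor", str(replication_factor),
    ]


def spark_submit_yarn_cmd(app: str,
                          deploy_mode: str = "cluster",
                          packages: Optional[List[str]] = None,
                          app_args: Optional[List[str]] = None) -> List[str]:
    """Build a spark-submit command that runs an application under YARN."""
    cmd = ["spark-submit", "--master", "yarn", "--deploy-mode", deploy_mode]
    if packages:
        # Maven coordinates, e.g. the Spark Kafka connector for a streaming job.
        cmd += ["--packages", ",".join(packages)]
    cmd.append(app)
    cmd.extend(app_args or [])
    return cmd


if __name__ == "__main__":
    # Hypothetical values; substitute the real topic, broker, and job file.
    print(shlex.join(kafka_create_topic_cmd("demo-topic", "localhost:9092")))
    print(shlex.join(spark_submit_yarn_cmd("streaming_job.py",
                                           app_args=["demo-topic"])))
```

Building the commands as argument lists keeps quoting explicit and lets the same list be passed directly to `subprocess.run` without going through a shell.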
