Complete source code for a Spark batch workflow (task 1) and a Spark Structured Streaming workflow with Kafka sources and sinks (task 2). Includes shell scripts for running the code under YARN and for creating the Kafka topics. The workflows are designed to run on a 3-node cluster in AWS.
markcberman/CS598_CCC_SRC_CODE