A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
-
Updated Nov 3, 2017 - Jupyter Notebook
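For context, here is a minimal sketch (not the guide's exact code) of the S3 I/O such a setup enables from a notebook. It assumes Spark was started with the hadoop-aws (s3a) connector available; the bucket, paths, and credential values are placeholders.

```python
# Minimal sketch: PySpark S3 I/O via the s3a connector.
# Assumes Spark was launched with the hadoop-aws package, e.g.
#   pyspark --packages org.apache.hadoop:hadoop-aws:<version>
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("s3-io-example")
    # On EC2 the credentials can come from the instance profile instead;
    # the values below are placeholders, not real keys.
    .config("spark.hadoop.fs.s3a.access.key", "YOUR_ACCESS_KEY")
    .config("spark.hadoop.fs.s3a.secret.key", "YOUR_SECRET_KEY")
    .getOrCreate()
)

# Read a CSV from S3 and write it back as Parquet (bucket and paths are placeholders).
df = spark.read.csv("s3a://your-bucket/input/data.csv", header=True, inferSchema=True)
df.write.mode("overwrite").parquet("s3a://your-bucket/output/")
```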
Command line interface for spark cluster management app
This project provides end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The Spark cluster is set up within a Docker container on Azure.
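As a sketch of the PySpark-to-Plotly handoff such a pipeline involves (not the project's actual code): aggregate in Spark, collect the small result to pandas, then plot. The file path and column names below are illustrative placeholders.

```python
# Hedged sketch: aggregate with PySpark, plot with Plotly.
from pyspark.sql import SparkSession
import plotly.express as px

spark = SparkSession.builder.appName("visa-viz").getOrCreate()

# Aggregate in Spark, then collect the small result to pandas for plotting.
# "visa_numbers.csv", "year", and "visas_issued" are placeholder names.
agg = (
    spark.read.csv("visa_numbers.csv", header=True, inferSchema=True)
    .groupBy("year")
    .sum("visas_issued")
    .withColumnRenamed("sum(visas_issued)", "visas_issued")
    .toPandas()
)

fig = px.line(agg, x="year", y="visas_issued", title="Visas issued per year")
fig.show()
```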
📓 Repository/tutorial for initializing Jupyter Notebook and a Spark cluster on Amazon EMR
Local Kubernetes-based ML setup
A Python library to submit Spark jobs to YARN clusters on different distributions (currently CDH and HDP)
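As an illustration of what such a library automates, here is a hedged sketch of wrapping spark-submit for YARN from Python. It assumes spark-submit is on PATH and that HADOOP_CONF_DIR/YARN_CONF_DIR point at the cluster configuration (typical on CDH/HDP edge nodes); the function name and arguments are illustrative, not the library's actual API.

```python
# Hedged sketch: one way to wrap spark-submit for a YARN cluster.
import subprocess

def submit_to_yarn(app_path, app_args=None, deploy_mode="cluster", conf=None):
    """Build and run a spark-submit command against a YARN cluster."""
    cmd = ["spark-submit", "--master", "yarn", "--deploy-mode", deploy_mode]
    for key, value in (conf or {}).items():
        cmd += ["--conf", f"{key}={value}"]
    cmd.append(app_path)
    cmd += app_args or []
    # check=True raises CalledProcessError if the submission fails.
    return subprocess.run(cmd, check=True)

# Example invocation (paths and settings are placeholders):
# submit_to_yarn("my_job.py", ["--date", "2017-11-03"],
#                conf={"spark.executor.memory": "4g"})
```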
This project creates a Hadoop and Spark cluster on AWS with Terraform
Performing various product review analyses on an Amazon dataset using Apache Spark and MongoDB
A collection of scripts to easily start HDFS and Spark clusters
Research on how to set up and use a Spark Standalone multi-node cluster.
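A minimal sketch of connecting a PySpark driver to such a standalone multi-node cluster; the host name "spark-master" and the memory setting are placeholders, while 7077 is the default standalone master port.

```python
# Hedged sketch: point a PySpark driver at a standalone cluster master.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .master("spark://spark-master:7077")   # placeholder host, default port
    .appName("standalone-cluster-check")
    .config("spark.executor.memory", "2g")
    .getOrCreate()
)

# A trivial distributed job to confirm the executors are reachable.
print(spark.sparkContext.parallelize(range(1000)).sum())
spark.stop()
```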
Spark on Kubernetes PoCs
Template for Spark Data Science Projects
Work done on AWS; gathers the steps for creating a Spark cluster on EC2.
Docker image to deploy a Spark cluster in containers
Spark cluster management with Docker
Terraform module to create Azure HDInsight, a managed, full-spectrum, open-source analytics service. This module creates Apache Hadoop, Apache Spark, Apache HBase, Interactive Query (Apache Hive LLAP), and Apache Kafka clusters.