For more details about this project please refer to my article where I explain the motivations and how to recreate it by yourself.
├── docker-compose.yml
├── Dockerfile
├── confs
│ ├── config
│ ├── core-site.xml
│ ├── hdfs-site.xml
│ ├── mapred-site.xml
│ ├── requirements.req
│ ├── slaves
│ ├── spark-defaults.conf
│ └── yarn-site.xml
├── datasets
│ ├── alice_in_wonderland.txt
│ └── iris.csv
├── notebooks
│ ├── Bash-Interface.ipynb
│ ├── Dask-Yarn.ipynb
│ ├── Python-Spark.ipynb
│ └── Scala-Spark.ipynb
└── script_files
└── bootstrap.sh
docker build . -t cluster-base
docker-compose up -d
Access the Yarn resource manager UI using the following link : http://localhost:8088/cluster/nodes
Access Jupyter Notebook using this link : http://localhost:8888/
docker-compose down