This chart used to deploy Big Data environment in Kubernetes Cluster. whether on local or on production state.
This chart will do the following:
- 1 x Spark Master with port 8080 exposed on an external LoadBalancer
- 3 x Spark Workers with HorizontalPodAutoscaler to scale to max 10 pods when CPU hits 50% of 100m
- 1 x Hadoop Datanode
- 1 x Hadoop NameNode
- 1 x Yarn Resource Manager
- 1 x Yarn NodeManager
- All using Kubernetes Deployments
Alternatively, a YAML file that specifies the values for the parameters can be provided while installing the chart. For example,
$ helm package .
$ helm install gamabox--name gamabox -f values.yaml
Tip: You can use the default values.yaml