spark-cluster-mode-on-k8s
https://spark.apache.org/docs/latest/cluster-overview.html
https://spark.apache.org/docs/latest/cluster-overview.html#glossary
https://github.com/GoogleCloudPlatform/spark-on-k8s-operator/blob/master/examples/spark-pi.yaml
https://github.com/GoogleCloudPlatform/spark-on-k8s-operator/blob/master/examples/spark-py-pi.yaml
KUBERNETES_SERVICE_HOST
KUBERNETES_SERVICE_PORT
1 | --class=... |
SPARK_HOME/bin/spark-submit args
Spark-on-k8s-operator controller run the spark-submit
scripts
https://spark.apache.org/docs/2.3.1/running-on-kubernetes.html
https://spark.apache.org/docs/2.3.1/running-on-kubernetes.html#cluster-mode
1 | $ bin/spark-submit \ |
https://spark.apache.org/docs/2.3.1/running-on-kubernetes.html#dependency-management
client mode is not supported in 2.3.1 (cluster manager)
but it seems work in 2.4
https://spark.apache.org/docs/latest/running-on-kubernetes.html
Dependencies Jars and Files
logs
1 | kubectl -n=<namespace> logs -f <driver-pod-name> |
https://github.com/GoogleCloudPlatform/spark-on-k8s-operator/blob/master/docs/design.md