-
Notifications
You must be signed in to change notification settings - Fork 117
Docs improvements #176
Docs improvements #176
Changes from 1 commit
0094929
b9913c2
bcb779b
5c7e787
0abf312
37239f3
d9f4eb9
738f791
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
- Loading branch information
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,25 @@ | ||
| --- | ||
| layout: global | ||
| title: Running Spark on the cloud with Kubernetes | ||
| --- | ||
|
|
||
| For general information about running Spark on Kubernetes, refer to [this section](running-on-kubernetes.md). | ||
|
|
||
| A Kubernetes cluster may be brought up on different cloud providers or on premise. It is commonly provisioned through [Google Container Engine](https://cloud.google.com/container-engine/), or using [kops](https://github.com/kubernetes/kops) on AWS, or on premise using [kubeadm](https://kubernetes.io/docs/getting-started-guides/kubeadm/). | ||
|
|
||
| ## Running on Google Container Engine (GKE) | ||
|
|
||
| * Create a GKE [container cluster](https://cloud.google.com/container-engine/docs/clusters/operations). | ||
| * Find the name of the master associated with this project. | ||
|
|
||
| > kubectl cluster-info | ||
| Kubernetes master is running at https://x.y.z.w:443 | ||
| * Run spark-submit with the master option set to `k8s://https://x.y.z.w:443`. The instructions for running spark-submit are provided in the [running on kubernetes](running-on-kubernetes.md) tutorial. | ||
| * Check that your driver pod, and subsequently your executor pods are launched using `kubectl get pods`. | ||
| * Read the stdout and stderr of the driver pod using `kubectl get logs`. | ||
|
||
|
|
||
| Known issues: | ||
| * If you face OAuth token expiry errors when you run spark-submit, it is likely because the token needs to be refreshed. The easiest way to fix this is to run any `kubectl` command, say, `kubectl version` and then retry your submission. | ||
|
|
||
|
|
||
|
|
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -89,6 +89,8 @@ the submitting machine, and are uploaded to the driver running in Kubernetes bef | |
|
|
||
| ### Accessing Kubernetes Clusters | ||
|
|
||
| For details about running on public cloud environments, such as Google Container Engine (GKE), please refer to [our documentation](running-on-kubernetes-cloud.md). | ||
|
||
|
|
||
| Spark-submit also supports submission through the | ||
| [local kubectl proxy](https://kubernetes.io/docs/user-guide/accessing-the-cluster/#using-kubectl-proxy). One can use the | ||
| authenticating proxy to communicate with the api server directly without passing credentials to spark-submit. | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit:
https://<master-ip>:443