307 lines
15 KiB
Markdown
307 lines
15 KiB
Markdown
---
|
||
layout: blog
|
||
title: 'KubeDirector:在 Kubernetes 上运行复杂状态应用程序的简单方法'
|
||
date: 2018-10-03
|
||
---
|
||
|
||
<!--
|
||
layout: blog
|
||
title: 'KubeDirector: The easy way to run complex stateful applications on Kubernetes'
|
||
date: 2018-10-03
|
||
-->
|
||
|
||
<!--
|
||
**Author**: Thomas Phelan (BlueData)
|
||
-->
|
||
|
||
**作者**:Thomas Phelan(BlueData)
|
||
|
||
<!--
|
||
KubeDirector is an open source project designed to make it easy to run complex stateful scale-out application clusters on Kubernetes. KubeDirector is built using the custom resource definition (CRD) framework and leverages the native Kubernetes API extensions and design philosophy. This enables transparent integration with Kubernetes user/resource management as well as existing clients and tools.
|
||
-->
|
||
KubeDirector 是一个开源项目,旨在简化在 Kubernetes 上运行复杂的有状态扩展应用程序集群。KubeDirector 使用自定义资源定义(CRD)
|
||
框架构建,并利用了本地 Kubernetes API 扩展和设计哲学。这支持与 Kubernetes 用户/资源 管理以及现有客户端和工具的透明集成。
|
||
|
||
<!--
|
||
We recently [introduced the KubeDirector project](https://medium.com/@thomas_phelan/operation-stateful-introducing-bluek8s-and-kubernetes-director-aa204952f619/), as part of a broader open source Kubernetes initiative we call BlueK8s. I’m happy to announce that the pre-alpha
|
||
code for [KubeDirector](https://github.com/bluek8s/kubedirector/) is now available. And in this blog post, I’ll show how it works.
|
||
-->
|
||
我们最近[介绍了 KubeDirector 项目](https://medium.com/@thomas_phelan/operation-stateful-introducing-bluek8s-and-kubernetes-director-aa204952f619/),作为我们称为 BlueK8s 的更广泛的 Kubernetes 开源项目的一部分。我很高兴地宣布 [KubeDirector](https://github.com/bluek8s/kubedirector/) 的
|
||
pre-alpha 代码现在已经可用。在这篇博客文章中,我将展示它是如何工作的。
|
||
|
||
<!--
|
||
KubeDirector provides the following capabilities:
|
||
-->
|
||
KubeDirector 提供以下功能:
|
||
|
||
<!--
|
||
* The ability to run non-cloud native stateful applications on Kubernetes without modifying the code. In other words, it’s not necessary to decompose these existing applications to fit a microservices design pattern.
|
||
* Native support for preserving application-specific configuration and state.
|
||
* An application-agnostic deployment pattern, minimizing the time to onboard new stateful applications to Kubernetes.
|
||
-->
|
||
|
||
* 无需修改代码即可在 Kubernetes 上运行非云原生有状态应用程序。换句话说,不需要分解这些现有的应用程序来适应微服务设计模式。
|
||
* 本机支持保存特定于应用程序的配置和状态。
|
||
* 与应用程序无关的部署模式,最大限度地减少将新的有状态应用程序装载到 Kubernetes 的时间。
|
||
|
||
<!--
|
||
KubeDirector enables data scientists familiar with data-intensive distributed applications such as Hadoop, Spark, Cassandra, TensorFlow, Caffe2, etc. to run these applications on Kubernetes -- with a minimal learning curve and no need to write GO code. The applications controlled by KubeDirector are defined by some basic metadata and an associated package of configuration artifacts. The application metadata is referred to as a KubeDirectorApp resource.
|
||
-->
|
||
KubeDirector 使熟悉数据密集型分布式应用程序(如 Hadoop、Spark、Cassandra、TensorFlow、Caffe2 等)的数据科学家能够在 Kubernetes 上运行这些应用程序 -- 只需极少的学习曲线,无需编写 GO 代码。由 KubeDirector 控制的应用程序由一些基本元数据和相关的配置工件包定义。应用程序元数据称为 KubeDirectorApp 资源。
|
||
|
||
<!--
|
||
To understand the components of KubeDirector, clone the repository on [GitHub](https://github.com/bluek8s/kubedirector/) using a command similar to:
|
||
-->
|
||
要了解 KubeDirector 的组件,请使用类似于以下的命令在 [GitHub](https://github.com/bluek8s/kubedirector/) 上克隆存储库:
|
||
|
||
```
|
||
git clone http://<userid>@github.com/bluek8s/kubedirector.
|
||
```
|
||
|
||
<!--
|
||
The KubeDirectorApp definition for the Spark 2.2.1 application is located
|
||
in the file `kubedirector/deploy/example_catalog/cr-app-spark221e2.json`.
|
||
-->
|
||
Spark 2.2.1 应用程序的 KubeDirectorApp 定义位于文件 `kubedirector/deploy/example_catalog/cr-app-spark221e2.json` 中。
|
||
|
||
```
|
||
~> cat kubedirector/deploy/example_catalog/cr-app-spark221e2.json
|
||
{
|
||
"apiVersion": "kubedirector.bluedata.io/v1alpha1",
|
||
"kind": "KubeDirectorApp",
|
||
"metadata": {
|
||
"name" : "spark221e2"
|
||
},
|
||
"spec" : {
|
||
"systemctlMounts": true,
|
||
"config": {
|
||
"node_services": [
|
||
{
|
||
"service_ids": [
|
||
"ssh",
|
||
"spark",
|
||
"spark_master",
|
||
"spark_worker"
|
||
],
|
||
…
|
||
```
|
||
|
||
<!--
|
||
The configuration of an application cluster is referred to as a KubeDirectorCluster resource. The
|
||
KubeDirectorCluster definition for a sample Spark 2.2.1 cluster is located in the file
|
||
`kubedirector/deploy/example_clusters/cr-cluster-spark221.e1.yaml`.
|
||
-->
|
||
应用程序集群的配置称为 KubeDirectorCluster 资源。示例 Spark 2.2.1 集群的 KubeDirectorCluster 定义位于文件
|
||
`kubedirector/deploy/example_clusters/cr-cluster-spark221.e1.yaml` 中。
|
||
|
||
```
|
||
~> cat kubedirector/deploy/example_clusters/cr-cluster-spark221.e1.yaml
|
||
apiVersion: "kubedirector.bluedata.io/v1alpha1"
|
||
kind: "KubeDirectorCluster"
|
||
metadata:
|
||
name: "spark221e2"
|
||
spec:
|
||
app: spark221e2
|
||
roles:
|
||
- name: controller
|
||
replicas: 1
|
||
resources:
|
||
requests:
|
||
memory: "4Gi"
|
||
cpu: "2"
|
||
limits:
|
||
memory: "4Gi"
|
||
cpu: "2"
|
||
- name: worker
|
||
replicas: 2
|
||
resources:
|
||
requests:
|
||
memory: "4Gi"
|
||
cpu: "2"
|
||
limits:
|
||
memory: "4Gi"
|
||
cpu: "2"
|
||
- name: jupyter
|
||
…
|
||
```
|
||
|
||
<!--
|
||
## Running Spark on Kubernetes with KubeDirector
|
||
-->
|
||
|
||
## 使用 KubeDirector 在 Kubernetes 上运行 Spark
|
||
|
||
<!--
|
||
With KubeDirector, it’s easy to run Spark clusters on Kubernetes.
|
||
-->
|
||
使用 KubeDirector,可以轻松在 Kubernetes 上运行 Spark 集群。
|
||
|
||
<!--
|
||
First, verify that Kubernetes (version 1.9 or later) is running, using the command `kubectl version`
|
||
-->
|
||
首先,使用命令 `kubectl version` 验证 Kubernetes(版本 1.9 或更高)是否正在运行
|
||
|
||
```
|
||
~> kubectl version
|
||
Client Version: version.Info{Major:"1", Minor:"11", GitVersion:"v1.11.3", GitCommit:"a4529464e4629c21224b3d52edfe0ea91b072862", GitTreeState:"clean", BuildDate:"2018-09-09T18:02:47Z", GoVersion:"go1.10.3", Compiler:"gc", Platform:"linux/amd64"}
|
||
Server Version: version.Info{Major:"1", Minor:"11", GitVersion:"v1.11.3", GitCommit:"a4529464e4629c21224b3d52edfe0ea91b072862", GitTreeState:"clean", BuildDate:"2018-09-09T17:53:03Z", GoVersion:"go1.10.3", Compiler:"gc", Platform:"linux/amd64"}
|
||
```
|
||
|
||
<!--
|
||
Deploy the KubeDirector service and the example KubeDirectorApp resource definitions with the commands:
|
||
-->
|
||
使用以下命令部署 KubeDirector 服务和示例 KubeDirectorApp 资源定义:
|
||
|
||
```
|
||
cd kubedirector
|
||
make deploy
|
||
```
|
||
|
||
<!--
|
||
These will start the KubeDirector pod:
|
||
-->
|
||
这些将启动 KubeDirector pod:
|
||
|
||
```
|
||
~> kubectl get pods
|
||
NAME READY STATUS RESTARTS AGE
|
||
kubedirector-58cf59869-qd9hb 1/1 Running 0 1m
|
||
```
|
||
|
||
<!--
|
||
List the installed KubeDirector applications with `kubectl get KubeDirectorApp`
|
||
-->
|
||
`kubectl get KubeDirectorApp` 列出中已安装的 KubeDirector 应用程序
|
||
|
||
```
|
||
~> kubectl get KubeDirectorApp
|
||
NAME AGE
|
||
cassandra311 30m
|
||
spark211up 30m
|
||
spark221e2 30m
|
||
```
|
||
|
||
<!--
|
||
Now you can launch a Spark 2.2.1 cluster using the example KubeDirectorCluster file and the
|
||
`kubectl create -f deploy/example_clusters/cr-cluster-spark211up.yaml` command.
|
||
Verify that the Spark cluster has been started:
|
||
-->
|
||
现在,您可以使用示例 KubeDirectorCluster 文件和 `kubectl create -f deploy/example_clusters/cr-cluster-spark211up.yaml` 命令
|
||
启动 Spark 2.2.1 集群。验证 Spark 集群已经启动:
|
||
|
||
```
|
||
~> kubectl get pods
|
||
NAME READY STATUS RESTARTS AGE
|
||
kubedirector-58cf59869-djdwl 1/1 Running 0 19m
|
||
spark221e2-controller-zbg4d-0 1/1 Running 0 23m
|
||
spark221e2-jupyter-2km7q-0 1/1 Running 0 23m
|
||
spark221e2-worker-4gzbz-0 1/1 Running 0 23m
|
||
spark221e2-worker-4gzbz-1 1/1 Running 0 23m
|
||
```
|
||
|
||
<!--
|
||
The running services now include the Spark services:
|
||
-->
|
||
现在运行的服务包括 Spark 服务:
|
||
|
||
```
|
||
~> kubectl get service
|
||
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
|
||
kubedirector ClusterIP 10.98.234.194 <none> 60000/TCP 1d
|
||
kubernetes ClusterIP 10.96.0.1 <none> 443/TCP 1d
|
||
svc-spark221e2-5tg48 ClusterIP None <none> 8888/TCP 21s
|
||
svc-spark221e2-controller-tq8d6-0 NodePort 10.104.181.123 <none> 22:30534/TCP,8080:31533/TCP,7077:32506/TCP,8081:32099/TCP 20s
|
||
svc-spark221e2-jupyter-6989v-0 NodePort 10.105.227.249 <none> 22:30632/TCP,8888:30355/TCP 20s
|
||
svc-spark221e2-worker-d9892-0 NodePort 10.107.131.165 <none> 22:30358/TCP,8081:32144/TCP 20s
|
||
svc-spark221e2-worker-d9892-1 NodePort 10.110.88.221 <none> 22:30294/TCP,8081:31436/TCP 20s
|
||
```
|
||
|
||
<!--
|
||
Pointing the browser at port 31533 connects to the Spark Master UI:
|
||
-->
|
||
将浏览器指向端口 31533 连接到 Spark 主节点 UI:
|
||
|
||

|
||
|
||
<!--
|
||
That’s all there is to it!
|
||
In fact, in the example above we also deployed a Jupyter notebook along with the Spark cluster.
|
||
-->
|
||
就是这样!
|
||
事实上,在上面的例子中,我们还部署了一个 Jupyter notebook 和 Spark 集群。
|
||
|
||
<!--
|
||
To start another application (e.g. Cassandra), just specify another KubeDirectorApp file:
|
||
-->
|
||
要启动另一个应用程序(例如 Cassandra),只需指定另一个 KubeDirectorApp 文件:
|
||
|
||
```
|
||
kubectl create -f deploy/example_clusters/cr-cluster-cassandra311.yaml
|
||
```
|
||
|
||
<!--
|
||
See the running Cassandra cluster:
|
||
-->
|
||
查看正在运行的 Cassandra 集群:
|
||
|
||
```
|
||
~> kubectl get pods
|
||
NAME READY STATUS RESTARTS AGE
|
||
cassandra311-seed-v24r6-0 1/1 Running 0 1m
|
||
cassandra311-seed-v24r6-1 1/1 Running 0 1m
|
||
cassandra311-worker-rqrhl-0 1/1 Running 0 1m
|
||
cassandra311-worker-rqrhl-1 1/1 Running 0 1m
|
||
kubedirector-58cf59869-djdwl 1/1 Running 0 1d
|
||
spark221e2-controller-tq8d6-0 1/1 Running 0 22m
|
||
spark221e2-jupyter-6989v-0 1/1 Running 0 22m
|
||
spark221e2-worker-d9892-0 1/1 Running 0 22m
|
||
spark221e2-worker-d9892-1 1/1 Running 0 22m
|
||
```
|
||
|
||
<!--
|
||
Now you have a Spark cluster (with a Jupyter notebook) and a Cassandra cluster running on Kubernetes.
|
||
Use `kubectl get service` to see the set of services.
|
||
-->
|
||
现在,您有一个 Spark 集群(带有 Jupyter notebook )和一个运行在 Kubernetes 上的 Cassandra 集群。
|
||
使用 `kubectl get service` 查看服务集。
|
||
|
||
```
|
||
~> kubectl get service
|
||
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
|
||
kubedirector ClusterIP 10.98.234.194 <none> 60000/TCP 1d
|
||
kubernetes ClusterIP 10.96.0.1 <none> 443/TCP 1d
|
||
svc-cassandra311-seed-v24r6-0 NodePort 10.96.94.204 <none> 22:31131/TCP,9042:30739/TCP 3m
|
||
svc-cassandra311-seed-v24r6-1 NodePort 10.106.144.52 <none> 22:30373/TCP,9042:32662/TCP 3m
|
||
svc-cassandra311-vhh29 ClusterIP None <none> 8888/TCP 3m
|
||
svc-cassandra311-worker-rqrhl-0 NodePort 10.109.61.194 <none> 22:31832/TCP,9042:31962/TCP 3m
|
||
svc-cassandra311-worker-rqrhl-1 NodePort 10.97.147.131 <none> 22:31454/TCP,9042:31170/TCP 3m
|
||
svc-spark221e2-5tg48 ClusterIP None <none> 8888/TCP 24m
|
||
svc-spark221e2-controller-tq8d6-0 NodePort 10.104.181.123 <none> 22:30534/TCP,8080:31533/TCP,7077:32506/TCP,8081:32099/TCP 24m
|
||
svc-spark221e2-jupyter-6989v-0 NodePort 10.105.227.249 <none> 22:30632/TCP,8888:30355/TCP 24m
|
||
svc-spark221e2-worker-d9892-0 NodePort 10.107.131.165 <none> 22:30358/TCP,8081:32144/TCP 24m
|
||
svc-spark221e2-worker-d9892-1 NodePort 10.110.88.221 <none> 22:30294/TCP,8081:31436/TCP 24m
|
||
```
|
||
|
||
<!--
|
||
## Get Involved
|
||
-->
|
||
|
||
## 参与其中
|
||
|
||
<!--
|
||
KubeDirector is a fully open source, Apache v2 licensed, project – the first of multiple open source projects within a broader initiative we call BlueK8s.
|
||
The pre-alpha code for KubeDirector has just been released and we would love for you to join the growing community of developers, contributors, and adopters.
|
||
Follow [@BlueK8s](https://twitter.com/BlueK8s/) on Twitter and get involved through these channels:
|
||
-->
|
||
KubeDirector 是一个完全开放源码的 Apache v2 授权项目 – 在我们称为 BlueK8s 的更广泛的计划中,它是多个开放源码项目中的第一个。
|
||
KubeDirector 的 pre-alpha 代码刚刚发布,我们希望您加入到不断增长的开发人员、贡献者和使用者社区。
|
||
在 Twitter 上关注 [@BlueK8s](https://twitter.com/BlueK8s/),并通过以下渠道参与:
|
||
|
||
<!--
|
||
* KubeDirector [chat room on Slack](https://join.slack.com/t/bluek8s/shared_invite/enQtNDUwMzkwODY5OTM4LTRhYmRmZmE4YzY3OGUzMjA1NDg0MDVhNDQ2MGNkYjRhM2RlMDNjMTI1NDQyMjAzZGVlMDFkNThkNGFjZGZjMGY/)
|
||
* KubeDirector [GitHub repo](https://github.com/bluek8s/kubedirector/)
|
||
-->
|
||
|
||
* KubeDirector [Slack 聊天室](https://join.slack.com/t/bluek8s/shared_invite/enQtNDUwMzkwODY5OTM4LTRhYmRmZmE4YzY3OGUzMjA1NDg0MDVhNDQ2MGNkYjRhM2RlMDNjMTI1NDQyMjAzZGVlMDFkNThkNGFjZGZjMGY/)
|
||
* KubeDirector [GitHub 仓库](https://github.com/bluek8s/kubedirector/)
|