我一直在阅读 Alan Hohn 的《Kubernetes 之书》,并决定尝试让最后一章中描述的应用程序与 Flannel CNI 而不是 Calico 一起工作,这是应用程序默认使用的。但是 Longhorn 存储系统拒绝与 Flannel 一起工作。只有一个longhorn-manager和longhorn-csi-plugin pod 启动(longhorn-manager-c46fd
和longhorn-csi-plugin-nzw5v
),node02
在我提供的示例中,两者都在。longhorn -csi-plugin pod 正在尝试连接到主机系统上名为/csi/csi.sock
(的套接字/var/lib/kubelet/plugins/driver.longhorn.io/csi.sock
),该套接字仅在上可用host02
。
NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
csi-attacher-698d78f74c-5lwfz 0/1 CrashLoopBackOff 50 (4m57s ago) 46h 10.244.5.36 host06 <none> <none>
csi-attacher-698d78f74c-cqdp7 0/1 Error 53 (5m50s ago) 46h 10.244.3.31 host04 <none> <none>
csi-attacher-698d78f74c-t68pd 1/1 Running 51 (5m30s ago) 46h 10.244.4.30 host05 <none> <none>
csi-provisioner-67f89896c9-7twtp 1/1 Running 51 (5m22s ago) 46h 10.244.4.31 host05 <none> <none>
csi-provisioner-67f89896c9-9jlsk 0/1 CrashLoopBackOff 48 (4m39s ago) 46h 10.244.5.34 host06 <none> <none>
csi-provisioner-67f89896c9-tmxdq 0/1 Error 52 (5m42s ago) 46h 10.244.3.32 host04 <none> <none>
csi-resizer-7966f575f5-27s6b 1/1 Running 50 (5m25s ago) 46h 10.244.4.32 host05 <none> <none>
csi-resizer-7966f575f5-67wcq 0/1 CrashLoopBackOff 50 (4m50s ago) 46h 10.244.5.35 host06 <none> <none>
csi-resizer-7966f575f5-zcv8m 0/1 CrashLoopBackOff 51 (12s ago) 46h 10.244.3.35 host04 <none> <none>
csi-snapshotter-845c75b5b6-8rp7f 0/1 CrashLoopBackOff 49 (4m30s ago) 46h 10.244.5.32 host06 <none> <none>
csi-snapshotter-845c75b5b6-trswb 0/1 Error 51 (5m33s ago) 46h 10.244.3.33 host04 <none> <none>
csi-snapshotter-845c75b5b6-z2kzn 1/1 Running 49 (5m16s ago) 46h 10.244.4.33 host05 <none> <none>
engine-image-ei-b0369a5d-6ccqj 1/1 Running 3 (27m ago) 46h 10.244.0.15 host01 <none> <none>
engine-image-ei-b0369a5d-6x486 1/1 Running 3 (26m ago) 46h 10.244.2.15 host03 <none> <none>
engine-image-ei-b0369a5d-fgnns 1/1 Running 3 (25m ago) 46h 10.244.5.30 host06 <none> <none>
engine-image-ei-b0369a5d-qj2xv 1/1 Running 3 (26m ago) 46h 10.244.3.36 host04 <none> <none>
engine-image-ei-b0369a5d-vg5n2 1/1 Running 3 (27m ago) 46h 10.244.1.42 host02 <none> <none>
engine-image-ei-b0369a5d-wlcbv 1/1 Running 3 (25m ago) 46h 10.244.4.36 host05 <none> <none>
instance-manager-37f197c012829700a256d86ef4357100 1/1 Running 0 26m 10.244.1.47 host02 <none> <none>
longhorn-csi-plugin-58wgq 1/3 CrashLoopBackOff 105 (3m14s ago) 45h 10.244.3.29 host04 <none> <none>
longhorn-csi-plugin-k4mmd 1/3 CrashLoopBackOff 104 (3m9s ago) 45h 10.244.5.29 host06 <none> <none>
longhorn-csi-plugin-mnz8z 1/3 CrashLoopBackOff 108 (28s ago) 45h 10.244.2.14 host03 <none> <none>
longhorn-csi-plugin-nzw5v 3/3 Running 52 (12m ago) 46h 10.244.1.45 host02 <none> <none>
longhorn-csi-plugin-r54wv 1/3 CrashLoopBackOff 107 (2m57s ago) 45h 10.244.4.34 host05 <none> <none>
longhorn-csi-plugin-v6b24 1/3 CrashLoopBackOff 108 (39s ago) 45h 10.244.0.13 host01 <none> <none>
longhorn-driver-deployer-56f94fbbd7-psgsk 1/1 Running 17 (25m ago) 46h 10.244.1.44 host02 <none> <none>
longhorn-manager-c46fd 1/1 Running 6 (26m ago) 46h 10.244.1.40 host02 <none> <none>
longhorn-manager-fnp7s 0/1 CrashLoopBackOff 38 (61s ago) 45h 10.244.5.31 host06 <none> <none>
longhorn-manager-jpxrv 0/1 CrashLoopBackOff 38 (97s ago) 45h 10.244.3.34 host04 <none> <none>
longhorn-manager-kzlsm 0/1 CrashLoopBackOff 37 (118s ago) 45h 10.244.0.14 host01 <none> <none>
longhorn-manager-lnmtv 0/1 CrashLoopBackOff 38 (110s ago) 45h 10.244.2.13 host03 <none> <none>
longhorn-manager-s54g6 0/1 CrashLoopBackOff 38 (68s ago) 45h 10.244.4.29 host05 <none> <none>
longhorn-ui-74b6b85c94-7k4v5 1/1 Running 5 (26m ago) 46h 10.244.1.46 host02 <none> <none>
longhorn-ui-74b6b85c94-gfb22 1/1 Running 6 (26m ago) 46h 10.244.1.41 host02 <none> <none>
失败 Pod 的日志
longhorn-manager-fnp7s
W0624 19:29:48.578629 1 client_config.go:618] Neither --kubeconfig nor --master was specified. Using the inClusterConfig. This might not work.
time="2024-06-24T19:29:48Z" level=info msg="Starting longhorn conversion webhook server" func=webhook.StartWebhook file="webhook.go:24"
time="2024-06-24T19:29:48Z" level=info msg="Waiting for conversion webhook to become ready" func=webhook.StartWebhook file="webhook.go:43"
time="2024-06-24T19:29:48Z" level=warning msg="Failed to check endpoint https://localhost:9501/v1/healthz" func=webhook.isServiceAvailable file="webhook.go:78" error="Get \"https://localhost:9501/v1/healthz\": dial tcp [::1]:9501: connect: connection refused"
time="2024-06-24T19:29:48Z" level=info msg="Active TLS secret longhorn-system/longhorn-webhook-tls (ver=5907) (count 2): map[listener.cattle.io/cn-longhorn-admission-webhook.longhor-59584d:longhorn-admission-webhook.longhorn-system.svc listener.cattle.io/cn-longhorn-conversion-webhook.longho-6a0089:longhorn-conversion-webhook.longhorn-system.svc listener.cattle.io/fingerprint:SHA1=A5D6010EC627A8877305D8828712E103B103C5B3]" func="memory.(*memory).Update" file="memory.go:42"
time="2024-06-24T19:29:48Z" level=info msg="Listening on :9501" func=server.ListenAndServe.func2 file="server.go:77"
time="2024-06-24T19:29:48Z" level=info msg="Starting apiregistration.k8s.io/v1, Kind=APIService controller" func="controller.(*controller).run" file="controller.go:144"
time="2024-06-24T19:29:48Z" level=info msg="Starting apiextensions.k8s.io/v1, Kind=CustomResourceDefinition controller" func="controller.(*controller).run" file="controller.go:144"
time="2024-06-24T19:29:48Z" level=info msg="Starting /v1, Kind=Secret controller" func="controller.(*controller).run" file="controller.go:144"
time="2024-06-24T19:29:48Z" level=info msg="Building conversion rules..." func="server.(*WebhookServer).runConversionWebhookListenAndServe.func1" file="server.go:195"
time="2024-06-24T19:29:48Z" level=info msg="Updating TLS secret for longhorn-system/longhorn-webhook-tls (count: 2): map[listener.cattle.io/cn-longhorn-admission-webhook.longhor-59584d:longhorn-admission-webhook.longhorn-system.svc listener.cattle.io/cn-longhorn-conversion-webhook.longho-6a0089:longhorn-conversion-webhook.longhorn-system.svc listener.cattle.io/fingerprint:SHA1=A5D6010EC627A8877305D8828712E103B103C5B3]" func="kubernetes.(*storage).saveInK8s" file="controller.go:225"
time="2024-06-24T19:29:50Z" level=info msg="Started longhorn conversion webhook server on localhost" func=webhook.StartWebhook file="webhook.go:47"
time="2024-06-24T19:29:51Z" level=warning msg="Failed to check endpoint https://longhorn-conversion-webhook.longhorn-system.svc:9501/v1/healthz" func=webhook.isServiceAvailable file="webhook.go:78" error="Get \"https://longhorn-conversion-webhook.longhorn-system.svc:9501/v1/healthz\": context deadline exceeded (Client.Timeout exceeded while awaiting headers)"
time="2024-06-24T19:29:54Z" level=warning msg="Failed to check endpoint https://longhorn-conversion-webhook.longhorn-system.svc:9501/v1/healthz" func=webhook.isServiceAvailable file="webhook.go:78" error="Get \"https://longhorn-conversion-webhook.longhorn-system.svc:9501/v1/healthz\": context deadline exceeded (Client.Timeout exceeded while awaiting headers)"
longhorn-csi-插件-58wgq
I0624 19:28:16.188843 64939 main.go:135] Version: v2.9.2
I0624 19:28:16.188921 64939 main.go:136] Running node-driver-registrar in mode=
I0624 19:28:16.188948 64939 main.go:157] Attempting to open a gRPC connection with: "/csi/csi.sock"
W0624 19:28:26.189744 64939 connection.go:232] Still connecting to unix:///csi/csi.sock
W0624 19:28:36.189205 64939 connection.go:232] Still connecting to unix:///csi/csi.sock
W0624 19:28:46.190515 64939 connection.go:232] Still connecting to unix:///csi/csi.sock
E0624 19:28:46.190535 64939 main.go:160] error connecting to CSI driver: context deadline exceeded
3
–
–
–
|