-
Notifications
You must be signed in to change notification settings - Fork 85
Open
Description
/kind bug
What steps did you take and what happened:
[A clear and concise description of what the bug is.]
I've successfully upgraded my clusters from 1.31 to 1.32.9. The problem occurred when upgrading from 1.32.9 to 1.33.x
Did a complete fresh cluster with all of the 1.33.x versions - all of them fail. These are the logs I'm seeing:
Sep 23 13:12:25 test-control-plane-thfpm kubelet[2727]: E0923 13:12:25.351537 2727 event.go:368] "Unable to write event (may retry after sleeping)" err="Post \"https://91.98.81.42:6443/api/v1/namespaces/default/events\": dial tcp 91.98.81.42:6443: connect: connection refused" event="&Event{ObjectMeta:{test-control-plane-thfpm.1867eb757653ac72 default 0 0001-01-01 00:00:00 +0000 UTC <nil> <nil> map[] map[] [] [] []},InvolvedObject:ObjectReference{Kind:Node,Namespace:,Name:test-control-plane-thfpm,UID:test-control-plane-thfpm,APIVersion:,ResourceVersion:,FieldPath:,},Reason:Starting,Message:Starting kubelet.,Source:EventSource{Component:kubelet,Host:test-control-plane-thfpm,},FirstTimestamp:2025-09-23 13:11:09.240364146 +0000 UTC m=+0.080328188,LastTimestamp:2025-09-23 13:11:09.240364146 +0000 UTC m=+0.080328188,Count:1,Type:Normal,EventTime:0001-01-01 00:00:00 +0000 UTC,Series:nil,Action:,Related:nil,ReportingController:kubelet,ReportingInstance:test-control-plane-thfpm,}"
Sep 23 13:12:25 test-control-plane-thfpm kubelet[2727]: E0923 13:12:25.369536 2727 kubelet.go:3305] "No need to create a mirror pod, since failed to get node info from the cluster" err="node \"test-control-plane-thfpm\" not found" node="test-control-plane-thfpm"
Sep 23 13:12:25 test-control-plane-thfpm kubelet[2727]: I0923 13:12:25.369693 2727 scope.go:117] "RemoveContainer" containerID="a8a8fc1ad37a7fc7b0d281d01c4bbda500edc94edef19d4614bd3a6625671aa9"
Sep 23 13:12:25 test-control-plane-thfpm kubelet[2727]: E0923 13:12:25.369947 2727 pod_workers.go:1301] "Error syncing pod, skipping" err="failed to \"StartContainer\" for \"kube-apiserver\" with CrashLoopBackOff: \"back-off 40s restarting failed container=kube-apiserver pod=kube-apiserver-test-control-plane-thfpm_kube-system(41c00f3f721fc8b46f670503139a929d)\"" pod="kube-system/kube-apiserver-test-control-plane-thfpm" podUID="41c00f3f721fc8b46f670503139a929d"
Sep 23 13:12:28 test-control-plane-thfpm kubelet[2727]: E0923 13:12:28.289761 2727 reflector.go:200] "Failed to watch" err="failed to list *v1.CSIDriver: Get \"https://91.98.81.42:6443/apis/storage.k8s.io/v1/csidrivers?limit=500&resourceVersion=0\": dial tcp 91.98.81.42:6443: connect: connection refused" logger="UnhandledError" reflector="k8s.io/client-go/informers/factory.go:160" type="*v1.CSIDriver"
Sep 23 13:12:28 test-control-plane-thfpm kubelet[2727]: E0923 13:12:28.369017 2727 kubelet.go:3305] "No need to create a mirror pod, since failed to get node info from the cluster" err="node \"test-control-plane-thfpm\" not found" node="test-control-plane-thfpm"
Sep 23 13:12:29 test-control-plane-thfpm kubelet[2727]: E0923 13:12:29.328745 2727 eviction_manager.go:292] "Eviction manager: failed to get summary stats" err="failed to get node info: node \"test-control-plane-thfpm\" not found"
Sep 23 13:12:31 test-control-plane-thfpm kubelet[2727]: E0923 13:12:31.869714 2727 controller.go:145] "Failed to ensure lease exists, will retry" err="Get \"https://91.98.81.42:6443/apis/coordination.k8s.io/v1/namespaces/kube-node-lease/leases/test-control-plane-thfpm?timeout=10s\": dial tcp 91.98.81.42:6443: connect: connection refused" interval="7s"
Sep 23 13:12:32 test-control-plane-thfpm kubelet[2727]: I0923 13:12:32.057992 2727 kubelet_node_status.go:75] "Attempting to register node" node="test-control-plane-thfpm"
Sep 23 13:12:32 test-control-plane-thfpm kubelet[2727]: E0923 13:12:32.058555 2727 kubelet_node_status.go:107] "Unable to register node with API server" err="Post \"https://91.98.81.42:6443/api/v1/nodes\": dial tcp 91.98.81.42:6443: connect: connection refused" node="test-control-plane-thfpm"
Sep 23 13:12:35 test-control-plane-thfpm kubelet[2727]: E0923 13:12:35.352320 2727 event.go:368] "Unable to write event (may retry after sleeping)" err="Post \"https://91.98.81.42:6443/api/v1/namespaces/default/events\": dial tcp 91.98.81.42:6443: connect: connection refused" event="&Event{ObjectMeta:{test-control-plane-thfpm.1867eb757653ac72 default 0 0001-01-01 00:00:00 +0000 UTC <nil> <nil> map[] map[] [] [] []},InvolvedObject:ObjectReference{Kind:Node,Namespace:,Name:test-control-plane-thfpm,UID:test-control-plane-thfpm,APIVersion:,ResourceVersion:,FieldPath:,},Reason:Starting,Message:Starting kubelet.,Source:EventSource{Component:kubelet,Host:test-control-plane-thfpm,},FirstTimestamp:2025-09-23 13:11:09.240364146 +0000 UTC m=+0.080328188,LastTimestamp:2025-09-23 13:11:09.240364146 +0000 UTC m=+0.080328188,Count:1,Type:Normal,EventTime:0001-01-01 00:00:00 +0000 UTC,Series:nil,Action:,Related:nil,ReportingController:kubelet,ReportingInstance:test-control-plane-thfpm,}"
What did you expect to happen:
To have 1.33.x working as the previous versions.
Anything else you would like to add:
[Miscellaneous information that will assist in solving the issue.]
Environment:
- cluster-api-provider-hetzner version: 1.0.6, the rest are 1.10.6
- Kubernetes version: (use
kubectl version
) Client Version: v1.32.1 / Kustomize Version: v5.5.0 / Server Version: v1.32.9 - OS (e.g. from
/etc/os-release
): Ubuntu 22 and Ubuntu 24.04
Metadata
Metadata
Assignees
Labels
No labels