Skip to content

Kubernetes 1.33.x upgrade/fresh install not working #1683

@vgeorgiev69

Description

@vgeorgiev69

/kind bug

What steps did you take and what happened:
[A clear and concise description of what the bug is.]

I've successfully upgraded my clusters from 1.31 to 1.32.9. The problem occurred when upgrading from 1.32.9 to 1.33.x
Did a complete fresh cluster with all of the 1.33.x versions - all of them fail. These are the logs I'm seeing:

Sep 23 13:12:25 test-control-plane-thfpm kubelet[2727]: E0923 13:12:25.351537    2727 event.go:368] "Unable to write event (may retry after sleeping)" err="Post \"https://91.98.81.42:6443/api/v1/namespaces/default/events\": dial tcp 91.98.81.42:6443: connect: connection refused" event="&Event{ObjectMeta:{test-control-plane-thfpm.1867eb757653ac72  default    0 0001-01-01 00:00:00 +0000 UTC <nil> <nil> map[] map[] [] [] []},InvolvedObject:ObjectReference{Kind:Node,Namespace:,Name:test-control-plane-thfpm,UID:test-control-plane-thfpm,APIVersion:,ResourceVersion:,FieldPath:,},Reason:Starting,Message:Starting kubelet.,Source:EventSource{Component:kubelet,Host:test-control-plane-thfpm,},FirstTimestamp:2025-09-23 13:11:09.240364146 +0000 UTC m=+0.080328188,LastTimestamp:2025-09-23 13:11:09.240364146 +0000 UTC m=+0.080328188,Count:1,Type:Normal,EventTime:0001-01-01 00:00:00 +0000 UTC,Series:nil,Action:,Related:nil,ReportingController:kubelet,ReportingInstance:test-control-plane-thfpm,}"
Sep 23 13:12:25 test-control-plane-thfpm kubelet[2727]: E0923 13:12:25.369536    2727 kubelet.go:3305] "No need to create a mirror pod, since failed to get node info from the cluster" err="node \"test-control-plane-thfpm\" not found" node="test-control-plane-thfpm"
Sep 23 13:12:25 test-control-plane-thfpm kubelet[2727]: I0923 13:12:25.369693    2727 scope.go:117] "RemoveContainer" containerID="a8a8fc1ad37a7fc7b0d281d01c4bbda500edc94edef19d4614bd3a6625671aa9"
Sep 23 13:12:25 test-control-plane-thfpm kubelet[2727]: E0923 13:12:25.369947    2727 pod_workers.go:1301] "Error syncing pod, skipping" err="failed to \"StartContainer\" for \"kube-apiserver\" with CrashLoopBackOff: \"back-off 40s restarting failed container=kube-apiserver pod=kube-apiserver-test-control-plane-thfpm_kube-system(41c00f3f721fc8b46f670503139a929d)\"" pod="kube-system/kube-apiserver-test-control-plane-thfpm" podUID="41c00f3f721fc8b46f670503139a929d"
Sep 23 13:12:28 test-control-plane-thfpm kubelet[2727]: E0923 13:12:28.289761    2727 reflector.go:200] "Failed to watch" err="failed to list *v1.CSIDriver: Get \"https://91.98.81.42:6443/apis/storage.k8s.io/v1/csidrivers?limit=500&resourceVersion=0\": dial tcp 91.98.81.42:6443: connect: connection refused" logger="UnhandledError" reflector="k8s.io/client-go/informers/factory.go:160" type="*v1.CSIDriver"
Sep 23 13:12:28 test-control-plane-thfpm kubelet[2727]: E0923 13:12:28.369017    2727 kubelet.go:3305] "No need to create a mirror pod, since failed to get node info from the cluster" err="node \"test-control-plane-thfpm\" not found" node="test-control-plane-thfpm"
Sep 23 13:12:29 test-control-plane-thfpm kubelet[2727]: E0923 13:12:29.328745    2727 eviction_manager.go:292] "Eviction manager: failed to get summary stats" err="failed to get node info: node \"test-control-plane-thfpm\" not found"
Sep 23 13:12:31 test-control-plane-thfpm kubelet[2727]: E0923 13:12:31.869714    2727 controller.go:145] "Failed to ensure lease exists, will retry" err="Get \"https://91.98.81.42:6443/apis/coordination.k8s.io/v1/namespaces/kube-node-lease/leases/test-control-plane-thfpm?timeout=10s\": dial tcp 91.98.81.42:6443: connect: connection refused" interval="7s"
Sep 23 13:12:32 test-control-plane-thfpm kubelet[2727]: I0923 13:12:32.057992    2727 kubelet_node_status.go:75] "Attempting to register node" node="test-control-plane-thfpm"
Sep 23 13:12:32 test-control-plane-thfpm kubelet[2727]: E0923 13:12:32.058555    2727 kubelet_node_status.go:107] "Unable to register node with API server" err="Post \"https://91.98.81.42:6443/api/v1/nodes\": dial tcp 91.98.81.42:6443: connect: connection refused" node="test-control-plane-thfpm"
Sep 23 13:12:35 test-control-plane-thfpm kubelet[2727]: E0923 13:12:35.352320    2727 event.go:368] "Unable to write event (may retry after sleeping)" err="Post \"https://91.98.81.42:6443/api/v1/namespaces/default/events\": dial tcp 91.98.81.42:6443: connect: connection refused" event="&Event{ObjectMeta:{test-control-plane-thfpm.1867eb757653ac72  default    0 0001-01-01 00:00:00 +0000 UTC <nil> <nil> map[] map[] [] [] []},InvolvedObject:ObjectReference{Kind:Node,Namespace:,Name:test-control-plane-thfpm,UID:test-control-plane-thfpm,APIVersion:,ResourceVersion:,FieldPath:,},Reason:Starting,Message:Starting kubelet.,Source:EventSource{Component:kubelet,Host:test-control-plane-thfpm,},FirstTimestamp:2025-09-23 13:11:09.240364146 +0000 UTC m=+0.080328188,LastTimestamp:2025-09-23 13:11:09.240364146 +0000 UTC m=+0.080328188,Count:1,Type:Normal,EventTime:0001-01-01 00:00:00 +0000 UTC,Series:nil,Action:,Related:nil,ReportingController:kubelet,ReportingInstance:test-control-plane-thfpm,}"

What did you expect to happen:
To have 1.33.x working as the previous versions.

Anything else you would like to add:
[Miscellaneous information that will assist in solving the issue.]

Environment:

  • cluster-api-provider-hetzner version: 1.0.6, the rest are 1.10.6
  • Kubernetes version: (use kubectl version) Client Version: v1.32.1 / Kustomize Version: v5.5.0 / Server Version: v1.32.9
  • OS (e.g. from /etc/os-release): Ubuntu 22 and Ubuntu 24.04

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions