Open
Labels
area/control-plane (Control plane related), kind/bug (Bug), priority/1 (Priority; lower number equals higher priority)
Description
How to categorize this issue?
/area control-plane
/kind bug
/priority 1
What happened:
We had a live scalability issue where, due to invalid credentials, the etcd database filled up.
The `machine-controller-manager` kept updating `MachineDeployments` and `MachineSets`. The `MachineDeployment` status contains an entry for each `Machine` and its `lastError`.
(issues-canary/issues/7190 internally)
What you expected to happen:
`machine-controller-manager` should adhere to controller best practices, such as exponential backoff and skipping no-op (status) updates when the status has not changed.
How to reproduce it (as minimally and precisely as possible):
- Use the virtual mcm provider with a local api-server and etcd to simulate credential failure for a large number of machines (> 1000)
- Check size of etcd db.
Anything else we need to know?: