You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: bindata/assets/alerts/cpu-utilization.yaml
+5-8Lines changed: 5 additions & 8 deletions
Original file line number
Diff line number
Diff line change
@@ -10,18 +10,15 @@ spec:
10
10
- alert: HighOverallControlPlaneCPU
11
11
annotations:
12
12
summary: >-
13
-
CPU utilization across all control plane nodes is more than 60% of the total available CPU. Control plane node outage may cause a cascading failure; increase available CPU.
13
+
CPU utilization across all three control plane nodes is higher than two control plane nodes can sustain; a single control plane node outage may
14
+
cause a cascading failure; increase available CPU.
On a multi-node cluster with three control plane nodes, the overall CPU utilization may only be about 2/3 of all available capacity.
17
+
Given three control plane nodes, the overall CPU utilization may only be about 2/3 of all available capacity.
17
18
This is because if a single control plane node fails, the remaining two must handle the load of the cluster in order to be HA.
18
-
If the cluster is using more than 2/3 of all capacity, if one control plane node fails, the remaining two are likely to fail when they take the load.
19
+
If the cluster is using more than 2/3 of all capacity, if one control plane node fails, the remaining two are likely to
20
+
fail when they take the load.
19
21
To fix this, increase the CPU and memory on your control plane nodes.
20
-
21
-
On a single node OpenShift (SNO) cluster, this alert will also fire if the 2/3 of the CPU cores of the node are in use by any workload. This level of CPU utlization
22
-
of an SNO cluster is probably not a problem under most circumstances, but high levels of utilization may result in degraded performance.
23
-
To manage this alert or silence it in case of false positives see the following link:
0 commit comments