As a block node operator
I want a set of operational and health metrics for the P1 plugins
To ensure that I can observe and alert on the health of my BN
- Note: The docs must include a suggested subset of alerts
Technical Notes
This requires:
- viewing the metrics flowing from a deployed BN (dev or any other env)
- identifying the top 1-3 metrics for each P1 plugin and for server health
- capturing that list and suggesting 1-5 metrics that should be alerted on, each with a suggested threshold
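
One way to capture the suggested alerts is as a small list of metric/threshold rules that can be checked against a metrics scrape. The sketch below is illustrative only: it assumes the BN exposes metrics in a Prometheus-style text format, and every metric name and threshold in it is a hypothetical placeholder, not an actual BN metric or a recommended value. The parser is deliberately simplified and handles only samples without labels.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class AlertRule:
    metric: str       # hypothetical metric name, replace with a real one
    op: str           # ">" fires above the threshold, "<" fires below it
    threshold: float  # placeholder value, to be tuned on a deployed BN


def parse_metrics(text: str) -> dict[str, float]:
    """Parse unlabelled 'name value' samples, skipping comments and blanks."""
    samples: dict[str, float] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        parts = line.split()
        if len(parts) < 2:
            continue
        try:
            samples[parts[0]] = float(parts[1])
        except ValueError:
            continue  # ignore lines whose value is not numeric
    return samples


def firing(rules: list[AlertRule], samples: dict[str, float]) -> list[str]:
    """Return the names of metrics whose current value breaches its rule."""
    breached = []
    for rule in rules:
        value = samples.get(rule.metric)
        if value is None:
            continue  # metric not present in this scrape
        if (rule.op == ">" and value > rule.threshold) or (
            rule.op == "<" and value < rule.threshold
        ):
            breached.append(rule.metric)
    return breached


# Hypothetical scrape output and rules, for illustration only.
SAMPLE = """
# HELP example_plugin_errors_total placeholder metric
example_plugin_errors_total 7
example_server_heap_used_ratio 0.42
"""

RULES = [
    AlertRule("example_plugin_errors_total", ">", 5),
    AlertRule("example_server_heap_used_ratio", ">", 0.9),
]

if __name__ == "__main__":
    print(firing(RULES, parse_metrics(SAMPLE)))
```

Keeping the rules as plain data makes it straightforward to copy the final 1-5 suggestions into the docs or translate them into whatever alerting system the operator uses.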