Skip to content

garm webhook && metrics/o11y #272

@pathcl

Description

@pathcl

Hello folks,

One the challenges about runners and github actions after years it's still observability.

I'd like to know if we have plans to work on o11y for garm's webhook.

func (a *APIController) handleWorkflowJobEvent(ctx context.Context, w http.ResponseWriter, r *http.Request) {

Use case(s)

  • If there's a stuck workflow because of a failed runner/provider. I know we have a timeout for bootstrap
  • What's the P99/P90 for jobs&runners, startup time
  • Get better insights about jobs. It should be possible to log/report about webhook events.
  • Github actions doesn't provide a retry-mechanism. How do we cope with it?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions