Skip to content

Share logic around effective status codes and bad captures better #175

@Mr0grog

Description

@Mr0grog

Various projects that are part of the web monitoring ecosystem have grown overlapping approaches to two related problems:

  1. Determining the effective status code of a capture (e.g. the server said 200, but it’s really a 404 page).
  2. Indicating that a capture might be bad (either the crawler got blocked or it was just an intermittent server error & retrying would have gotten a better response). We want to avoid pushing these captures in peoples’ faces.

There’s already some copy-pasting or porting of largely identical logic between things. In other places, we have wildly varying approaches that maybe should be more similar. If possible, we should figure out a strategy for sharing implementations, or at least aligning them so it’s clear when and where you should port work in one place to a copy in another.

Some example spots:

Metadata

Metadata

Assignees

No one assigned

    Type

    Projects

    Status

    Backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions