Can nightly charts somehow separate test-only changes from actual Lucene code changes? #454

@mikemccand

Nightly charts like the KNN indexing chart look depressing. But when you look closer, you realize that many of the big plummets in indexing throughput are due to benchy changes like switching to harder or higher-dimensionality vectors, increasing topK, Lucene increasing its default beamWidth, etc.

Could we somehow alter the chart to normalize away such changes?

This would be similar to how financial charts normalize away stock splits, so you aren't visually deceived into thinking your stock actually dropped or jumped 50% on some days.

If we took such an approach, maybe with a checkbox to switch between "real" and "normalized", only the most recent data point would show a trustworthy actual GB/hour, and all values from the past would be normalized/relative values.
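
To make the stock-split analogy concrete, here is a minimal sketch of back-adjusting a nightly series. The function name `back_adjust` and its arguments are hypothetical, not anything in the existing chart code. It also assumes the whole jump on a flagged night is due to the test-only change, which will not be exactly true if a real Lucene change landed the same night.

```python
def back_adjust(values, test_only_breaks):
    """Rescale history so flagged test-only changes do not show as jumps.

    values: nightly measurements (e.g. GB/hour), oldest first.
    test_only_breaks: indices i where the change first measured on night i
        was test-only, so the step from values[i-1] to values[i] is treated
        as an artifact (assumption: the entire step is due to that change).
    """
    breaks = set(test_only_breaks)
    adjusted = [0.0] * len(values)
    factor = 1.0
    # Walk backwards from the newest point, which keeps its real value.
    for i in range(len(values) - 1, -1, -1):
        adjusted[i] = values[i] * factor
        if i in breaks and i > 0 and values[i - 1] != 0:
            # Scale every earlier night by the ratio across this break.
            factor *= values[i] / values[i - 1]
    return adjusted


# Made-up numbers: night 2 switched to harder vectors and throughput halved.
nightly = [100, 102, 51, 50, 55]
normalized = back_adjust(nightly, test_only_breaks=[2])
```

With this approach only the latest point is a real GB/hour number; earlier points are relative, which matches the "real" vs "normalized" checkbox idea.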

Maybe the annotations that are test-only could be a different color or font?

As we add annotations, we could set an additional optional boolean when it is a test-only change. But it might be hard to categorize ... e.g. if Lucene changes its defaults, like the recent TieredMergePolicy default changes, which would that be?
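
A rough sketch of what the optional flag could look like, assuming annotations can be modeled as simple records. The `Annotation` class, its fields, and the color choice are all hypothetical, not the current annotation format. Defaulting the flag to `False` means existing annotations would not need to change.

```python
from dataclasses import dataclass


@dataclass
class Annotation:
    date: str                # placeholder format, e.g. "YYYY-MM-DD"
    text: str
    test_only: bool = False  # hypothetical optional flag; unset means "real" change


def annotation_color(annotation):
    """Pick a display color so test-only annotations stand out (colors are arbitrary)."""
    return "gray" if annotation.test_only else "black"


# Placeholder entries for illustration only.
notes = [
    Annotation("YYYY-MM-DD", "switch to higher-dimensionality vectors", test_only=True),
    Annotation("YYYY-MM-DD", "some actual Lucene change"),
]
colors = [annotation_color(n) for n in notes]
```

This leaves the ambiguous cases (such as Lucene changing its own defaults) up to whoever writes the annotation.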
