First idea in adding histogram metric #17894

andsel · 2025-08-01T08:35:28Z

Release notes

[rn:skip]

What does this PR do?

Create a new histogram metric type. Updated all metric infrastructure code, the core class is HistogramMetric which uses HdrHistogram Recorder to track the measurements and produces a new HistogramSnapshot every time getValue is invoked, clearing the measurements stored in the Recorder.
HistogramSnapshot is a data class that exposes 75Percentile and 90Percentile (at the moment).

To verify the effectiveness of this, the memory queue read client was updated to expose a metric to track the batch size.

Created a new setting named pipeline.batch.metrics with values "true" and "false" (at the moment), to enable and disable the computation of such metrics. This setting is a string because in a follow up PR will become a tri-state flag.

Why is it important/What is the impact to the user?

This is an intermediate step, it has to proof the exposition of new metric section under the _node/stats API endpoint like:

"pipelines": {
    "main": {
      ...
      "batch": {
        "event_size": {
          "p75": 12.4
          "p90": 13.1
        }
    }
  }
}

Percentiles are also pushed down to ES when leveraging the monitoring.

Checklist

My code follows the style guidelines of this project
I have commented my code, particularly in hard-to-understand areas
~~[ ] I have made corresponding changes to the documentation~~
~~[ ] I have made corresponding change to the default configuration files (and/or docker env variables)~~
I have added tests that prove my fix is effective or that my feature works

Author's Checklist

Check with xpack monitoring

How to test this PR locally

Run Logstash with the setting pipeline.batch.metrics with values "true" and verify that node stats exposes the batch_size histogram.

curl http://localhost:9600/_node/stats | jq .pipelines.main.events

Related issues

Use cases

Screenshots

Logs

github-actions · 2025-08-01T08:35:36Z

🤖 GitHub comments

Expand to view the GitHub comments

Just comment with:

run docs-build : Re-trigger the docs validation. (use unformatted text in the comment!)

mergify · 2025-08-01T08:36:03Z

This pull request does not have a backport label. Could you fix it @andsel? 🙏
To fixup this pull request, you need to add the backport labels for the needed
branches, such as:

backport-8./d is the label to automatically backport to the 8./d branch. /d is the digit.
If no backport is necessary, please add the backport-skip label

andsel · 2025-08-12T09:00:02Z

logstash-core/src/main/java/org/logstash/instrument/metrics/histogram/HistogramSnapshot.java

+/**
+ * Class to expose percentiles retrieved from an HdrHistogram.
+ * */
+public final class HistogramSnapshot implements Serializable {


Note for reviewer

Implements Serializable so that Valuefier can use the identity converter.

andsel · 2025-08-12T09:12:53Z

logstash-core/src/main/java/org/logstash/Rubyfier.java

            )
        );
        converters.put(SecretVariable.class, JAVAUTIL_CONVERTER);
+        converters.put(HistogramSnapshot.class, JAVAUTIL_CONVERTER);


Note for reviewer

The x-pack monitoring pipeline create Logstash events which contains this snapshot, and need to be converted to Ruby object in the Rubyfier.deep method.

…te metrics inside, for example, the queue reader client

…hat in case no collector is provided it return safely

…erted by the Valuefier

…on of batch size related metrics into histrograms

…ze metrics

…nd spread around to reach in memory queue client and control the batch size metrics. Covered with tests the readBatch code to verify the effectiveness of the flag

…ch_size->[75Percentile, 90Percentile] to {pipeline_name}->batch->event_count->[p75, p90]

elastic-sonarqube · 2025-08-14T06:46:48Z

Quality Gate passed

Issues
0 New issues
0 Fixed issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
0.0% Duplication on New Code

See analysis details on SonarQube

elasticmachine · 2025-08-14T06:59:27Z

💛 Build succeeded, but was flaky

Buildkite Build
Commit: edcf61c

Failed CI Steps

🥼 x-pack unit tests - FIPS mode

History

💔 Build #3300 failed d028816
💚 Build #3299 succeeded 1e27899
💚 Build #3295 succeeded 506c59c
💚 Build #3288 succeeded 66cd2b3
💚 Build #3287 succeeded 5932065

cc @andsel

andsel self-assigned this Aug 1, 2025

andsel added the enhancement label Aug 1, 2025

andsel mentioned this pull request Aug 1, 2025

Implement average lifetime long batch's size and document count metric #17892

Closed

3 tasks

andsel mentioned this pull request Aug 1, 2025

Create new histogram metric to expose batch size percentiles amongst a time windows (1 min, 5 min, 15min, 1h, 1d) #17895

Open

1 task

andsel force-pushed the feature/introduce_histogram_metric branch 2 times, most recently from 0ebad0f to a199631 Compare August 6, 2025 13:03

andsel commented Aug 12, 2025

View reviewed changes

andsel added 15 commits August 13, 2025 09:55

First idea in adding histogram metric

5f649df

Updated licenses after HdrHistogram addition

e51ac48

Created a test mocking class for namespaced metric to be used to crea…

2b6f485

…te metrics inside, for example, the queue reader client

Fixed test, missed metric passed into JavaPipeline instantiation

0ba3776

Update initialisation of histogram metric for batch queue reader so t…

8f88448

…hat in case no collector is provided it return safely

If histogram metric is not assignable then create an empty dummy

627a9b6

Made HistogramSnapshot to implement Serializable interface to be conv…

1973503

…erted by the Valuefier

Minor, remove commented code

412a266

Aligned metric docuemnt's JSON schema sent to ES

4c82149

Updated Rubifier to be able to convert also HistogramSnapshot

ce7a920

Minor, removed commented code

8aae756

Added setting 'pipeline.batch.metrics' to enable/disable the collecti…

d01c9ae

…on of batch size related metrics into histrograms

[Test] Updated MockNamespacedMetric to pool metrics instances

4404735

[Test] Added test to verify that queue client reader updates batch si…

f34e8b4

…ze metrics

Decoded 'pipeline.batch.metrics' setting into BatchSizeSamplingType a…

1e27899

…nd spread around to reach in memory queue client and control the batch size metrics. Covered with tests the readBatch code to verify the effectiveness of the flag

andsel force-pushed the feature/introduce_histogram_metric branch from 506c59c to 1e27899 Compare August 13, 2025 08:20

andsel added 2 commits August 13, 2025 18:16

Reshaped batch size metric response from {pipeline-name}->events->bat…

d028816

…ch_size->[75Percentile, 90Percentile] to {pipeline_name}->batch->event_count->[p75, p90]

[Test] fixed monitoring schema definition, missed a closing curly

edcf61c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

First idea in adding histogram metric #17894

First idea in adding histogram metric #17894

Uh oh!

andsel commented Aug 1, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Aug 1, 2025

Uh oh!

mergify bot commented Aug 1, 2025

Uh oh!

andsel Aug 12, 2025 •

edited

Loading

Uh oh!

andsel Aug 12, 2025

Uh oh!

elastic-sonarqube bot commented Aug 14, 2025

Uh oh!

elasticmachine commented Aug 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

First idea in adding histogram metric #17894

Are you sure you want to change the base?

First idea in adding histogram metric #17894

Uh oh!

Conversation

andsel commented Aug 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Release notes

What does this PR do?

Why is it important/What is the impact to the user?

Checklist

Author's Checklist

How to test this PR locally

Related issues

Use cases

Screenshots

Logs

Uh oh!

github-actions bot commented Aug 1, 2025

🤖 GitHub comments

Uh oh!

mergify bot commented Aug 1, 2025

Uh oh!

andsel Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

andsel Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

elastic-sonarqube bot commented Aug 14, 2025

Quality Gate passed

Uh oh!

elasticmachine commented Aug 14, 2025

💛 Build succeeded, but was flaky

Failed CI Steps

History

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

andsel commented Aug 1, 2025 •

edited

Loading

andsel Aug 12, 2025 •

edited

Loading