release-25.3: pkg/util/log: Implement handling for oversized log messages in bufferedSink #158259

blathers-crl · 2025-11-24T08:01:54Z

Backport 1/1 commits from #157964 on behalf of @Abhinav1299.

Previously, when a single log message exceeded the configured max-buffer-size
for a buffered sink with exit-on-error enabled, the error would propagate up
and trigger process termination. This was overly aggressive for what amounts
to a logging configuration issue - a single oversized SQL query (e.g., with a
multi-megabyte string literal) could crash an entire CockroachDB node.

This commit modifies bufferedSink.output() to detect the errMsgTooLarge error
and handle it gracefully. When an oversized message is encountered, instead of
propagating the error, we drop the message and log a warning via Ops.Warningf()
indicating that the message exceeded the buffer size limit. This allows the
node to continue operating normally while still providing visibility into the
issue through logged warnings.

The implementation uses a two-phase approach to avoid deadlock: first, while
holding the sink's mutex, we detect the oversized message and set a flag with
the relevant information; then, after releasing the lock, we emit the warning.
This is necessary because calling Ops.Warningf() while holding the mutex would
cause the warning message to attempt re-entry into the same sink, resulting in
a deadlock when it tries to acquire the already-held lock.

This resolves #152635

Part of: CRDB-53951
Epic: CRDB-56325
Release note: None

Release justification: This PR fixes the CRDB process termination when a log message greater than max-buffer-size of a sink is encountered when exit-on-error flag is enabled.

…edSink Previously, when a single log message exceeded the configured max-buffer-size for a buffered sink with exit-on-error enabled, the error would propagate up and trigger process termination. This was overly aggressive for what amounts to a logging configuration issue - a single oversized SQL query (e.g., with a multi-megabyte string literal) could crash an entire CockroachDB node. This commit modifies bufferedSink.output() to detect the errMsgTooLarge error and handle it gracefully. When an oversized message is encountered, instead of propagating the error, we drop the message and log a warning via Ops.Warningf() indicating that the message exceeded the buffer size limit. This allows the node to continue operating normally while still providing visibility into the issue through logged warnings. The implementation uses a two-phase approach to avoid deadlock: first, while holding the sink's mutex, we detect the oversized message and set a flag with the relevant information; then, after releasing the lock, we emit the warning. This is necessary because calling Ops.Warningf() while holding the mutex would cause the warning message to attempt re-entry into the same sink, resulting in a deadlock when it tries to acquire the already-held lock. Part of: CRDB-53951 Epic: CRDB-56325 Release note: None

blathers-crl · 2025-11-24T08:01:58Z

Thanks for opening a backport.

Before merging, please confirm that the change does not break backwards compatibility and otherwise complies with the backport policy. Include a brief release justification in the PR description explaining why the backport is appropriate. All backports must be reviewed by the TL for the owning area. While the stricter LTS policy does not yet apply, please exercise judgment and consider gating non-critical changes behind a disabled-by-default feature flag when appropriate.

cockroach-teamcity · 2025-11-24T08:02:28Z

This change is

dhartunian

Release justification: this is a low risk improvement to our error behavior for critical log sinks that can cause unnecessary node crashes.

blathers-crl bot force-pushed the blathers/backport-release-25.3-157964 branch from 950651d to 5df6d3b Compare November 24, 2025 08:01

blathers-crl bot requested review from a team as code owners November 24, 2025 08:01

blathers-crl bot added blathers-backport This is a backport that Blathers created automatically. O-robot Originated from a bot. labels Nov 24, 2025

blathers-crl bot requested review from angles-n-daemons and removed request for a team November 24, 2025 08:01

blathers-crl bot assigned Abhinav1299 Nov 24, 2025

blathers-crl bot requested review from Abhinav1299, aa-joshi and arjunmahishi and removed request for a team November 24, 2025 08:01

blathers-crl bot requested review from dhartunian and kyle-a-wong November 24, 2025 08:01

blathers-crl bot added backport Label PR's that are backports to older release branches T-supportability labels Nov 24, 2025

dhartunian approved these changes Nov 26, 2025

View reviewed changes

Abhinav1299 merged commit f8526a3 into release-25.3 Nov 26, 2025
15 checks passed

Abhinav1299 deleted the blathers/backport-release-25.3-157964 branch November 26, 2025 18:55

celeste-cockroachdb bot added the target-release-25.3.7 label Nov 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

release-25.3: pkg/util/log: Implement handling for oversized log messages in bufferedSink #158259

release-25.3: pkg/util/log: Implement handling for oversized log messages in bufferedSink #158259

Uh oh!

blathers-crl bot commented Nov 24, 2025 •

edited by Abhinav1299

Loading

Uh oh!

blathers-crl bot commented Nov 24, 2025

Uh oh!

cockroach-teamcity commented Nov 24, 2025

Uh oh!

dhartunian left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

release-25.3: pkg/util/log: Implement handling for oversized log messages in bufferedSink #158259

release-25.3: pkg/util/log: Implement handling for oversized log messages in bufferedSink #158259

Uh oh!

Conversation

blathers-crl bot commented Nov 24, 2025 • edited by Abhinav1299 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

blathers-crl bot commented Nov 24, 2025

Uh oh!

cockroach-teamcity commented Nov 24, 2025

Uh oh!

dhartunian left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

blathers-crl bot commented Nov 24, 2025 •

edited by Abhinav1299

Loading