[Bug] After upgrading from 3.0.8 to 4.0.2, one of three BE nodes restarts every 2 minutes #59576

@paukey

Search before asking

  • I have searched the issues and found no similar issues.

Version

Cluster: 3 BE + 3 FE
Upgraded from version 3.0.8 to 4.0.2
Only one BE node crashes (it restarts continuously).
be.out:

INFO: java_cmd /home/doris/jdk-17.0.14//bin/java
INFO: jdk_version 17
StdoutLogger 2026-01-05 20:30:27,109 Start time: 2026-01-05 Monday 20:30:27 CST
INFO: java_cmd /home/doris/jdk-17.0.14//bin/java
INFO: jdk_version 17
OpenJDK 64-Bit Server VM warning: Option CriticalJNINatives was deprecated in version 16.0 and will likely be removed in a future release.
SLF4J(W): Class path contains multiple SLF4J providers.
SLF4J(W): Found provider [org.apache.logging.slf4j.SLF4JServiceProvider@6e3c1e69]
SLF4J(W): Found provider [org.slf4j.reload4j.Reload4jServiceProvider@1888ff2c]
SLF4J(W): See https://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J(I): Actual provider is of type [org.apache.logging.slf4j.SLF4JServiceProvider@6e3c1e69]
start BE in local mode
*** Query id: cb44edd2f46c2d5b-f53cf473ac5b9d92 ***
*** is nereids: 1 ***
*** tablet id: 0 ***
*** Aborted at 1767616255 (unix time) try "date -d @1767616255" if you are using GNU date ***
*** Current BE git commitID: 30d2df0459 ***
*** SIGSEGV address not mapped to object (@0x8) received by PID 6368 (TID 7171 OR 0x7f5a3dcba700) from PID 8; stack trace: ***
 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /home/zcp/repo_center/doris_release/doris/be/src/common/signal_handler.h:420
 1# PosixSignals::chained_handler(int, siginfo*, void*) [clone .part.0] in /home/doris/jdk-17.0.14//lib/server/libjvm.so
 2# JVM_handle_linux_signal in /home/doris/jdk-17.0.14//lib/server/libjvm.so
 3# 0x00007F76C861D630 in /lib64/libpthread.so.0
 4# doris::vectorized::FileScanner::_convert_to_output_block(doris::vectorized::Block*) at /home/zcp/repo_center/doris_release/doris/be/src/vec/exec/scan/file_scanner.cpp:794
 5# doris::vectorized::FileScanner::_get_block_wrapped(doris::RuntimeState*, doris::vectorized::Block*, bool*) at /home/zcp/repo_center/doris_release/doris/be/src/vec/exec/scan/file_scanner.cpp:496
 6# doris::vectorized::FileScanner::_get_block_impl(doris::RuntimeState*, doris::vectorized::Block*, bool*) at /home/zcp/repo_center/doris_release/doris/be/src/vec/exec/scan/file_scanner.cpp:411
 7# doris::vectorized::Scanner::get_block(doris::RuntimeState*, doris::vectorized::Block*, bool*) in /home/doris/be/lib/doris_be
 8# doris::vectorized::Scanner::get_block_after_projects(doris::RuntimeState*, doris::vectorized::Block*, bool*) at /home/zcp/repo_center/doris_release/doris/be/src/vec/exec/scan/scanner.cpp:87
 9# doris::vectorized::ScannerScheduler::_scanner_scan(std::shared_ptr<doris::vectorized::ScannerContext>, std::shared_ptr<doris::vectorized::ScanTask>) at /home/zcp/repo_center/doris_release/doris/be/src/vec/exec/scan/scanner_scheduler.cpp:173
10# std::_Function_handler<bool (), doris::vectorized::ScannerScheduler::submit(std::shared_ptr<doris::vectorized::ScannerContext>, std::shared_ptr<doris::vectorized::ScanTask>)::$_0::operator()() const::{lambda()#1}>::_M_invoke(std::_Any_data const&) at /usr/local/ldb-toolchain-v0.26/bin/../lib/gcc/x86_64-pc-linux-gnu/15/include/g++-v15/bits/std_function.h:292
11# doris::vectorized::ScannerSplitRunner::process_for(std::chrono::duration<long, std::ratio<1l, 1000000000l> >) at /home/zcp/repo_center/doris_release/doris/be/src/vec/exec/scan/scanner_scheduler.cpp:408
12# doris::vectorized::PrioritizedSplitRunner::process() at /home/zcp/repo_center/doris_release/doris/be/src/vec/exec/executor/time_sharing/prioritized_split_runner.cpp:104
13# doris::vectorized::TimeSharingTaskExecutor::_dispatch_thread() at /home/zcp/repo_center/doris_release/doris/be/src/vec/exec/executor/time_sharing/time_sharing_task_executor.cpp:572
14# doris::Thread::supervise_thread(void*) at /home/zcp/repo_center/doris_release/doris/be/src/util/thread.cpp:461
15# start_thread in /lib64/libpthread.so.0
16# clone in /lib64/libc.so.6
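
To put the crash on the same clock as the other log lines, the epoch value in the "Aborted at" line can be converted to local time and compared with the "Start time" line. The following is a minimal sketch, assuming that "CST" in be.out means China Standard Time (UTC+8) and that both lines belong to the same BE run; the variable names are illustrative only.

```python
from datetime import datetime, timedelta, timezone

# Assumption: "CST" in be.out is China Standard Time (UTC+8).
CST = timezone(timedelta(hours=8))

# Epoch seconds copied from the "*** Aborted at 1767616255 ..." line.
abort_epoch = 1767616255

# Wall-clock time copied from the StdoutLogger "Start time" line.
start_local = datetime(2026, 1, 5, 20, 30, 27, tzinfo=CST)

# Convert the abort time into the same timezone as the start time.
abort_local = datetime.fromtimestamp(abort_epoch, tz=CST)

# Time between process start and the SIGSEGV.
uptime = abort_local - start_local

print(abort_local.isoformat())   # 2026-01-05T20:30:55+08:00
print(uptime.total_seconds())    # 28.0
```

Under these assumptions, this BE process received the SIGSEGV about 28 seconds after it started.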

be.WARN then shows:
W20260105 19:10:05.586156 19703 stream_load_executor.cpp:197] begin transaction failed, errmsg=[E-240]Have not get FE Master heartbeat yet

What's Wrong?

After upgrading to 4.0.2, one BE node cannot run stably; it keeps restarting.

What You Expected?

Please fix this so that the BE node runs stably after the upgrade.

How to Reproduce?

No response

Anything Else?

No response

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct
