Skip to content

Segmentation Fault While Running HYBRID_NEWKF_DISPLACED_MERGE #345

@A-A-Abdelhamid

Description

@A-A-Abdelhamid

I'm encountering a seg fault from trklet::ProducerFakeDR when running the HYBRID_NEWKF_DISPLACED_MERGE configuration on a couple of SUSY PU200 sample files.
One of these files is /store/mc/Phase2Spring24DIGIRECOMiniAOD/DisplacedSUSY_stopToBottom_M-800_50mm_TuneCP5_14TeV-pythia8/GEN-SIM-DIGI-RAW-MINIAOD/PU200_AllTP_140X_mcRun4_realistic_v4-v1/2810000/fa1a17e4-27ef-4bce-a570-b1e03fc7a155.root

The job crashes at Event 6899, but only when processing the full file. If I run Event 6899 or 7000 in isolation, it processes successfully. This suggests the crash is due to a state or memory issue carrying over from a previous event.

This is the error message:

Begin processing the 901st record. Run 1, Event 6899, LumiSection 7 on stream 0 at 11-Dec-2025 21:16:38.219 CET

A fatal system signal has occurred: segmentation violation
The following is the call stack containing the origin of the signal.


Current Modules:

Module: trklet::ProducerFakeDR:ProducerFakeDR (crashed)

A fatal system signal has occurred: segmentation violation
Segmentation fault (core dumped)

Here is the gdb debug output:

Thread 1 "cmsRun" received signal SIGSEGV, Segmentation fault.
0x00007ffff7984fd3 in edm::RefCore::RefCore(edm::RefCore const&) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libDataFormatsCommon.so
#0  0x00007ffff7984fd3 in edm::RefCore::RefCore(edm::RefCore const&) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libDataFormatsCommon.so
#1  0x00007fffc7f4cdf6 in trklet::ProducerFakeDR::produce(edm::Event&, edm::EventSetup const&) () from /afs/cern.ch/user/a/alabdelh/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/pluginTrackFindingTrackletPlugins.so
#2  0x00007ffff7e5ef15 in edm::stream::EDProducerAdaptorBase::doEvent(edm::EventTransitionInfo const&, edm::ActivityRegistry*, edm::ModuleCallingContext const*) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#3  0x00007ffff7e4166c in edm::WorkerT<edm::stream::EDProducerAdaptorBase>::implDo(edm::EventTransitionInfo const&, edm::ModuleCallingContext const*) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#4  0x00007ffff7dc4f79 in std::__exception_ptr::exception_ptr edm::Worker::runModuleAfterAsyncPrefetch<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(std::__exception_ptr::exception_ptr, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#5  0x00007ffff7dd1344 in edm::Worker::RunModuleTask<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >::execute() () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#6  0x00007ffff7f4f828 in tbb::detail::d2::function_task<edm::WaitingTaskList::announce()::{lambda()#1}>::execute(tbb::detail::d1::execution_data&) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreConcurrency.so
#7  0x00007ffff7a3c84b in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::external_waiter> (waiter=..., t=0x7ffff5bda100, this=<optimized out>) at /data/cmsbld/jenkins/workspace/build-any-ib/w/BUILD/el9_amd64_gcc12/external/tbb/v2022.0.0-2c8b19d7f71a88d9ed1f550c4776837f/tbb-v2022.0.0/src/tbb/task_dispatcher.h:334
#8  tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::external_waiter> (waiter=..., t=<optimized out>, this=<optimized out>) at /data/cmsbld/jenkins/workspace/build-any-ib/w/BUILD/el9_amd64_gcc12/external/tbb/v2022.0.0-2c8b19d7f71a88d9ed1f550c4776837f/tbb-v2022.0.0/src/tbb/task_dispatcher.h:470
#9  tbb::detail::r1::task_dispatcher::execute_and_wait (t=<optimized out>, wait_ctx=..., w_ctx=...) at /data/cmsbld/jenkins/workspace/build-any-ib/w/BUILD/el9_amd64_gcc12/external/tbb/v2022.0.0-2c8b19d7f71a88d9ed1f550c4776837f/tbb-v2022.0.0/src/tbb/task_dispatcher.cpp:168
#10 0x00007ffff7d4728f in edm::FinalWaitingTask::wait() () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#11 0x00007ffff7d56dde in edm::EventProcessor::processRuns() () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#12 0x00007ffff7d50221 in edm::EventProcessor::runToCompletion() () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#13 0x000000000040857b in tbb::detail::d1::task_arena_function<main::{lambda()#1}::operator()() const::{lambda()#1}, void>::operator()() const ()
#14 0x00007ffff7a2af41 in tbb::detail::r1::task_arena_impl::execute (ta=..., d=...) at /data/cmsbld/jenkins/workspace/build-any-ib/w/BUILD/el9_amd64_gcc12/external/tbb/v2022.0.0-2c8b19d7f71a88d9ed1f550c4776837f/tbb-v2022.0.0/src/tbb/arena.cpp:821
#15 0x000000000040a293 in main::{lambda()#1}::operator()() const ()
#16 0x00000000004051b8 in main ()

The other sample file /store/mc/Phase2Spring24DIGIRECOMiniAOD/DisplacedSUSY_stopToBottom_M-800_50mm_TuneCP5_14TeV-pythia8/GEN-SIM-DIGI-RAW-MINIAOD/PU200_AllTP_140X_mcRun4_realistic_v4-v1/2810000/a487a58e-d61e-4d1d-962c-f7229c07b32c.root crashes with error message:

Begin processing the 922nd record. Run 1, Event 5922, LumiSection 6 on stream 0 at 11-Dec-2025 21:17:04.192 CET

A fatal system signal has occurred: segmentation violation
The following is the call stack containing the origin of the signal.

Current Modules:
Module: trklet::ProducerFakeDR:ProducerFakeDR (crashed)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions