Description
I'm encountering a segmentation fault in trklet::ProducerFakeDR when running the HYBRID_NEWKF_DISPLACED_MERGE configuration on a couple of SUSY PU200 sample files.
One of these files is /store/mc/Phase2Spring24DIGIRECOMiniAOD/DisplacedSUSY_stopToBottom_M-800_50mm_TuneCP5_14TeV-pythia8/GEN-SIM-DIGI-RAW-MINIAOD/PU200_AllTP_140X_mcRun4_realistic_v4-v1/2810000/fa1a17e4-27ef-4bce-a570-b1e03fc7a155.root
The job crashes at Event 6899, but only when processing the full file. If I run Event 6899 or 7000 in isolation, each is processed successfully. This suggests the crash is caused by state or memory corruption carried over from a previous event.
This is the error message:
Begin processing the 901st record. Run 1, Event 6899, LumiSection 7 on stream 0 at 11-Dec-2025 21:16:38.219 CET
A fatal system signal has occurred: segmentation violation
The following is the call stack containing the origin of the signal.
Current Modules:
Module: trklet::ProducerFakeDR:ProducerFakeDR (crashed)
A fatal system signal has occurred: segmentation violation
Segmentation fault (core dumped)
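Since the crash only shows up when earlier events have been processed, it may help to know exactly how many records precede the failing one, so the job can be restarted progressively closer to it. The following is a minimal sketch, not part of the original job: it parses the "Begin processing" line from a log like the one above and reports the record index and the run:lumi:event triple. The function name `last_record` and the regular expression are illustrative assumptions based only on the log format shown here.

```python
import re

# Pattern for the framework's "Begin processing" line, as it appears in the log above.
# (Assumption: other jobs print the same format.)
RECORD_RE = re.compile(
    r"Begin processing the (\d+)(?:st|nd|rd|th) record\. "
    r"Run (\d+), Event (\d+), LumiSection (\d+)"
)


def last_record(log_text):
    """Return (record_index, run, lumi, event) for the last record started, or None."""
    matches = RECORD_RE.findall(log_text)
    if not matches:
        return None
    # The log prints Event before LumiSection; reorder to run, lumi, event.
    index, run, event, lumi = (int(x) for x in matches[-1])
    return index, run, lumi, event


log = (
    "Begin processing the 901st record. Run 1, Event 6899, LumiSection 7 "
    "on stream 0 at 11-Dec-2025 21:16:38.219 CET\n"
    "A fatal system signal has occurred: segmentation violation\n"
)

index, run, lumi, event = last_record(log)
print(f"crashed on record {index}: {run}:{lumi}:{event}")
print(f"records processed before the crash: {index - 1}")
```

The count of preceding records could then be used to bisect the starting point, for example through the input source's skipEvents setting, assuming the job reads the file with the standard PoolSource.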
Here is the gdb debug output:
Thread 1 "cmsRun" received signal SIGSEGV, Segmentation fault.
0x00007ffff7984fd3 in edm::RefCore::RefCore(edm::RefCore const&) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libDataFormatsCommon.so
#0 0x00007ffff7984fd3 in edm::RefCore::RefCore(edm::RefCore const&) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libDataFormatsCommon.so
#1 0x00007fffc7f4cdf6 in trklet::ProducerFakeDR::produce(edm::Event&, edm::EventSetup const&) () from /afs/cern.ch/user/a/alabdelh/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/pluginTrackFindingTrackletPlugins.so
#2 0x00007ffff7e5ef15 in edm::stream::EDProducerAdaptorBase::doEvent(edm::EventTransitionInfo const&, edm::ActivityRegistry*, edm::ModuleCallingContext const*) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#3 0x00007ffff7e4166c in edm::WorkerT<edm::stream::EDProducerAdaptorBase>::implDo(edm::EventTransitionInfo const&, edm::ModuleCallingContext const*) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#4 0x00007ffff7dc4f79 in std::__exception_ptr::exception_ptr edm::Worker::runModuleAfterAsyncPrefetch<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >(std::__exception_ptr::exception_ptr, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::TransitionInfoType const&, edm::StreamID, edm::ParentContext const&, edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1>::Context const*) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#5 0x00007ffff7dd1344 in edm::Worker::RunModuleTask<edm::OccurrenceTraits<edm::EventPrincipal, (edm::BranchActionType)1> >::execute() () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#6 0x00007ffff7f4f828 in tbb::detail::d2::function_task<edm::WaitingTaskList::announce()::{lambda()#1}>::execute(tbb::detail::d1::execution_data&) () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreConcurrency.so
#7 0x00007ffff7a3c84b in tbb::detail::r1::task_dispatcher::local_wait_for_all<false, tbb::detail::r1::external_waiter> (waiter=..., t=0x7ffff5bda100, this=<optimized out>) at /data/cmsbld/jenkins/workspace/build-any-ib/w/BUILD/el9_amd64_gcc12/external/tbb/v2022.0.0-2c8b19d7f71a88d9ed1f550c4776837f/tbb-v2022.0.0/src/tbb/task_dispatcher.h:334
#8 tbb::detail::r1::task_dispatcher::local_wait_for_all<tbb::detail::r1::external_waiter> (waiter=..., t=<optimized out>, this=<optimized out>) at /data/cmsbld/jenkins/workspace/build-any-ib/w/BUILD/el9_amd64_gcc12/external/tbb/v2022.0.0-2c8b19d7f71a88d9ed1f550c4776837f/tbb-v2022.0.0/src/tbb/task_dispatcher.h:470
#9 tbb::detail::r1::task_dispatcher::execute_and_wait (t=<optimized out>, wait_ctx=..., w_ctx=...) at /data/cmsbld/jenkins/workspace/build-any-ib/w/BUILD/el9_amd64_gcc12/external/tbb/v2022.0.0-2c8b19d7f71a88d9ed1f550c4776837f/tbb-v2022.0.0/src/tbb/task_dispatcher.cpp:168
#10 0x00007ffff7d4728f in edm::FinalWaitingTask::wait() () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#11 0x00007ffff7d56dde in edm::EventProcessor::processRuns() () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#12 0x00007ffff7d50221 in edm::EventProcessor::runToCompletion() () from /cvmfs/cms.cern.ch/el9_amd64_gcc12/cms/cmssw/CMSSW_15_1_0_pre4/lib/el9_amd64_gcc12/libFWCoreFramework.so
#13 0x000000000040857b in tbb::detail::d1::task_arena_function<main::{lambda()#1}::operator()() const::{lambda()#1}, void>::operator()() const ()
#14 0x00007ffff7a2af41 in tbb::detail::r1::task_arena_impl::execute (ta=..., d=...) at /data/cmsbld/jenkins/workspace/build-any-ib/w/BUILD/el9_amd64_gcc12/external/tbb/v2022.0.0-2c8b19d7f71a88d9ed1f550c4776837f/tbb-v2022.0.0/src/tbb/arena.cpp:821
#15 0x000000000040a293 in main::{lambda()#1}::operator()() const ()
#16 0x00000000004051b8 in main ()
The other sample file, /store/mc/Phase2Spring24DIGIRECOMiniAOD/DisplacedSUSY_stopToBottom_M-800_50mm_TuneCP5_14TeV-pythia8/GEN-SIM-DIGI-RAW-MINIAOD/PU200_AllTP_140X_mcRun4_realistic_v4-v1/2810000/a487a58e-d61e-4d1d-962c-f7229c07b32c.root, crashes in the same module with this error message:
Begin processing the 922nd record. Run 1, Event 5922, LumiSection 6 on stream 0 at 11-Dec-2025 21:17:04.192 CET
A fatal system signal has occurred: segmentation violation
The following is the call stack containing the origin of the signal.
Current Modules:
Module: trklet::ProducerFakeDR:ProducerFakeDR (crashed)