@silviodonato
Contributor

@silviodonato silviodonato commented Nov 10, 2025

PR description:

DQM Offline for Scouting was added by #47235.
It was activated in PromptReco at T0 only for ScoutingPFMonitor dataset by dmwm/T0#5053, specifically by adding @hltScouting to dqm_sequences in ProdOfflineConfiguration.py.
As discussed at the trigger review, it would be useful to have these DQM plots also in RelVals -- and also in AlCaVal -- to monitor the performance of HLT/Scouting objects.

Note: Currently the Scouting DQM requires scouting objects to be present in the event (otherwise it crashes). This PR changes the JME scouting DQM (jetMETDQMOfflineSourceScouting) to use a Task. This way the crashing module (ak4PFScoutingL1FastjetCorrector) runs only when requested by the JME scouting module (jetDQMAnalyzerSequenceScouting), i.e. when scouting objects exist, which avoids the crash (cc @etzovara).
Note-2: egmGsfElectronIDsForScoutingDQM will use gedGsfElectrons instead of slimmedElectrons to be compatible with RECO without AOD (cc @tihsu99)
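
The difference between a scheduled module and a module placed in a Task can be illustrated with a small toy model in plain Python. This is only a sketch of the on-demand idea, not CMSSW code: `ToyEvent`, `corrector`, `jet_analyzer`, `run_scheduled` and `run_with_task` are hypothetical names, and the sketch assumes the analyzer checks for scouting jets before asking for the corrector.

```python
from typing import Callable, Dict, List


class ToyEvent:
    """Holds products; missing ones are produced on demand by task modules."""

    def __init__(self, products: Dict[str, object],
                 task: Dict[str, Callable[["ToyEvent"], object]]):
        self.products = dict(products)
        self.task = task
        self.ran: List[str] = []  # labels of task modules that actually ran

    def get(self, label: str):
        if label not in self.products:
            if label not in self.task:
                raise KeyError(f"ProductNotFound: {label}")
            # On-demand execution: the module runs only because a consumer asked.
            self.ran.append(label)
            self.products[label] = self.task[label](self)
        return self.products[label]


def corrector(event: ToyEvent):
    # Hypothetical stand-in for ak4PFScoutingL1FastjetCorrector:
    # it fails when the scouting jets are absent.
    jets = event.get("scoutingJets")
    return [1.0 for _ in jets]


def jet_analyzer(event: ToyEvent) -> str:
    # Hypothetical stand-in for the JME scouting analyzer.
    # Assumption: it skips events that carry no scouting jets.
    if "scoutingJets" not in event.products:
        return "skipped"
    corrections = event.get("corrector")
    return f"analyzed {len(corrections)} jets"


def run_scheduled(event: ToyEvent) -> str:
    # Sequence-like behaviour: the corrector always runs before the analyzer.
    event.products["corrector"] = corrector(event)
    return jet_analyzer(event)


def run_with_task(event: ToyEvent) -> str:
    # Task-like behaviour: only the analyzer is scheduled.
    return jet_analyzer(event)
```

In this model, `run_scheduled` raises on an event without scouting jets because the corrector runs unconditionally, while `run_with_task` never triggers the corrector for such an event.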

This PR simply adds scouting DQM to the trigger DQM sequence.

PR validation:

I'm having some problems with the validation (#49331). I would like to check that all corner cases are covered (in particular I suspect that a re-reco of a non-scouting dataset might have problems)

If this PR is a backport please specify the original PR and why you need to backport that PR. If this PR will be backported please specify to which release cycle the backport is meant for:

To be defined

(previous attempts #49350 #49336)

FYI @patinkaew @cms-sw/hlt-l2

@cmsbuild
Contributor

cmsbuild commented Nov 10, 2025

cms-bot internal usage

@cmsbuild
Contributor

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-49358/46748

@cmsbuild
Contributor

A new Pull Request was created by @silviodonato for master.

It involves the following packages:

  • DQMOffline/Configuration (dqm)
  • DQMOffline/JetMET (dqm)
  • HLTriggerOffline/Scouting (dqm)

@cmsbuild, @ctarricone, @gabrielmscampos, @nothingface0, @rseidita can you please review it and eventually sign? Thanks.
@ahinzmann, @gkasieczka, @jdamgov, @jdolen, @mariadalfonso, @missirol, @mmarionncern, @mmusich, @mtosi, @nhanvtran, @rappoccio, @rociovilar, @schoef, @seemasharmafnal, @threus this is something you requested to watch as well.
@ftenchini, @mandrenguyen, @sextonkennedy you are the release manager for this.

cms-bot commands are listed here

@silviodonato
Contributor Author

please test

@cmsbuild
Contributor

-1

Failed Tests: RelVals AddOn
Size: This PR adds an extra 28KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-255040/49379/summary.html
COMMIT: 239e63f
CMSSW: CMSSW_16_0_X_2025-11-10-1100/el8_amd64_gcc13
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/49358/49379/install.sh to create a dev area with all the needed externals and cmssw changes.

Failed RelVals

----- Begin Fatal Exception 10-Nov-2025 18:13:54 CET-----------------------
An exception of category 'InvalidReference' occurred while
   [0] Processing  Event run: 379416 lumi: 1 event: 508034 stream: 0
   [1] Running path 'dqmoffline_step'
   [2] Calling method for module ScoutingEGammaCollectionMonitoring/'ScoutingEGammaCollectionMonitoringOffline'
Exception Message:
ValueMap: no associated value for given product and index
----- End Fatal Exception -------------------------------------------------
----- Begin Fatal Exception 10-Nov-2025 18:16:10 CET-----------------------
An exception of category 'InvalidReference' occurred while
   [0] Processing  Event run: 386478 lumi: 17 event: 22456638 stream: 0
   [1] Running path 'dqmoffline_step'
   [2] Calling method for module ScoutingEGammaCollectionMonitoring/'ScoutingEGammaCollectionMonitoringOffline'
Exception Message:
ValueMap: no associated value for given product and index
----- End Fatal Exception -------------------------------------------------
----- Begin Fatal Exception 10-Nov-2025 18:18:14 CET-----------------------
An exception of category 'InvalidReference' occurred while
   [0] Processing  Event run: 383811 lumi: 28 event: 14800950 stream: 0
   [1] Running path 'dqmoffline_step'
   [2] Calling method for module ScoutingEGammaCollectionMonitoring/'ScoutingEGammaCollectionMonitoringOffline'
Exception Message:
ValueMap: no associated value for given product and index
----- End Fatal Exception -------------------------------------------------
Expand to see more relval errors ...
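
The error text suggests a lookup in a value map using a reference into a collection the map was not filled for. The thread does not say which collections were involved, so the toy model below uses placeholder IDs. It is a plain-Python sketch of a map addressed by (product, index), not the `edm::ValueMap` implementation; `ToyValueMap` and `InvalidReference` are hypothetical names, and only the first error message is copied from the log above.

```python
class InvalidReference(Exception):
    """Toy stand-in for the exception category seen in the log."""


class ToyValueMap:
    """Values stored per source collection, addressed by (product_id, index)."""

    def __init__(self):
        self._ranges = {}  # product_id -> (offset, size) into self._values
        self._values = []

    def insert(self, product_id, values):
        values = list(values)
        self._ranges[product_id] = (len(self._values), len(values))
        self._values.extend(values)

    def get(self, product_id, index):
        if product_id not in self._ranges:
            # The map was never filled for this collection.
            raise InvalidReference(
                "ValueMap: no associated value for given product and index")
        offset, size = self._ranges[product_id]
        if not 0 <= index < size:
            raise InvalidReference("toy model: index outside the filled range")
        return self._values[offset + index]
```

Filling the map for `"collectionA"` and then calling `get("collectionB", 0)` raises `InvalidReference`, even if both collections hold the same physics objects.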

Failed AddOn Tests

UNKNOWN

@cmsbuild
Contributor

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-49358/46751

@cmsbuild
Contributor

Pull request #49358 was updated. @cmsbuild, @ctarricone, @gabrielmscampos, @nothingface0, @rseidita can you please check and sign again.

@silviodonato
Contributor Author

please test

@cmsbuild
Contributor

+1

Size: This PR adds an extra 40KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-255040/49390/summary.html
COMMIT: c4165fe
CMSSW: CMSSW_16_0_X_2025-11-10-1100/el8_amd64_gcc13
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/49358/49390/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • You potentially added 10472 lines to the logs
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 66 differences found in the comparisons
  • Reco comparison had 2 failed jobs
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3939953
  • DQMHistoTests: Total failures: 41
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3939892
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 41908.93 KiB( 50 files compared)
  • DQMHistoSizes: changed ( 1000.0,... ): 2993.495 KiB HLT/ScoutingOffline
  • Checked 218 log files, 188 edm output root files, 51 DQM output files
  • TriggerResults: no differences found

@cmsbuild
Contributor

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-49358/46804

@cmsbuild
Contributor

Pull request #49358 was updated. @cmsbuild, @ctarricone, @gabrielmscampos, @nothingface0, @rseidita can you please check and sign again.

@silviodonato
Contributor Author

please test

(thanks Marco for spotting this bug!)

@cmsbuild
Contributor

+1

Size: This PR adds an extra 28KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-255040/49432/summary.html
COMMIT: 330283e
CMSSW: CMSSW_16_0_X_2025-11-12-2300/el8_amd64_gcc13
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/49358/49432/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • You potentially added 900 lines to the logs
  • Reco comparison results: 10 differences found in the comparisons
  • Reco comparison had 2 failed jobs
  • DQMHistoTests: Total files compared: 51
  • DQMHistoTests: Total histograms compared: 3880468
  • DQMHistoTests: Total failures: 7
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3880441
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 11973.98 KiB( 50 files compared)
  • DQMHistoSizes: changed ( 16834.0,... ): 2993.495 KiB HLT/ScoutingOffline
  • Checked 218 log files, 188 edm output root files, 51 DQM output files
  • TriggerResults: no differences found

@silviodonato
Contributor Author

  • I finally see the new plots at least for 2025_0010001.
  • It adds 904 lines to 2025.0000001_RunZeroBias2025B_10k because the scouting objects were unexpectedly missing (data from Emergency column with high randoms)
  • No crashes
  • I opened #49377 ("mayConsume runs prefetching") to understand how to run the scouting jets corrector in DQM only when the scouting object is present

@mmusich @cms-sw/dqm-l2 do you have comments?

@mmusich
Contributor

mmusich commented Nov 13, 2025

> It adds 904 lines to 2025.0000001_RunZeroBias2025B_10k because the scouting objects were unexpectedly missing (data from Emergency column with high randoms)

I guess we need PdmV to choose a better LS here (not sure if the data is available at CERN on disk).
I am not a huge fan of the duplicated sequence, but I guess it's OK for the time being, while we look for a better long term solution.
(incidentally this PR doesn't need hlt signature any longer, so if @cms-sw/dqm-l2 are OK I am also in favor of merging).

@gabrielmscampos
Member

+dqm

@cmsbuild
Contributor

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @sextonkennedy, @mandrenguyen, @ftenchini (and backports should be raised in the release meeting by the corresponding L2)

@mandrenguyen
Contributor

+1

@cmsbuild cmsbuild merged commit 319b2f4 into cms-sw:master Nov 14, 2025
10 checks passed

@Martin-Grunewald
Contributor

It seems this PR creates an error in the HLT-Validation test for HIon:

----- Begin Fatal Exception 15-Nov-2025 05:40:02 CET-----------------------
An exception of category 'ProductNotFound' occurred while
   [0] Processing  Event run: 362321 lumi: 231 event: 111758081 stream: 1
   [1] Running path 'dqmoffline_step'
   [2] Calling method for module VersionedGsfElectronIdProducer/'egmGsfElectronIDsForScoutingDQM'
Exception Message:
Principal::getByToken: Found zero products matching all criteria
Looking for a container with elements of type: reco::GsfElectron
Looking for module label: slimmedElectrons
Looking for productInstanceName: 

   Additional Info:
      [a] If you wish to continue processing events after a ProductNotFound exception,
add "TryToContinue = cms.untracked.vstring('ProductNotFound')" to the "options" PSet in the configuration.

----- End Fatal Exception -------------------------------------------------

See here https://cmssdt.cern.ch/SDT/jenkins-artifacts/HLT-Validation/CMSSW_16_0_X_2025-11-15-1100/el8_amd64_gcc13/RelVal_RECO_HIon_DATA.log

@silviodonato
Contributor Author

Hi @Martin-Grunewald ,
I'm trying to debug. How can I reproduce the crash?

cd HLTrigger/Configuration/test
./cmsDriver.csh HIon
cmsRun RelVal_HLT_HIon_DATA.py

?
Which RelVal_Raw_HIon_DATA.root shall I use?

@Martin-Grunewald
Contributor

The IB tests: HLTrigger/Configuration/test/cmsDriver.csh ib

@silviodonato
Contributor Author

silviodonato commented Nov 16, 2025

I added a fix in #49387 (comment)

By the way, in general the crash will occur in any workflow that runs RECO and DQM without a PAT step in between, if the scouting DQM is activated.
I noticed that in the matrix we always have RECO,PAT for standard pp collisions (see matrix), while we don't have PAT for cosmics and HIon.
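
The condition described here can be written as a small check on a step string. This is a hedged sketch, not part of cmsDriver or the matrix: `PRODUCED_BY` and `missing_inputs` are hypothetical names, and the mapping of each electron collection to a step (gedGsfElectrons from RECO, slimmedElectrons from PAT) is an assumption of the sketch.

```python
# Assumption: the step that first makes each electron collection available.
PRODUCED_BY = {
    "gedGsfElectrons": "RECO",
    "slimmedElectrons": "PAT",
}


def missing_inputs(steps, required):
    """Return the required collections whose producing step is absent before DQM.

    `steps` is a comma-separated step string such as "RAW2DIGI,RECO,PAT,DQM";
    any ":..." specifier after a step name is ignored.
    """
    names = [step.split(":")[0] for step in steps.split(",")]
    if "DQM" not in names:
        return []  # no DQM step, nothing to check
    before_dqm = names[:names.index("DQM")]
    return [coll for coll in required if PRODUCED_BY[coll] not in before_dqm]
```

For example, `missing_inputs("RAW2DIGI,RECO,DQM", ["slimmedElectrons"])` reports `slimmedElectrons` as missing, while the same call with `"RAW2DIGI,RECO,PAT,DQM"` returns an empty list.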

@mmusich
Contributor

mmusich commented Dec 15, 2025

> I guess we need PdmV to choose a better LS here (not sure if the data is available at CERN on disk).

I opened #49627 to keep track of this.
