Add pT3 DNN to LST for Improved Fake Rejection #47995

GNiendorf · 2025-04-30T21:19:50Z

PR description:

This PR introduces an additional DNN to the LST codebase for better fake rejection of pT3 and pT5 objects. The DNN has a similar architecture to the other DNNs already present in LST: #47618 for T3s and #46857 for T5s. It uses six input features from the existing pT3 and pT5 cuts and applies an additional DNN-based cut to further reduce the LST fake rate. The reduction in fake rate is most pronounced at high pT in the default (pT > 0.8 GeV) configuration, as shown below. The DNN cut has a negligible impact on timing and efficiency.
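
As a rough illustration of what a DNN-based cut on six input features looks like, here is a minimal pure-Python sketch. The layer sizes, weights, feature values, and the 0.5 threshold are placeholders chosen for illustration, and all function names are hypothetical; the trained network and working points are the ones in the PR itself, not the ones below.

```python
import math


def relu(x):
    return x if x > 0.0 else 0.0


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dense(inputs, weights, biases, activation):
    """One fully connected layer: activation(W @ inputs + b)."""
    return [
        activation(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def dnn_score(features, hidden_w, hidden_b, out_w, out_b):
    """Forward pass of a one-hidden-layer network; returns a score in (0, 1)."""
    hidden = dense(features, hidden_w, hidden_b, relu)
    return dense(hidden, [out_w], [out_b], sigmoid)[0]


def passes_dnn_cut(features, params, threshold=0.5):
    """Keep the candidate only if its score exceeds the (placeholder) threshold."""
    return dnn_score(features, *params) > threshold


# Placeholder parameters: 6 inputs -> 2 hidden nodes -> 1 output.
PARAMS = (
    [[0.1] * 6, [-0.2] * 6],  # hidden weights (2 x 6)
    [0.0, 0.0],               # hidden biases
    [1.0, 1.0],               # output weights
    0.0,                      # output bias
)

candidate = [1.0, 0.5, 0.2, 0.0, 0.3, 1.0]  # six made-up, already-normalized features
score = dnn_score(candidate, *PARAMS)
keep = passes_dnn_cut(candidate, PARAMS)
```

Raising the threshold rejects more fakes at the cost of efficiency, which is why the working point is normally tuned rather than fixed at a single value.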

This PR also adds my training notebook to the codebase, in line with the other DNN notebooks already present.

A detailed summary of the improvements can be found here: PR_162.pdf

Other minor changes:

- The rz chi-squared value is now always computed, even for pT3 objects with pT > 5.0 GeV, to avoid overfitting on the previous default value of -1.
- Minor naming cleanups in NeuralNetwork.h.
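
The first of the minor changes above can be pictured as follows: a chi-squared is non-negative by construction, so a -1 placeholder sits outside the physical range, and a network trained on it can learn to key on the placeholder rather than on the fit quality. The sketch below contrasts the two behaviours; the function name, the residual and uncertainty inputs, and the simple sum-of-squares formula are assumptions for illustration, not the LST implementation.

```python
PT_SKIP_THRESHOLD = 5.0  # GeV, taken from the PR description
SENTINEL = -1.0          # previous default value, taken from the PR description


def rz_chi2_feature(residuals, sigmas, pt, always_compute=True):
    """Return a chi-squared-like feature, or the sentinel in the old behaviour.

    Hypothetical helper: sums (residual / sigma)^2 over the hits.
    """
    if not always_compute and pt > PT_SKIP_THRESHOLD:
        return SENTINEL
    return sum((r / s) ** 2 for r, s in zip(residuals, sigmas))


residuals = [0.02, -0.01, 0.03]  # made-up r-z residuals
sigmas = [0.01, 0.01, 0.02]      # made-up uncertainties

# Same high-pT candidate, old versus new treatment of the feature.
old_value = rz_chi2_feature(residuals, sigmas, pt=8.0, always_compute=False)
new_value = rz_chi2_feature(residuals, sigmas, pt=8.0, always_compute=True)
```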

PR validation:

This PR was tested on CPU and GPU in the standalone configuration and runs without issue.

cmsbuild · 2025-04-30T21:20:13Z

cms-bot internal usage

cmsbuild · 2025-04-30T21:21:07Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-47995/44663

There are other open Pull requests which might conflict with changes you have proposed:
- File RecoTracker/LSTCore/src/alpaka/PixelQuintuplet.h modified in PR(s): Migrate LST inputs to SoA collections #47793
- File RecoTracker/LSTCore/src/alpaka/PixelTriplet.h modified in PR(s): Migrate LST inputs to SoA collections #47793
- File RecoTracker/LSTCore/standalone/code/core/write_lst_ntuple.cc modified in PR(s): Migrate LST inputs to SoA collections #47793

cmsbuild · 2025-04-30T21:21:26Z

A new Pull Request was created by @GNiendorf for master.

It involves the following packages:

RecoTracker/LSTCore (reconstruction)

@cmsbuild, @jfernan2, @mandrenguyen can you please review it and eventually sign? Thanks.
@GiacomoSguazzoni, @VinInn, @VourMa, @dgulhan, @felicepantaleo, @gpetruc, @missirol, @mmusich, @mtosi, @rovere this is something you requested to watch as well.
@antoniovilela, @mandrenguyen, @rappoccio, @sextonkennedy you are the release manager for this.

cms-bot commands are listed here

slava77 · 2025-04-30T21:58:52Z

test parameters:

enable_tests = gpu
workflows_gpu = 29634.704,29834.704
workflows = 29634.703,29834.703,29834.755
relvals_opt = -w upgrade,standard
relvals_opt_gpu = -w upgrade,standard

slava77 · 2025-04-30T22:00:22Z

@cmsbuild please test

jfernan2 · 2025-05-01T08:54:02Z

assign heterogeneous

cmsbuild · 2025-05-01T09:17:48Z

New categories assigned: heterogeneous

@fwyzard,@makortel you have been requested to review this Pull request/Issue and eventually sign? Thanks

slava77 · 2025-05-01T13:43:29Z

@cmsbuild please test

it took 15 hours in the last attempt. I'm guessing something got stuck.

iarspider · 2025-05-01T14:10:50Z

@slava77 tests were not run overnight (java update, node that cms-bot uses needed to be reconnected by hand), plus rocm nodes were broken until about 5 minutes ago (thanks to @fwyzard for fixing them!)

fwyzard · 2025-05-01T14:11:17Z

> it took 15 hours in the last attempt. I'm guessing something got stuck.

Indeed, there has been both a general issue and a ROCm-specific issue. @iarspider has been following up on them, and both should be resolved now.
Now there should only be some backlog to go through.

fwyzard · 2025-05-01T14:12:07Z

(also, it's a holiday here today)

slava77 · 2025-05-01T14:19:07Z

> (also, it's a holiday here today)

sure, although I assume the right to not work does not apply to cms-bot; or does it?

fwyzard · 2025-05-01T14:24:49Z

well, it does apply to the people that maintain it.

cmsbuild · 2025-05-01T18:34:41Z

+1

Size: This PR adds an extra 16KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ba1e61/45805/summary.html
COMMIT: f1fd6d5
CMSSW: CMSSW_15_1_X_2025-05-01-1100/el8_amd64_gcc12
Additional Tests: CUDA,ROCM
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/47995/45805/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

You potentially removed 2 lines from the logs
Reco comparison results: 263 differences found in the comparisons
DQMHistoTests: Total files compared: 55
DQMHistoTests: Total histograms compared: 4355745
DQMHistoTests: Total failures: 6011
DQMHistoTests: Total nulls: 4
DQMHistoTests: Total successes: 4349710
DQMHistoTests: Total skipped: 20
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 54 files compared)
Checked 234 log files, 202 edm output root files, 55 DQM output files
TriggerResults: no differences found

CUDA Comparison Summary

Summary:

No significant changes to the logs found
Reco comparison results: 0 differences found in the comparisons
DQMHistoTests: Total files compared: 1
DQMHistoTests: Total histograms compared: 0
DQMHistoTests: Total failures: 0
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 0
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0 KiB( 0 files compared)
Checked 0 log files, 0 edm output root files, 1 DQM output files

ROCM Comparison Summary

Summary:

No significant changes to the logs found
Reco comparison results: 0 differences found in the comparisons
DQMHistoTests: Total files compared: 1
DQMHistoTests: Total histograms compared: 0
DQMHistoTests: Total failures: 0
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 0
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0 KiB( 0 files compared)
Checked 0 log files, 0 edm output root files, 1 DQM output files

fwyzard · 2025-05-07T11:48:01Z

+heterogeneous

slava77 · 2025-05-07T16:16:52Z

It can be reviewed as soon as @makortel or myself have the time to do it.

Thank you very much for the +1 already.

Unless it is particularly urgent and should skip in front of our "to do" list? If that is the case, please explain the urgency.

There are a few active PRs in the pipeline (it's a somewhat stable flow); adding more bubbles in the pipeline is somewhat disruptive. So, being able to integrate something straightforward without significant delays would be quite helpful.

We are actively addressing the to-do list (#47793 is clear and current evidence of that effort), but this is done in balance with the ongoing developments.
It would be nice if we can continue this way.

slava77 · 2025-05-08T14:24:53Z

@cmsbuild, please test

I see that almost 20 hours after the start request

cms/47995/el8_amd64_gcc12/relvals/cuda: Waiting for status to be reported (Waiting for tests to start)

Is there a problem with the CUDA testing infrastructure?

iarspider · 2025-05-08T16:01:15Z

I have retriggered CUDA relvals, now checking why they didn't get triggered.

cmsbuild · 2025-05-08T16:56:30Z

+1

Size: This PR adds an extra 20KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ba1e61/45896/summary.html
COMMIT: f1fd6d5
CMSSW: CMSSW_15_1_X_2025-05-06-2300/el8_amd64_gcc12
Additional Tests: CUDA,ROCM
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/47995/45896/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

You potentially removed 197 lines from the logs
Reco comparison results: 255 differences found in the comparisons
DQMHistoTests: Total files compared: 57
DQMHistoTests: Total histograms compared: 4405491
DQMHistoTests: Total failures: 6740
DQMHistoTests: Total nulls: 4
DQMHistoTests: Total successes: 4398727
DQMHistoTests: Total skipped: 20
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 56 files compared)
Checked 240 log files, 206 edm output root files, 57 DQM output files
TriggerResults: no differences found

CUDA Comparison Summary

Summary:

No significant changes to the logs found
Reco comparison results: 0 differences found in the comparisons
DQMHistoTests: Total files compared: 1
DQMHistoTests: Total histograms compared: 0
DQMHistoTests: Total failures: 0
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 0
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0 KiB( 0 files compared)
Checked 0 log files, 0 edm output root files, 1 DQM output files

ROCM Comparison Summary

Summary:

No significant changes to the logs found
Reco comparison results: 0 differences found in the comparisons
DQMHistoTests: Total files compared: 1
DQMHistoTests: Total histograms compared: 0
DQMHistoTests: Total failures: 0
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 0
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0 KiB( 0 files compared)
Checked 0 log files, 0 edm output root files, 1 DQM output files

slava77 · 2025-05-08T18:16:40Z

Comparison Summary

I'm not sure 29834.755 (LST building in HLT) https://tinyurl.com/2bw5h2l7 is expected.

The offline variant from 29834.703 looks more consistent with the expectation https://tinyurl.com/24oulncd (this is supposedly equivalent to the plot posted in the PR description)

slava77 · 2025-05-08T21:55:59Z

@mmusich @VourMa
is HLT:75e33_timing known to be reproducible (in particular in the context of 29834.755 wf)?

mmusich · 2025-05-08T22:06:45Z

> is HLT:75e33_timing known to be reproducible

In general, that one is.

> (in particular in the context of 29834.755 wf)?

I didn't check the ones gated by the LST modifiers, so I would not bet on that, though I don't have indications of the contrary.

slava77 · 2025-05-08T22:44:16Z

I looked at absolute entries in num_reco_pT and num_assoc(recoToSim)_pT, which are supposed to define the inputs to the fake rate, starting from the first bin in the snapshot above

            PR                      Reference
all     matched   fake      all     matched   fake
2224    2198      26        2204    2169      35
1916    1899      17        1901    1880      21
1421    1413      8         1415    1406      9
920     914       6         916     909       7
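
The per-bin fake rate implied by these counts can be checked with a few lines of Python. This is a sketch that assumes the usual definition, fake rate = 1 - num_assoc(recoToSim) / num_reco, and that the three columns in each group are the reco count, the matched count, and their difference; the function name is hypothetical.

```python
def fake_rate(n_reco, n_assoc):
    """Fraction of reconstructed tracks with no matched simulated track."""
    if n_reco <= 0:
        raise ValueError("n_reco must be positive")
    return 1.0 - n_assoc / n_reco


# (n_reco, n_assoc) per bin, copied from the table above.
pr_bins = [(2224, 2198), (1916, 1899), (1421, 1413), (920, 914)]
ref_bins = [(2204, 2169), (1901, 1880), (1415, 1406), (916, 909)]

for (pr_reco, pr_assoc), (ref_reco, ref_assoc) in zip(pr_bins, ref_bins):
    print(f"PR: {fake_rate(pr_reco, pr_assoc):.4f}  "
          f"reference: {fake_rate(ref_reco, ref_assoc):.4f}")
```

In each of the four bins the PR value comes out below the reference value, consistent with the expected reduction in fake rate.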

... Ah, it's the "Overlay+ratio" that's messed up. If I select just the overlay, the plots are as expected.

mmusich · 2025-05-09T06:08:43Z

+hlt

cmsbuild · 2025-05-09T06:09:06Z

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @rappoccio, @sextonkennedy, @mandrenguyen, @antoniovilela (and backports should be raised in the release meeting by the corresponding L2)

mandrenguyen · 2025-05-09T06:27:11Z

+1

Add pT3 DNN to LST

f1fd6d5

cmsbuild added this to the CMSSW_15_1_X milestone Apr 30, 2025

cmsbuild added reconstruction-pending pending-signatures tests-pending orp-pending code-checks-pending tracking labels Apr 30, 2025

cmsbuild added code-checks-approved and removed code-checks-pending labels Apr 30, 2025

cmsbuild added tests-started and removed tests-pending labels Apr 30, 2025

cmsbuild mentioned this pull request Apr 30, 2025

Migrate LST inputs to SoA collections #47793

Merged

cmsbuild added the heterogeneous-pending label May 1, 2025

cmsbuild added tests-approved and removed tests-started labels May 1, 2025

cmsbuild added tests-started and removed tests-approved labels May 7, 2025

cmsbuild added heterogeneous-approved and removed heterogeneous-pending labels May 7, 2025

cmsbuild added tests-approved and removed tests-started labels May 8, 2025

cmsbuild added hlt-approved fully-signed and removed hlt-pending pending-signatures labels May 9, 2025

cmsbuild added orp-approved and removed orp-pending labels May 9, 2025

cmsbuild merged commit 30f4d4d into cms-sw:master May 9, 2025
18 checks passed

cmsbuild mentioned this pull request May 9, 2025

[ROOT6] Updated root to tip of branch master cms-sw/cmsdist#9840

Merged

GNiendorf mentioned this pull request May 9, 2025

LST pixelSeeds Eta Name Correction #48048

Merged

This was referenced May 9, 2025

[Do not merge] testing root6 PR tests cms-sw/cmsdist#9842

Closed

[CUDART] Improved cuda-runtime package cms-sw/cmsdist#9828

Merged

GNiendorf mentioned this pull request Jun 4, 2025

Track Embeddings for Improved Duplicate Removal in LST #48249

Merged
