Skip to content

Conversation

@schneiml
Copy link
Contributor

@schneiml schneiml commented Jun 4, 2020

PR description:

This PR removes (some?) of the remaining modules blocking concurrent lumisections in production jobs.

I ran a random 136 relval with concurrent lumis enabled and got these modules reported:

  RPCDcsInfo rpcDcsInfo
  DetStatus ALCARECOTkAlMinBiasDCSFilter
  EcalDQMonitorTask ecalMonitorTask
  ESIntegrityTask ecalPreshowerIntegrityTask
  L1TStage2CaloLayer1 l1tStage2CaloLayer1
  AlcaBeamMonitor AlcaBeamMonitor

(Update: DetStatus is gone (thanks @mmusich !), and after #30187, EcalDQMonitorTask is gone as well. ESIntegrityTask remains, however.)

DetStatus is not DQM related, I removed the others from the sequences. There might be more modules that are not used in this specific workflow.

Overall, some of these modules are quite important and we don't want to merge this PR. But it gives an idea of what these modules do and allows to try actually enabling concurrent lumis.

(@silviodonato update:
RPCDcsInfo rpcDcsInfo updated with #31056
ALCARECOTkAlMinBiasDCSFilter -> updated with #30124
)

PR validation:

None, so far; not supposed to be merged for now.

@cmsbuild
Copy link
Contributor

cmsbuild commented Jun 4, 2020

The code-checks are being triggered in jenkins.

@cmsbuild
Copy link
Contributor

cmsbuild commented Jun 4, 2020

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-30111/15875

@schneiml
Copy link
Contributor Author

schneiml commented Jun 4, 2020

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Jun 4, 2020

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-run-pr-tests/6823/console Started: 2020/06/04 19:31

@cmsbuild
Copy link
Contributor

cmsbuild commented Jun 4, 2020

A new Pull Request was created by @schneiml (Marcel Schneider) for master.

It involves the following packages:

DQM/EcalPreshowerMonitorModule
DQM/L1TMonitor
DQM/RPCMonitorClient
DQMOffline/Configuration
DQMOffline/Ecal

@andrius-k, @kmaeshima, @schneiml, @cmsbuild, @jfernan2, @fioriNTU can you please review it and eventually sign? Thanks.
@rchatter, @acimmino, @argiro, @threus, @rociovilar this is something you requested to watch as well.
@silviodonato, @dpiparo you are the release manager for this.

cms-bot commands are listed here

@cmsbuild
Copy link
Contributor

cmsbuild commented Jun 4, 2020

+1
Tested at: 0dfe42c
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-73693c/6823/summary.html
CMSSW: CMSSW_11_2_X_2020-06-04-1100
SCRAM_ARCH: slc7_amd64_gcc820

@cmsbuild
Copy link
Contributor

cmsbuild commented Jun 4, 2020

Comparison job queued.

@cmsbuild
Copy link
Contributor

cmsbuild commented Jun 4, 2020

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-73693c/6823/summary.html

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 36
  • DQMHistoTests: Total histograms compared: 2753347
  • DQMHistoTests: Total failures: 8321
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2745004
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: -294786.719 KiB( 35 files compared)
  • DQMHistoSizes: changed ( 1000.0,... ): -2837.981 KiB EcalBarrel/EBTimingTask
  • DQMHistoSizes: changed ( 1000.0,... ): -1683.506 KiB EcalEndcap/EETimingTask
  • DQMHistoSizes: changed ( 1000.0,... ): -1166.814 KiB EcalBarrel/EBOccupancyTask
  • DQMHistoSizes: changed ( 1000.0,... ): -1092.231 KiB EcalBarrel/EBIntegrityTask
  • DQMHistoSizes: changed ( 1000.0,... ): -1081.266 KiB EcalBarrel/EBPedestalOnlineTask
  • DQMHistoSizes: changed ( 1000.0,... ): -705.107 KiB EcalEndcap/EETriggerTowerTask
  • DQMHistoSizes: changed ( 1000.0,... ): -628.273 KiB EcalEndcap/EEOccupancyTask
  • DQMHistoSizes: changed ( 1000.0,... ): -459.070 KiB EcalEndcap/EEPedestalOnlineTask
  • DQMHistoSizes: changed ( 1000.0,... ): -441.843 KiB EcalEndcap/EEIntegrityTask
  • DQMHistoSizes: changed ( 1000.0,... ): -340.997 KiB EcalEndcap/EESummaryClient
  • DQMHistoSizes: changed ( 1000.0 ): ...
  • Checked 152 log files, 16 edm output root files, 36 DQM output files

@silviodonato
Copy link
Contributor

@schneiml shall we include this in CMSSW_11_2_0_pre1?

@schneiml
Copy link
Contributor Author

schneiml commented Jun 9, 2020

@silviodonato no, we need at least Ecal properly fixed before this can go in (@tanmaymudholkar ?)

Then, we also need a proper fix or removal for DetStatus ALCARECOTkAlMinBiasDCSFilter before this can even be used for experiments with concurrent lumis.

@schneiml
Copy link
Contributor Author

schneiml commented Jun 9, 2020

hold

@cmsbuild
Copy link
Contributor

cmsbuild commented Jun 9, 2020

Pull request has been put on hold by @schneiml
They need to issue an unhold command to remove the hold state or L1 can unhold it for all

@cmsbuild cmsbuild added the hold label Jun 9, 2020
@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2020

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-73693c/9087/summary.html

Comparison Summary:

  • You potentially added 5922 lines to the logs
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 35
  • DQMHistoTests: Total histograms compared: 2602902
  • DQMHistoTests: Total failures: 505
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2602375
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: -111013.462 KiB( 34 files compared)
  • DQMHistoSizes: changed ( 1000.0,... ): -540.738 KiB EcalBarrel/EBPedestalOnlineTask
  • DQMHistoSizes: changed ( 1000.0,... ): -540.000 KiB EcalBarrel/EBTimingTask
  • DQMHistoSizes: changed ( 1000.0,... ): -290.679 KiB EcalPreshower/ESIntegrityTask
  • DQMHistoSizes: changed ( 1000.0,... ): -285.020 KiB EcalBarrel/EBSummaryClient
  • DQMHistoSizes: changed ( 1000.0,... ): -251.787 KiB EcalEndcap/EESummaryClient
  • DQMHistoSizes: changed ( 1000.0,... ): -243.489 KiB EcalBarrel/EBIntegrityTask
  • DQMHistoSizes: changed ( 1000.0,... ): -243.270 KiB EcalBarrel/EBIntegrityClient
  • DQMHistoSizes: changed ( 1000.0,... ): -229.588 KiB EcalEndcap/EEPedestalOnlineTask
  • DQMHistoSizes: changed ( 1000.0,... ): -229.219 KiB EcalEndcap/EETimingTask
  • DQMHistoSizes: changed ( 1000.0,... ): -162.824 KiB EcalEndcap/EETriggerTowerTask
  • DQMHistoSizes: changed ( 1000.0 ): ...
  • Checked 149 log files, 22 edm output root files, 35 DQM output files

@silviodonato
Copy link
Contributor

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 4, 2020

The tests are being triggered in jenkins.

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 4, 2020

+1
Tested at: 6893487
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-73693c/9121/summary.html
CMSSW: CMSSW_11_2_X_2020-09-03-2300
SCRAM_ARCH: slc7_amd64_gcc820

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 4, 2020

Comparison job queued.

@silviodonato
Copy link
Contributor

If I understand correctly,
RPCDcsInfo rpcDcsInfo has been updated with #31056
ALCARECOTkAlMinBiasDCSFilter has been updated with #30124
AlcaBeamMonitor will be updated with #31354.

This means that after having merged #31354 the list of the offending modules will be

  EcalDQMonitorTask ecalMonitorTask
  ESIntegrityTask ecalPreshowerIntegrityTask
  L1TStage2CaloLayer1 l1tStage2CaloLayer1

After having merged #31354, I would suggest @cms-sw/dqm-l2 to update this PR, re-adding the modules that have been migrated.

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 4, 2020

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-73693c/9121/summary.html

Comparison Summary:

  • You potentially added 5942 lines to the logs
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 35
  • DQMHistoTests: Total histograms compared: 2602902
  • DQMHistoTests: Total failures: 505
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2602375
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: -111013.462 KiB( 34 files compared)
  • DQMHistoSizes: changed ( 1000.0,... ): -540.738 KiB EcalBarrel/EBPedestalOnlineTask
  • DQMHistoSizes: changed ( 1000.0,... ): -540.000 KiB EcalBarrel/EBTimingTask
  • DQMHistoSizes: changed ( 1000.0,... ): -290.679 KiB EcalPreshower/ESIntegrityTask
  • DQMHistoSizes: changed ( 1000.0,... ): -285.020 KiB EcalBarrel/EBSummaryClient
  • DQMHistoSizes: changed ( 1000.0,... ): -251.787 KiB EcalEndcap/EESummaryClient
  • DQMHistoSizes: changed ( 1000.0,... ): -243.489 KiB EcalBarrel/EBIntegrityTask
  • DQMHistoSizes: changed ( 1000.0,... ): -243.270 KiB EcalBarrel/EBIntegrityClient
  • DQMHistoSizes: changed ( 1000.0,... ): -229.588 KiB EcalEndcap/EEPedestalOnlineTask
  • DQMHistoSizes: changed ( 1000.0,... ): -229.219 KiB EcalEndcap/EETimingTask
  • DQMHistoSizes: changed ( 1000.0,... ): -162.824 KiB EcalEndcap/EETriggerTowerTask
  • DQMHistoSizes: changed ( 1000.0 ): ...
  • Checked 149 log files, 22 edm output root files, 35 DQM output files

@makortel
Copy link
Contributor

makortel commented Sep 4, 2020

AlcaBeamMonitor will be updated with #31354.

It seems to me that wrt. concurrent lumis AlcaBeamMonitor was updated in #31267 (#31354 only fixes a crash in case no lumis were processed).

@gennai
Copy link
Contributor

gennai commented Sep 4, 2020

yes, at least I think I have fixed it (well I have followed the guidelines maybe I made something wrong?)

@silviodonato
Copy link
Contributor

AlcaBeamMonitor will be updated with #31354.

It seems to me that wrt. concurrent lumis AlcaBeamMonitor was updated in #31267 (#31354 only fixes a crash in case no lumis were processed).

yes, thanks.
@cms-sw/dqm-l2 could you update this pull request removing only

  EcalDQMonitorTask ecalMonitorTask
  ESIntegrityTask ecalPreshowerIntegrityTask
  L1TStage2CaloLayer1 l1tStage2CaloLayer1

?

@jfernan2
Copy link
Contributor

jfernan2 commented Sep 5, 2020

@silviodonato the owner of this PR is no longer in CMS (@schneiml ), so I understand I cannot modify this PR. I may create a new equivalent on Monday once I come back from Holidays if it is still on time. Thanks

@silviodonato
Copy link
Contributor

@silviodonato the owner of this PR is no longer in CMS (@schneiml ), so I understand I cannot modify this PR. I may create a new equivalent on Monday once I come back from Holidays if it is still on time. Thanks

Yes, sure. Thanks @jfernan2 !

@jfernan2 jfernan2 mentioned this pull request Sep 5, 2020
@jfernan2
Copy link
Contributor

jfernan2 commented Sep 5, 2020

@silviodonato I managed to get a stable connection and made the PR, so you can close this one.
#31369

@silviodonato
Copy link
Contributor

@jfernan2 thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

8 participants