Skip to content

Conversation

@fwyzard
Copy link
Contributor

@fwyzard fwyzard commented Feb 2, 2023

Update the version of alpaka to the HEAD of the develop branch as of 2023.02.01, corresponding to the commit alpaka-group/alpaka@a68c866cc6c .

Major changes:

  • remove the functions to pin/unpin an existing buffer; host memory buffers pinned/mapped for a particular accelerator platform can be obtained with allocMappedBuf(...);
  • implement accelerator tags, to list all available accelerators;
  • drop the Boost.Fiber back-end;
  • add support for newer CUDA and ROCm back-ends;
  • add optional support for mdspan

Other changes:

  • add deduction guide for Vec;
  • add element-wise_min and max functions;
  • make CUDA/ROCm mangled kernel names as short as possible;
  • add math hyperbolic functions;
  • refactor ConcurrentExecPool and QueueGenericThreadsNonBlocking.
  • implement trait constants.

Add support for the HIP/ROCm alpaka back-end.

Update the version of alpaka to the HEAD of the develop branch as of 2023.02.01,
corresponding to the commit a68c866cc6c .

Major changes:
  - remove the functions to pin/unpin an existing buffer; host memory
    buffers pinnd/mapped for a particular accelerator platform can be
    obtained with `allocMappedBuf(...)`;
  - implement accelerator tags, to list all available acceerators;
  - drop the Boost.Fiber back-end;
  - add support for newer CUDA and ROCm backends;
  - add optional support for mdspan

Other changes:
  - add deduction guide for `Vec`;
  - add elementwise_min and max functions;
  - make CUDA/ROCm mangled kernel names as short as possible;
  - add math hyperbolic functions;
  - refactor `ConcurrentExecPool` and `QueueGenericThreadsNonBlocking`.
  - implement trait constants.
@cmsbuild
Copy link
Contributor

cmsbuild commented Feb 2, 2023

A new Pull Request was created by @fwyzard (Andrea Bocci) for branch IB/CMSSW_13_0_X/master.

@cmsbuild, @smuzaffar, @aandvalenzuela, @iarspider can you please review it and eventually sign? Thanks.
@perrotta, @dpiparo, @rappoccio you are the release manager for this.
cms-bot commands are listed here

@fwyzard
Copy link
Contributor Author

fwyzard commented Feb 2, 2023

please test with cms-sw/cmssw-config#91, cms-sw/cmssw#40678

<flags CXXFLAGS="-DALPAKA_ACC_GPU_HIP_ENABLED -DALPAKA_HOST_ONLY"/>
<flags GENREFLEX_CPPFLAGS="-DALPAKA_ACC_GPU_HIP_ENABLED -DALPAKA_HOST_ONLY"/>
<!-- device comiplation should run without ALPAKA_HOST_ONLY defined -->
<flags ROCM_FLAGS="-DALPAKA_ACC_GPU_HIP_ENABLED -UALPAKA_HOST_ONLY"/>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As CXXFLAGS already has -DALPAKA_ACC_GPU_HIP_ENABLED so it is not needed here

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, you're right.

However, I've copied this from the CUDA case, and I would prefer to clean them all up together in a separate PR.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

-UALPAKA_HOST_ONLY should work too but if you prefer then you can use

<flags REM_ROCM_HOST_CXXFLAGS="-DALPAKA_HOST_ONLY"/>

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Following what you told me the other day, I was going to move the flags to CPPDEFINES - but I guess we do not have a REM_CPPDEFINES or REM_ROCM_HOST_CPPDEFINES functionality ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So I think we can stick to CXXFLAGS and use REM_CUDA_HOST_CXXFLAGS/REM_ROCM_HOST_CXXFLAGS .

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes that is correct, currently there is no REM_ROCM_HOST_CPPDEFINES , so lets stick to CXXFLAGS

@cmsbuild
Copy link
Contributor

cmsbuild commented Feb 3, 2023

-1

Failed Tests: RelVals-INPUT
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d1ec0d/30366/summary.html
COMMIT: 7620e37
CMSSW: CMSSW_13_0_X_2023-02-02-1100/el8_amd64_gcc11
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/8284/30366/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d1ec0d/30366/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d1ec0d/30366/git-merge-result

RelVals-INPUT

The relvals timed out after 4 hours.

Comparison Summary

Summary:

  • You potentially added 12 lines to the logs
  • Reco comparison results: 8 differences found in the comparisons
  • DQMHistoTests: Total files compared: 49
  • DQMHistoTests: Total histograms compared: 3555495
  • DQMHistoTests: Total failures: 6
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3555467
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
  • Checked 213 log files, 164 edm output root files, 49 DQM output files
  • TriggerResults: no differences found

@smuzaffar
Copy link
Contributor

@fwyzard , input test failure are not related to this change. Do you want to add more changes here otherwise I am happy to include it for next IB.

@fwyzard
Copy link
Contributor Author

fwyzard commented Feb 3, 2023

@smuzaffar sorry, I missed the message earlier.

This PR needs to go in together with cms-sw/cmssw#40678; it would be great if both could enter in time for tonight's IB.

@perrotta @rappoccio FYI.

@cmsbuild
Copy link
Contributor

cmsbuild commented Feb 3, 2023

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d1ec0d/30366/summary.html
COMMIT: 7620e37
CMSSW: CMSSW_13_0_X_2023-02-02-1100/el8_amd64_gcc11
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/8284/30366/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d1ec0d/30366/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d1ec0d/30366/git-merge-result

Comparison Summary

Summary:

  • You potentially added 12 lines to the logs
  • Reco comparison results: 8 differences found in the comparisons
  • DQMHistoTests: Total files compared: 49
  • DQMHistoTests: Total histograms compared: 3555495
  • DQMHistoTests: Total failures: 6
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3555467
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
  • Checked 213 log files, 164 edm output root files, 49 DQM output files
  • TriggerResults: no differences found

@perrotta
Copy link
Contributor

perrotta commented Feb 5, 2023

+1

@perrotta
Copy link
Contributor

perrotta commented Feb 5, 2023

merge

@cmsbuild cmsbuild merged commit 70e0c51 into cms-sw:IB/CMSSW_13_0_X/master Feb 5, 2023
@fwyzard fwyzard deleted the IB/CMSSW_13_0_X/master_alpaka_ROCm branch March 3, 2023 07:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants