Remove runtime recursion from find_ntuplets() #239

makortel · 2021-10-15T01:02:22Z

Ports cms-sw/cmssw#35473 and cms-sw/cmssw#35542 to all versions other than hip (which was done in #233).

Originally by Vincenzo Innocente in cms-sw/cmssw#35473. Also contains cms-sw/cmssw#35542 by Andrea Bocci.

makortel · 2021-10-15T01:07:02Z

src/alpaka/plugin-PixelTriplets/alpaka/GPUCACell.h

+      if constexpr (DEPTH == 0) {
+        printf("ERROR: GPUCACell::find_ntuplets reached full depth!\n");
+        ALPAKA_ASSERT_OFFLOAD(false);
+      } else {


@fwyzard @VinInn For Alpaka I went with this instead of the specialization for DEPTH == 0 because partial specializations of function templates are not allowed (the specialization would be "partial" because of the additional T_Acc template argument). In a quick test the throughput improvement is of a similar order (3-4 %) as in cuda without the caching/async allocator and in kokkos (both of which use the specialization as in the original PR). If you have any better suggestions, let me know.
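To illustrate the pattern described above, here is a minimal sketch of terminating a compile-time recursion with `if constexpr` when the function has an extra template parameter. It is not the actual `find_ntuplets()` code: `FakeAcc`, `kNoNeighbour` and `chain_length` are hypothetical names made up for this example, `FakeAcc` only stands in for the T_Acc argument, and a plain `assert` is used where the real code calls `ALPAKA_ASSERT_OFFLOAD`.

```cpp
#include <cassert>
#include <cstdio>

// Hypothetical stand-in for an accelerator type (plays the role of T_Acc).
struct FakeAcc {};

// Hypothetical marker meaning "this node has no outer neighbour".
constexpr int kNoNeighbour = -1;

// Counts the nodes in a chain, following next[node] until kNoNeighbour.
// DEPTH is the number of recursion levels still allowed; it is known at
// compile time, so the compiler can see that the recursion is bounded.
template <typename TAcc, int DEPTH>
int chain_length(TAcc const& acc, int const* next, int node) {
  if constexpr (DEPTH == 0) {
    // Terminal case: this branch contains no recursive call, so no
    // chain_length<TAcc, -1> is ever instantiated.
    std::printf("ERROR: chain_length reached full depth!\n");
    assert(false);
    return 0;
  } else {
    if (node == kNoNeighbour) {
      return 0;
    }
    // The recursive call is only instantiated when DEPTH > 0.
    return 1 + chain_length<TAcc, DEPTH - 1>(acc, next, next[node]);
  }
}
```

A call such as `chain_length<FakeAcc, 6>(acc, next, 0)` instantiates the function for DEPTH 6 down to 0 and nothing below. The terminal case sits next to the recursive case in the same function body, and no specialization is needed even though `TAcc` stays a free template parameter.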

After seeing it, I actually like the approach with if constexpr better than the one with the specialisation for 0, as it keeps things more localised.

We should check that it works well also for the native CUDA and HIP cases.

Would it be ok for you to do that in a subsequent PR?

fwyzard · 2021-10-15T15:48:48Z

Sure.

makortel · 2021-10-15T15:51:18Z

I opened an issue as a reminder about that: #240.

VinInn · 2021-10-16T08:42:14Z

I consider "partial specializations of functions are not allowed" a defect in Alpaka.
The use of "if constexpr" instead of template specialization to terminate recursion is a more general coding pattern, far more reaching than the case at hand.

fwyzard · 2021-10-16T08:52:28Z

> I consider "partial specializations of functions are not allowed" a defect in Alpaka.

Well, maybe a defect of C++ :-/ ?
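For context, the restriction is indeed in the language: C++ allows full (explicit) specialization of function templates, but partial specialization only for class templates (and variable templates). Below is a minimal sketch of the usual workaround, which delegates to a partially specialized class template. The names `AccTag`, `Countdown` and `countdown` are hypothetical and exist only for this illustration.

```cpp
#include <cassert>

// Hypothetical stand-in for an accelerator type (plays the role of T_Acc).
struct AccTag {};

// Not valid C++: a function template cannot be partially specialized.
//   template <typename TAcc>
//   int countdown<TAcc, 0>(TAcc const&) { return 0; }

// Workaround: move the body into a class template, which can be
// partially specialized on DEPTH while TAcc stays generic.
template <typename TAcc, int DEPTH>
struct Countdown {
  static int run(TAcc const& acc) {
    return 1 + Countdown<TAcc, DEPTH - 1>::run(acc);
  }
};

// Partial specialization that terminates the recursion at DEPTH == 0.
template <typename TAcc>
struct Countdown<TAcc, 0> {
  static int run(TAcc const&) { return 0; }
};

// Thin function wrapper so that callers keep the function-style interface.
template <typename TAcc, int DEPTH>
int countdown(TAcc const& acc) {
  return Countdown<TAcc, DEPTH>::run(acc);
}
```

This keeps the specialization-based termination, at the cost of a helper struct and a wrapper per function, which is part of why the `if constexpr` form tends to be more compact.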

VinInn · 2021-10-16T09:09:17Z

Let's rephrase: the need to specifically template ALL functions with T_Acc, in an intrusive fashion, is a serious issue in Alpaka.

fwyzard · 2021-10-16T10:44:21Z

We don't need to follow the same approach in our code, but I don't think a template parameter is a bad choice.
For example, after working on #241 I think that a template parameter is easier to deal with than a different namespace, especially if we don't want to duplicate every symbol, object, etc.

makortel added 7 commits October 12, 2021 11:46

[cuda] Remove runtime recursion from find_ntuplets()

a939bf6

Originally by Vincenzo Innocente in cms-sw/cmssw#35473 Contains also cms-sw/cmssw#35542 by Andrea Bocci

[cudacompat] Remove runtime recursion from find_ntuplets()

f4208aa

Originally by Vincenzo Innocente in cms-sw/cmssw#35473 Contains also cms-sw/cmssw#35542 by Andrea Bocci

[cudadev] Remove runtime recursion from find_ntuplets()

99c9b17

Originally by Vincenzo Innocente in cms-sw/cmssw#35473 Contains also cms-sw/cmssw#35542 by Andrea Bocci

[cudauvm] Remove runtime recursion from find_ntuplets()

d2ba743

Originally by Vincenzo Innocente in cms-sw/cmssw#35473 Contains also cms-sw/cmssw#35542 by Andrea Bocci

[serial] Remove runtime recursion from find_ntuplets()

601d7e0

Originally by Vincenzo Innocente in cms-sw/cmssw#35473 Contains also cms-sw/cmssw#35542 by Andrea Bocci

[alpaka] Remove runtime recursion from find_ntuplets()

eab6472

Originally by Vincenzo Innocente in cms-sw/cmssw#35473 Contains also cms-sw/cmssw#35542 by Andrea Bocci

[kokkos] Remove runtime recursion from find_ntuplets()

70e7919

Originally by Vincenzo Innocente in cms-sw/cmssw#35473 Contains also cms-sw/cmssw#35542 by Andrea Bocci

makortel added kokkos alpaka cudacompat cuda serial labels Oct 15, 2021

makortel mentioned this pull request Oct 15, 2021

Try out 'if constexpr' instead of specialization in the compile-time recursion of CA #240

Open

makortel merged commit 25ac585 into cms-patatrack:master Oct 15, 2021
