First draft for the section on atomics #34

PaulGannay · 2025-09-22T13:37:46Z

No description provided.

pzehner

This is better with the tables, indeed. I think that the layout of the slides should be modified to separate the code from its execution.

pzehner · 2025-12-12T13:24:28Z

courses/02_intermediate/main.tex

+% Trainee could play with the following program to check that it really present a race condition:
+%#include <iostream>
+%#include <Kokkos_Core.hpp>
+%
+%int main(int argc, char *argv[]) {
+%  Kokkos::initialize(argc, argv); 
+%  {
+%    const int N = 10000;
+%    Kokkos::View<double*> v("v", N);
+%    Kokkos::deep_copy(v, 4);
+%
+%    Kokkos::View<double> res("res", N);
+%
+%    Kokkos::parallel_for(Kokkos::RangePolicy(0, N),
+%        KOKKOS_LAMBDA(int i) {
+%        //Kokkos::atomic_add(&res(), v(i));
+%        res() = res() + v(i);
+%        });
+%
+%    double res_;
+%
+%    deep_copy(res_, res);
+%
+%    std::cout << "res_:" << res_ << std::endl;
+%    std::cout << "4*N:" << 4*N << std::endl;
+%  }
+%  Kokkos::finalize();
+%}


It's a good idea for an exercise.

courses/02_intermediate/main.tex

pzehner · 2025-12-12T13:47:28Z

courses/02_intermediate/main.tex

+    \item they bypass and invalidate cache line.
+  \end{itemize}
+
+  => Atomics should be used with care and only when strictly necessary.\linebreak


Add that sometimes, developers should change an algorithm that depends on atomics.

This is especially true for algorithms that iterate over faces of a mesh, then over the two cells neighboring the face. This pattern is very common for unstructured CFD codes that run on CPU, because then you compute the flux between the two cells only once. This can be ported to GPU as is, but sometimes the best strategy is actually to rewrite the algorithm to iterate over the cells directly.

I think that you are right, but it deserves its own set of slides. I didn't had time today, I'll see later.
Thank you for the proposed example.

courses/02_intermediate/main.tex

pzehner · 2025-12-12T13:53:45Z

courses/02_intermediate/main.tex

+  For some of your needs, more performant alternative exist, like \texttt{parallel\_reduce} or \texttt{Kokkos::ScatterView}.
+\end{frame}


Beware, because on GPU a ScatterView will not give you any performance gain.

Following a discussion I had this morning, maybe it's better to not talk about ScatterView in this tutorial. It's still experimental and maybe less suited for nowadays CPUs. Especially, since it relies on data duplication on CPU, it may be counterproductive on CPUs with a very large number of threads (say, more than 100).

Signed-off-by: Paul Gannay <[email protected]>

Co-authored-by: Paul Zehner <[email protected]>

pzehner

I think it's almost done. Just some fixes.

pzehner · 2025-12-19T12:16:44Z

courses/02_intermediate/main.tex

+    \begin{column}{0.5\linewidth}
+      \begin{minted}{C++}
+        Kokkos::View<double*> histo(5);
+        Kokkos::deep_copy(histo, 0);


I don't think you need to manually initialize to 0, the view does it by default.

(Same remark for the other slides.)

Are you sure this is guaranteed and not a side effect of memory allocation?
I find the doc not very clear on this subject, all allocating constructor have this text:

The initialization is executed on the default instance of the execution space corresponding to memory_space and fences it.

but it doesn't explain what kind of initialisation takes place for default types.

I asked Adrien (he worked on View initialisation), and he confirmed that you are right, I will delete the extra deep_copy.

courses/02_intermediate/main.tex

pzehner · 2025-12-19T12:37:35Z

courses/02_intermediate/main.tex

+\colorlet{thread1}{gray!25}
+\colorlet{thread6}{example!25}


I would select two tones of gray instead

Suggested change

\colorlet{thread1}{gray!25}

\colorlet{thread6}{example!25}

\colorlet{thread1}{gray!20}

\colorlet{thread6}{gray!40}

Or plainly use colors:

Suggested change

\colorlet{thread1}{gray!25}

\colorlet{thread6}{example!25}

\colorlet{thread1}{lightalert}

\colorlet{thread6}{lightexample}

I initially tried with the different levels of gray but found it hard to read, especially on slide 30.

The light red + light blue looks nice in colour but is harder to differentiate in B&W.
I'll do the change, we'll revert it if you think readability in B&W is important.

courses/02_intermediate/main.tex

Co-authored-by: Paul Zehner <[email protected]>

PaulGannay force-pushed the atomics branch from 10fc972 to 78d3b6a Compare September 22, 2025 13:59

PaulGannay marked this pull request as draft December 10, 2025 15:33

pzehner reviewed Dec 12, 2025

View reviewed changes

pzehner mentioned this pull request Dec 18, 2025

Intermediate course #37

Open

PaulGannay and others added 7 commits December 19, 2025 10:43

First draft for section on atomics

b39e750

WIP

57d9e9f

Rework race condition section

1118350

Update the atomic section with remarks from review

202c636

Signed-off-by: Paul Gannay <[email protected]>

Apply suggestion from @pzehner

ec614b4

Co-authored-by: Paul Zehner <[email protected]>

Update with review comments

5a8b7b1

Add section on Atomic traits

a43f031

PaulGannay force-pushed the atomics branch from 7dcc936 to a43f031 Compare December 19, 2025 09:44

PaulGannay marked this pull request as ready for review December 19, 2025 09:44

pzehner requested changes Dec 19, 2025

View reviewed changes

PaulGannay and others added 4 commits December 19, 2025 14:12

Apply suggestions from code review

83b62b1

Co-authored-by: Paul Zehner <[email protected]>

Change colour coding for threads in array

069c84a

Apply review comments

2483079

Remove useless s

22b7c21

		For some of your needs, more performant alternative exist, like \texttt{parallel\_reduce} or \texttt{Kokkos::ScatterView}.
		\end{frame}

First draft for the section on atomics #34

Are you sure you want to change the base?

First draft for the section on atomics #34

Uh oh!

Conversation

PaulGannay commented Sep 22, 2025

Uh oh!

pzehner left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pzehner left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

PaulGannay Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

PaulGannay Dec 19, 2025 •

edited

Loading