XPU and MPS take 3 #276

jwallwork23 · 2025-02-06T12:02:17Z

Closes #127.
Builds upon #125 and #209.
(Contains changes from #268 so that will need to be merged first.)

This PR adds support XPU and MPS. Unfortunately, it ended up requiring an overhaul of the pt2ts scripts, too.

Notable changes:

Switching from ENABLE_CUDA to the more general and extensible GPU_DEVICE=<NONE/CUDA/XPU/MPS>.
Pre-processor directives for handling different GPU types.
Support for XPU under GPU device code 12.
Support for MPS under GPU device code 13.
Use of argparse for reading command line arguments into Python scripts, rather than sys.argv.
Updates to docs.

Checklist

Test on 2 Nvidia GPUs
Test on 2 XPUs
Test on 1 MPS device

.github/workflows/static_analysis.yml

jwallwork23 · 2025-02-10T16:22:24Z

Offline testing for CUDA version of MultiGPU example with 2 devices passed on Ampere. In the queue for XPU testing on PVC.

jatkinson1000

Thanks @jwallwork23 I have only done a quick pass of this so far and will need to schedule time for a closer look, but I suspect you know my first comment - can you update the docs at utils/README etc. to reflect the new args and usage of pt2ts?

I would also like to see it documented somewhere how the device enums are managed from CMake. Perhaps under the developer docs. As, whilst a very nifty solution, it's slightly abstract if you are not the one who came up with it 😉

jatkinson1000 · 2025-02-11T12:00:17Z

Also looks like you may want a rebase after #268

jwallwork23 · 2025-02-17T09:05:32Z

Re-tested on Dawn with latest version of branch - all good

README.md

jatkinson1000

Thanks for driving all of this forward @jwallwork23 the devices make a fine addition to the collection.

I think we are on the edge of glory, but have a couple of suggestions/clarifications before approval.

More general ruminations on reviewing this:

I wonder if there is a better way to keep the pt2ts files in sync... (not to be resolved in this PR)

.github/workflows/static_analysis.yml

CMakeLists.txt

jatkinson1000 · 2025-02-19T08:19:47Z

conda/README.md

I'll need to update the mac conda to include this in #284

examples/1_SimpleNet/simplenet_infer_python.py

src/ftorch.fypp

src/ctorch.h

utils/README.md

examples/3_MultiGPU/multigpu_infer_python.py

examples/3_MultiGPU/CMakeLists.txt

Co-authored-by: Jack Atkinson <[email protected]>

jwallwork23 · 2025-02-19T13:27:06Z

More general ruminations on reviewing this:

* I wonder if there is a better way to keep the pt2ts files in sync... (not to be resolved in this PR)

Yeah agreed. Perhaps the script could be set up such that it doesn't need to be modified, although that might be a tall order.

jwallwork23 · 2025-02-19T18:53:30Z

Latest version now passes tests on Ampere. Awaiting XPU job on PVC.

jwallwork23 · 2025-02-20T12:58:43Z

Latest version now passes tests on Ampere. Awaiting XPU job on PVC.

Passed on PVC, too!

jatkinson1000 · 2025-02-20T17:27:10Z

Tests run as ~~expected~~ hoped on MPS.

jatkinson1000

I think I am now happy with everything here @jwallwork23 .
Ran all OK on Mac (conda, with MPS to run MultiGPU example) and you checked PVC, CUDA, and CPU.

Fantastic stuff.

You may merge when ready (I would lean towards a squash based on the commit history. The PR touches a lot of files, but generally fairly specific lines in each).

examples/3_MultiGPU/CMakeLists.txt

jwallwork23 · 2025-02-20T17:36:17Z

Amazing! Thanks for the thorough review @jatkinson1000 - will merge now.

* Add MacOS GPU device option * Add XPU device option * Update C++ XPU interface to handle multiple devices indices. * Update ftorch.F90 for XPU support * Make device enums consistent with PyTorch * Accept command line arguments in MultiGPU example * Introduce GPU_DEVICE preprocessor option * Update pt2ts scripts; use argparse over sys.argv * Update GPU docs * Update READMEs * Add explanation of GPU device codes in dev docs --------- Co-authored-by: ElliottKasoar <[email protected]> Co-authored-by: Jack Atkinson <[email protected]> Co-authored-by: Matt Archer <[email protected]> Co-authored-by: Jack Atkinson <[email protected]>

jwallwork23 added enhancement New feature or request gpu Related to buiding and running on GPU labels Feb 6, 2025

jwallwork23 self-assigned this Feb 6, 2025

jwallwork23 force-pushed the 127_xpu-take3 branch from 4aa9a4a to 55b929f Compare February 7, 2025 13:47

jwallwork23 commented Feb 10, 2025

View reviewed changes

.github/workflows/static_analysis.yml Show resolved Hide resolved

jwallwork23 marked this pull request as ready for review February 10, 2025 16:21

jwallwork23 requested review from jatkinson1000 and TomMelt February 10, 2025 16:21

jwallwork23 requested a review from ma595 February 10, 2025 16:22

This was referenced Feb 11, 2025

Add MPS and XPU devices #125

Closed

Add XPU support (duplicate #125) #209

Closed

jatkinson1000 requested changes Feb 11, 2025

View reviewed changes

ElliottKasoar and others added 16 commits February 11, 2025 12:09

Add MacOS GPU device option

f3a1914

Add XPU device option

42bcd61

Update C++ XPU interface to handle multiple devices indices.

7039084

Update ftorch.F90 for XPU support

2a30f6a

Make device enums consistent with PyTorch

f58fa92

Add ENABLE_XPU option for CMakeLists

0e5e56e

Move towards generalising device type in MultiGPU example

32a2b42

Accept command line arguments in MultiGPU example

1c47839

Account for MPS

9adf424

Account for XPU in MultiGPU README

1cb8cfd

Lint

999e1c8

Fix argparse syntax

47e4276

Pre-processing for different GPU devices

a422df3

Add mps to options for MultiGPU simplenet

017f36e

Introduce GPU_DEVICE preprocessor option

00e6a36

CMake lint

ba99847

Fix build path static analysis

8ff6be2

ElliottKasoar reviewed Feb 17, 2025

View reviewed changes

README.md Show resolved Hide resolved

Respond to @ElliotKasoar review

0897b20

jwallwork23 requested a review from ElliottKasoar February 18, 2025 10:36

jatkinson1000 reviewed Feb 19, 2025

View reviewed changes

jwallwork23 and others added 10 commits February 19, 2025 10:27

Apply suggestions from @jatkinson1000 code review

9b3db7d

Co-authored-by: Jack Atkinson <[email protected]>

Drop intel-specific compiler flag

b3ad229

Make README footnote more general

d87766b

CMake lint

1f4816f

Add note on passing device codes via pre-processor

0cb0208

Make CMake docs footnote more general

cf80689

Add filepath arg for multigpu_infer_python [skip ci]

0ce326d

Asserts for MultiGPU example

e5962af

MPS tests; consistent args ordering

03054de

Python lint

69a4f45

jwallwork23 mentioned this pull request Feb 19, 2025

JOSS paper submission #200

Merged

jwallwork23 requested a review from jatkinson1000 February 19, 2025 11:53

jwallwork23 added 3 commits February 19, 2025 16:04

Test fixes; drop unnecessary imports

7d1f098

Add missing filepath pass [skip ci]

29fc909

Fix expected value in Python, too [skip ci]

ac00fb3

jatkinson1000 approved these changes Feb 20, 2025

View reviewed changes

examples/3_MultiGPU/CMakeLists.txt Show resolved Hide resolved

jwallwork23 merged commit 720b067 into main Feb 20, 2025

jwallwork23 deleted the 127_xpu-take3 branch February 20, 2025 17:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

XPU and MPS take 3 #276

XPU and MPS take 3 #276

jwallwork23 commented Feb 6, 2025 •

edited

Loading

jwallwork23 commented Feb 10, 2025

jatkinson1000 left a comment •

edited

Loading

jatkinson1000 commented Feb 11, 2025

jwallwork23 commented Feb 17, 2025

jatkinson1000 left a comment

jatkinson1000 Feb 19, 2025

jwallwork23 commented Feb 19, 2025

jwallwork23 commented Feb 19, 2025

jwallwork23 commented Feb 20, 2025

jatkinson1000 commented Feb 20, 2025

jatkinson1000 left a comment

jwallwork23 commented Feb 20, 2025

XPU and MPS take 3 #276

XPU and MPS take 3 #276

Conversation

jwallwork23 commented Feb 6, 2025 • edited Loading

Checklist

jwallwork23 commented Feb 10, 2025

jatkinson1000 left a comment • edited Loading

Choose a reason for hiding this comment

jatkinson1000 commented Feb 11, 2025

jwallwork23 commented Feb 17, 2025

jatkinson1000 left a comment

Choose a reason for hiding this comment

jatkinson1000 Feb 19, 2025

Choose a reason for hiding this comment

jwallwork23 commented Feb 19, 2025

jwallwork23 commented Feb 19, 2025

jwallwork23 commented Feb 20, 2025

jatkinson1000 commented Feb 20, 2025

jatkinson1000 left a comment

Choose a reason for hiding this comment

jwallwork23 commented Feb 20, 2025

jwallwork23 commented Feb 6, 2025 •

edited

Loading

jatkinson1000 left a comment •

edited

Loading