Testing GPU solvers in SimPEG using CuPy and PyTorch

Typically, MKL Pardiso solver (via Pydiso) is the preffered solver for running simulations in SimPEG.

Large speedups are possible for certain matrix operations using GPUs (https://cupy.dev). I test whether solving Ax = b type problems involving matrix factorization of A and matrix-vector products encountered in geophysics are faster using GPUs. Here, LU solvers from CuPy and PyTorch are used to solve on GPUs, compared against Pardiso and default SimPEG solver which uses SciPy's LU routine.

Results:

For a problem AX=b with A matrix of size (28564, 28564) and b of size 108

Time to run simulation with MKL Pardiso solver: 2.631314992904663 seconds
Time to run simulation with Pytorch LU solver: 16.436639070510864 seconds
Time to run simulation with SimPEG (SciPy LU) solver: 11.663026332855225 seconds
Time to run simulation with CuPy LU solver: 1068.3546755313873 seconds

Pardiso outperforms alternatives here. While matrix-vector products are faster on GPUs, much of the slowdown for PyTorch can be attributed to extremely slow matrix factorization using GPUs. As such, PyTorch solver can be useful in cases where matrix A does not change and large vector b (large number of sources).

CuPy accounts for the slow matrix factorization on GPU by doing LU factorization on CPU using SciPy's LU routines and doing the matrix-vector products on GPUs. However, it was significantly slower in the tests. I attribute the slower performance of CuPy and Pytorch here to the fact that the CuPy and Pytorch solver operations are not yet implemented for sparse matrices [1,2] and thus it factorizes a dense matrix. On the other hand, Pardiso and SimPEG solvers use sparse CSR matrices. I am unsure about the reasons for the large discrepancy between the solve time for Pytorch and CuPy however.

Installation instructions

The install includes some monkey-patching of code, as this is intended to be a quick point-in-time test. Please note the folder paths below carefully :)

To run the project, follow these installation instructions:

Create a conda environment using the provided environment.yml file:
```
conda env create -f environment.yml
```
From simpeg_utils folder, copy files solver_utils_cupy.py and solver_utils_pytorch.py. Paste it in the installed SimPEG package by browsing to <your_conda_install_path>\envs\solvers-test\Lib\site-packages\SimPEG\utils.
From discretize_edits folder, copy the file matrix_utils_cupy.py and paste it in the installed Discretize package by browsing to <your_conda_install_path>\envs\solvers-test\Lib\site-packages\discretize\utils
From the main cupyx_edits folder copy _init_.py. Overwrite _init_.py in the Cupyx->scipy->sparse folder located at <your_conda_install_path>\envs\solvers-test\Lib\site-packages\cupyx\scipy\sparse.
In the cupyx-edits folder, browse to subfolder linalg and copy _init_.py. Overwrite _init_.py this in the linalg subfolder of your environment's Cupyx->scipy->sparse in <your_conda_install_path>\envs\cupy\Lib\site-packages\cupyx\scipy\sparse\linalg.

References

[1] pytorch/pytorch#69538

[2] https://discuss.pytorch.org/t/solving-ax-b-for-sparse-tensors-preferably-with-backward/102331

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
cupyx_edits		cupyx_edits
discretize_edits		discretize_edits
simpeg_utils		simpeg_utils
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
solvers_test_3d_dcr_v2.ipynb		solvers_test_3d_dcr_v2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Testing GPU solvers in SimPEG using CuPy and PyTorch

Results:

Installation instructions

References

About

Uh oh!

Releases

Packages

Languages

License

pokarparth/simpeg_gpu_solvers

Folders and files

Latest commit

History

Repository files navigation

Testing GPU solvers in SimPEG using CuPy and PyTorch

Results:

Installation instructions

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages