GPU-accelerated Finite Element Method using Python and CUDA

GPU-accelerated Finite Element Method using Python and CUDA

This repository includes the work done within the course TRA105 - GPU-accelerated Computational Methods using Python and CUDA, held at Chalmers University. The main contributions are given by Stefano Ribes (ribes dot stefano at gmail dot com), who developed all the high performance code, Kim Louisa Auth (kim dot auth at chalmers dot se), who wrote an initial version of the FEM algorithm, and Fredrik Larsson (Fredrik dot Larsson at chalmers dot se), who supervised the project.

The Jupyter Notebook TRA105_GPU_accelerated_Computational_Methods_using_Python_and_CUDA.ipynb includes most of the project work and can be run in Google Colab.

In the first part of the notebook, we describe the FEM algorithm from a higher point of view. We then report a simple mechanism to generate "large enough" FEM problems. Then, we proceed to evaluate different solver strategies and select the best performing solver algorithm. At this point, we present three different implementation of the K-assembly step in the FEM algorithm. An additional K-assembly implementation based on cell coloring is proposed in the Appendix as a work in progress. In the end, we show an evaluation of the three different proposed K-assembly steps, before drawing our conclusions.

Methodology

The FEM algorithm is mainly divided in two phases: the assembly of stiffness matrix $K$ and the linear solver part.

K-Assembly

In this work, we proposed four different implementations for computing the assembly of the stiffness matrix:

Naïve CPU implementation
Batched CPU implementation via Numpy and Numba
Batched GPU implementation via CuPy
Custom CUDA kernel implementation via Numba

Solver Profiling

As we can see in the following figure, the minres solver is the best performing one, especially in the case of large matrix dimensions. We believe that the main reason for such result lies in the fact that minres is able to best leverage the symmetry and the sparseness of the stiffness matrix compared to the other solver algorithms.

Evaluation and Results

The tests have been conducted over precomputed grids of up to 5 million nodes. The measurements were collected on different devices, namely an Intel 8-cores Xeon processor, an Nvidia Tesla T4 and an Nvidia RTX 2080 Ti.

As expected, the GPU implementation is clearly superior in terms of performance, being as low as 3% of the CPU time and $8.2\times$ faster than CPU on average, with a peak of $27.2\times$.

Final Remarks

The aforementioned notebook contains a more in-depth description and analysis of the proposed designs. It also includes additional notes and future work proposals.

If you found this repository useful, please cite it via:

@software{GPU-accelerated-Finite-Element-Method-using-Python-and-CUDA,
  author = {Stefano Ribes, Kim Louisa Auth, Fredrik Larsson},
  title = {{GPU-accelerated Finite Element Method using Python and CUDA}},
  doi = {https://doi.org/10.5281/zenodo.7688742},
  url = {https://github.com/ribesstefano/GPU-accelerated-Finite-Element-Method-using-Python-and-CUDA.git},
  version = {1.0.0},
  year = {2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
.github/workflows		.github/workflows
BasicFEM		BasicFEM
data		data
test		test
LICENCE.md		LICENCE.md
README.md		README.md
TRA105 - Final Presentation.pdf		TRA105 - Final Presentation.pdf
TRA105 - Mid-Term Presentation.pdf		TRA105 - Mid-Term Presentation.pdf
TRA105 GPU-accelerated Computational Methods using Python and CUDA.pdf		TRA105 GPU-accelerated Computational Methods using Python and CUDA.pdf
TRA105_GPU_accelerated_Computational_Methods_using_Python_and_CUDA.ipynb		TRA105_GPU_accelerated_Computational_Methods_using_Python_and_CUDA.ipynb
TRA105_Sparse_Linear_Solver_Profiling.ipynb		TRA105_Sparse_Linear_Solver_Profiling.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

BasicFEM

BasicFEM

data

data

test

test

LICENCE.md

LICENCE.md

README.md

README.md

TRA105 - Final Presentation.pdf

TRA105 - Final Presentation.pdf

TRA105 - Mid-Term Presentation.pdf

TRA105 - Mid-Term Presentation.pdf

TRA105 GPU-accelerated Computational Methods using Python and CUDA.pdf

TRA105 GPU-accelerated Computational Methods using Python and CUDA.pdf

TRA105_GPU_accelerated_Computational_Methods_using_Python_and_CUDA.ipynb

TRA105_GPU_accelerated_Computational_Methods_using_Python_and_CUDA.ipynb

TRA105_Sparse_Linear_Solver_Profiling.ipynb

TRA105_Sparse_Linear_Solver_Profiling.ipynb

requirements.txt

requirements.txt

Repository files navigation

GPU-accelerated Finite Element Method using Python and CUDA

Methodology

K-Assembly

Solver Profiling

Evaluation and Results

Final Remarks

About

Releases 1

Packages

Languages

License

ribesstefano/GPU-accelerated-Finite-Element-Method-using-Python-and-CUDA

Folders and files

Latest commit

History

Repository files navigation

GPU-accelerated Finite Element Method using Python and CUDA

Methodology

K-Assembly

Solver Profiling

Evaluation and Results

Final Remarks

About

Topics

Resources

License

Stars

Watchers

Forks

Languages