TY - GEN
T1 - A Study of Performance Portability in Plasma Physics Simulations
AU - Ruzicka, Josef
AU - Asch, Christian
AU - Meneses, Esteban
AU - Rampp, Markus
AU - Laure, Erwin
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - The high-performance computing (HPC) community has recently seen a substantial diversification of hardware platforms and their associated programming models. From traditional multicore processors to highly specialized accelerators, vendors and tool developers back up the relentless progress of those architectures. In the context of scientific programming, it is fundamental to consider performance portability frameworks, i.e., software tools that allow programmers to write code once and run it on different computer architectures without sacrificing performance. We report here on the benefits and challenges of performance portability using a field-line tracing simulation and a particle-in-cell code, two relevant applications in computational plasma physics with applications to magnetically-confined nuclear-fusion energy research. For these applications we report performance results obtained on four HPC platforms with server-class CPUs from Intel (Xeon) and AMD (EPYC), and high-end GPUs from Nvidia and AMD, including the latest Nvidia H100 GPU and the novel AMD Instinct MI300A APU. Our results show that both Kokkos and OpenMP are powerful tools to achieve performance portability and decent “out-of-the-box” performance, even for the very latest hardware platforms. For our applications, Kokkos provided performance portability to the broadest range of hardware architectures from different vendors.
AB - The high-performance computing (HPC) community has recently seen a substantial diversification of hardware platforms and their associated programming models. From traditional multicore processors to highly specialized accelerators, vendors and tool developers back up the relentless progress of those architectures. In the context of scientific programming, it is fundamental to consider performance portability frameworks, i.e., software tools that allow programmers to write code once and run it on different computer architectures without sacrificing performance. We report here on the benefits and challenges of performance portability using a field-line tracing simulation and a particle-in-cell code, two relevant applications in computational plasma physics with applications to magnetically-confined nuclear-fusion energy research. For these applications we report performance results obtained on four HPC platforms with server-class CPUs from Intel (Xeon) and AMD (EPYC), and high-end GPUs from Nvidia and AMD, including the latest Nvidia H100 GPU and the novel AMD Instinct MI300A APU. Our results show that both Kokkos and OpenMP are powerful tools to achieve performance portability and decent “out-of-the-box” performance, even for the very latest hardware platforms. For our applications, Kokkos provided performance portability to the broadest range of hardware architectures from different vendors.
KW - Parallel Programming
KW - Performance Portability
KW - Plasma Physics
UR - http://www.scopus.com/inward/record.url?scp=85219195978&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-80084-9_2
DO - 10.1007/978-3-031-80084-9_2
M3 - Contribución a la conferencia
AN - SCOPUS:85219195978
SN - 9783031800832
T3 - Communications in Computer and Information Science
SP - 19
EP - 35
BT - High Performance Computing - 11th Latin American High Performance Computing Conference, CARLA 2024, Revised Selected Papers
A2 - Guerrero, Ginés
A2 - San Martín, Jaime
A2 - Meneses, Esteban
A2 - Barrios Hernández, Carlos Jaime
A2 - Osthoff, Carla
A2 - Monsalve Diaz, Jose M.
PB - Springer Science and Business Media Deutschland GmbH
T2 - 11th Latin American High Performance Computing Conference, CARLA 2024
Y2 - 30 September 2024 through 4 October 2024
ER -