SC20 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Using Hierarchical Parallelism to Accelerate the Solution of Many Small Partial Differential Equations


Workshop:HiPar20: Workshop on Hierarchical Parallelism for Exascale Computing

Authors: Jacob Merson and Mark S. Shephard (Rensselaer Polytechnic Institute (RPI))


Abstract: This paper presents efforts to improve the hierarchical parallelism of a two scale simulation code. Two methods to improve the GPU parallel performance were developed and compared. The first used the Nvidia Multi-Process Service and the second moved the entire sub-problem loop into a single kernel using Kokkos hierarchical parallelism and a PackedView data structure. Both approaches improved parallel performance with the second method providing the greatest improvements.





Back to HiPar20: Workshop on Hierarchical Parallelism for Exascale Computing Archive Listing



Back to Full Workshop Archive Listing