Close

Presentation

Explicit Low-Order Finite-Element Wave Simulation Accelerated with Variable-Precision Computing Using INT8 Tensor Cores
DescriptionUsing low-precision cores for acceleration of PDE-based simulations with sparse or small matrices is often challenging due to the frequent data conversion between high- and low-precision variables, and that the required precision varies in time/space due to the heterogeneity of the target problem. As an example of accelerating such PDE-based simulations, we develop an integer-based variable-precision computing method with low data-conversion costs for low-order explicit finite-element wave propagation simulations. Here, the precision level used for solving the problem is chosen locally to attain simulation accuracy, and is accelerated using INT8 Tensor Cores. This leads to a 3.3-fold speedup from a baseline FP64 CUDA-core-based implementation with equivalent simulation accuracy, with 87% weak efficiency up to 256 compute nodes of the GH200-based Miyabi supercomputer. These ideas are expected to be useful for accelerating other PDE-based problems with sparse or small matrices on computer architectures with high-performance, low-precision cores.