Close

Presentation

A Nested Krylov Method Using Half-Precision Arithmetic
DescriptionLow-precision computing is essential for efficiently utilizing memory bandwidth and computing cores. While many mixed-precision algorithms have been developed for iterative sparse linear solvers, effectively leveraging half-precision (fp16) arithmetic remains challenging. This study introduces a novel nested Krylov approach that integrates the flexible GMRES and Richardson methods in a deeply nested structure, progressively reducing precision from double-precision to fp16 toward the innermost solver. To avoid meaningless computations beyond precision limits, the low-precision inner solvers perform only a few iterations per invocation, while the nested structure ensures their frequent execution. Numerical experiments show that incorporating fp16 into the approach directly enhances solver performance without compromising convergence, achieving speedups of up to 2.42 and 1.65 over double-precision and double-single mixed-precision implementations, respectively. Furthermore, the proposed method outperforms conventional mixed-precision Krylov solvers, CG, BiCGStab, and restarted FGMRES, by factors of up to 2.47, 2.74, and 69.10, respectively.