LU Y C, YUAN Y X, YANG D C, et al. Tensor Core accelerated conjugate gradient solver on GPUsJ. Journal of University of Electronic Science and Technology of China, 2026, 55(2): 244-251. DOI: 10.12178/1001-0548.2024358
Citation: LU Y C, YUAN Y X, YANG D C, et al. Tensor Core accelerated conjugate gradient solver on GPUsJ. Journal of University of Electronic Science and Technology of China, 2026, 55(2): 244-251. DOI: 10.12178/1001-0548.2024358

Tensor Core accelerated conjugate gradient solver on GPUs

  • Conjugate gradient (CG) and biconjugate gradient stabilized (BiCGSTAB) are two classical and efficient iterative methods for solving sparse linear systems, widely used in scientific computing and engineering applications. Although GPUs and other parallel processors have enhanced the parallelism of these methods, the latest hardware unit, Tensor Core, and its computing power have not yet been fully exploited for these two methods. This work proposes a Tensor Core-accelerated CG solver that leverages Tensor Cores for the key components in the CG and BiCGSTAB methods, such as sparse matrix-vector multiplication (SpMV) and dot product computation, thereby exploiting the computational capability of Tensor Cores to improve the overall performance of both methods. Experimental results on NVIDIA A100 and H100 GPUs demonstrate that both of these methods accelerated by Tensor Core achieve significant speedups over the baseline version that uses the CUDA official library on various sparse matrices.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return