A Multi-GPU Aggregation-Based AMG Preconditioner for Iterative Linear Solvers

Bernaschi, M.; Celestini, A.; Vella, F.; D'Ambra, P.

doi:10.1109/TPDS.2023.3287238

We present and release in open source format a sparse linear solver which efficiently exploits heterogeneous parallel computers. The solver can be easily integrated into scientific applications that need to solve large and sparse linear systems on modern parallel computers made of hybrid nodes hosting Nvidia Graphics Processing Unit (GPU) accelerators. The work extends previous efforts of some of the authors in the exploitation of a single GPU accelerator and proposes an implementation, based on the hybrid MPI-CUDA software environment, of a Krylov-type linear solver relying on an efficient Algebraic MultiGrid (AMG) preconditioner already available in the BootCMatchG library. Our design for the hybrid implementation has been driven by the best practices for minimizing data communication overhead when multiple GPUs are employed, yet preserving the efficiency of the GPU kernels. Strong and weak scalability results of the new version of the library on well-known benchmark test cases are discussed. Comparisons with the Nvidia AmgX solution show a speedup, in the solve phase, up to 2.0x.

A Multi-GPU Aggregation-Based AMG Preconditioner for Iterative Linear Solvers / Bernaschi, M., Celestini, A., Vella, F., D'Ambra, P.. - In: IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS. - ISSN 1045-9219. - ELETTRONICO. - 34:8(2023), pp. 2365-2376. [10.1109/TPDS.2023.3287238]

A Multi-GPU Aggregation-Based AMG Preconditioner for Iterative Linear Solvers

Bernaschi, M.;Celestini, A.;Vella, F.;D'Ambra, P.

2023-01-01

Abstract

We present and release in open source format a sparse linear solver which efficiently exploits heterogeneous parallel computers. The solver can be easily integrated into scientific applications that need to solve large and sparse linear systems on modern parallel computers made of hybrid nodes hosting Nvidia Graphics Processing Unit (GPU) accelerators. The work extends previous efforts of some of the authors in the exploitation of a single GPU accelerator and proposes an implementation, based on the hybrid MPI-CUDA software environment, of a Krylov-type linear solver relying on an efficient Algebraic MultiGrid (AMG) preconditioner already available in the BootCMatchG library. Our design for the hybrid implementation has been driven by the best practices for minimizing data communication overhead when multiple GPUs are employed, yet preserving the efficiency of the GPU kernels. Strong and weak scalability results of the new version of the library on well-known benchmark test cases are discussed. Comparisons with the Nvidia AmgX solution show a speedup, in the solve phase, up to 2.0x.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2023
			
	Titolo del periodico (Journal title)
	
				IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS
			
	Numero e parte del fascicolo (Issue number and part)
	
				8
			
	DOI
	
				https://dx.doi.org/10.1109/TPDS.2023.3287238
			
	Codice Scopus (Scopus identifier)
	
				2-s2.0-85162904655
			
	Codice WOS (WOS identifier)
	
				WOS:001022028500002
			
	Tutti gli autori
	
						Bernaschi, M.; Celestini, A.; Vella, F.; D'Ambra, P.
					
	Citazione
	
				A Multi-GPU Aggregation-Based AMG Preconditioner for Iterative Linear Solvers / Bernaschi, M., Celestini, A., Vella, F., D'Ambra, P.. - In: IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS. - ISSN 1045-9219. - ELETTRONICO. - 34:8(2023), pp. 2365-2376. [10.1109/TPDS.2023.3287238]
			
	Appare nelle tipologie:
	
				03.1 Articolo su rivista (Journal article)

File in questo prodotto:

File	Dimensione	Formato
A_Multi-GPU_Aggregation-Based_AMG_Preconditioner_for_Iterative_Linear_Solvers.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 983.89 kB Formato Adobe PDF Visualizza/Apri	983.89 kB	Adobe PDF	Visualizza/Apri
Multi_GPU_version_of_an_aggregation_based_Algebraic_MultiGrid_preconditioner_for_iterative_linear_solvers.pdf Open Access dal 02/08/2025 Descrizione: Accepted Manuscript Tipologia: Post-print referato (Refereed author’s manuscript) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 587.94 kB Formato Adobe PDF Visualizza/Apri	587.94 kB	Adobe PDF	Visualizza/Apri