Blocking Sparse Matrices to Leverage Dense-Specific Multiplication

Research on accelerating matrix multiplication, driven by the growing computational demands of deep learning, has produced many efficient architectural solutions, such as NVIDIA's Tensor Cores. These accelerators are designed to efficiently process a high volume of small dense matrix products in parallel. However, it is not obvious how to leverage them for sparse matrix multiplication. A natural way to adapt the accelerators to this problem is to divide the matrix into small blocks and then multiply only the nonzero blocks. In this paper, we investigate ways to reorder the rows of a sparse matrix to reduce the number of nonzero blocks and cluster the nonzero elements into a few dense blocks. While this pre-processing can be computationally expensive, we show that the high speed-up provided by the accelerators can easily repay the cost, especially when several multiplications follow one reordering.
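
The blocking idea in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: the function names (`nonzero_blocks`, `blocked_matmul`, `group_rows_by_pattern`) are hypothetical, dense NumPy arrays stand in for a real sparse format and for the accelerator kernels, and the reordering shown is only a simple heuristic (placing rows with identical block-column patterns next to each other), assumed here for illustration.

```python
import numpy as np

def nonzero_blocks(A, bs):
    """Return the (block_row, block_col) indices of bs x bs blocks holding a nonzero."""
    rows, cols = np.nonzero(A)
    return set(zip((rows // bs).tolist(), (cols // bs).tolist()))

def blocked_matmul(A, B, bs):
    """Compute A @ B while visiting only the nonzero bs x bs blocks of A."""
    C = np.zeros((A.shape[0], B.shape[1]), dtype=np.result_type(A, B))
    for bi, bj in nonzero_blocks(A, bs):
        r, c = bi * bs, bj * bs
        # Each small dense product is the kind of work an accelerator handles well.
        C[r:r + bs] += A[r:r + bs, c:c + bs] @ B[c:c + bs]
    return C

def group_rows_by_pattern(A, bs):
    """Illustrative heuristic (an assumption, not the paper's method):
    order rows so that rows touching the same block columns are adjacent."""
    n_block_cols = -(-A.shape[1] // bs)  # ceiling division
    pattern = np.zeros((A.shape[0], n_block_cols), dtype=bool)
    rows, cols = np.nonzero(A)
    pattern[rows, cols // bs] = True
    order = sorted(range(A.shape[0]), key=lambda i: tuple(pattern[i].tolist()))
    return np.array(order)

# Rows 0 and 2 touch block column 0; rows 1 and 3 touch block column 1.
A = np.array([[1, 2, 0, 0],
              [0, 0, 3, 4],
              [5, 6, 0, 0],
              [0, 0, 7, 8]], dtype=float)
B = np.arange(12, dtype=float).reshape(4, 3)
bs = 2

order = group_rows_by_pattern(A, bs)
blocks_before = len(nonzero_blocks(A, bs))
blocks_after = len(nonzero_blocks(A[order], bs))

# Multiply the reordered matrix, then scatter rows back to their original positions.
C = np.empty((A.shape[0], B.shape[1]))
C[order] = blocked_matmul(A[order], B, bs)
```

In this toy case the interleaved rows spread the nonzeros over four 2x2 blocks, while the reordered matrix needs only two fully dense blocks. Since the permutation is computed once, its cost can be shared across every later multiplication with the same matrix, which is the trade-off the abstract describes.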

Blocking Sparse Matrices to Leverage Dense-Specific Multiplication / Labini, P. S.; Bernaschi, M.; Nutt, W.; Silvestri, F.; Vella, F. - Electronic. - (2022), pp. 19-24. (Paper presented at the 2022 Workshop on Irregular Applications: Architectures and Algorithms, IA3 2022, held in Dallas, TX, USA, 13-18 November 2022) [10.1109/IA356718.2022.00009].

Bernaschi M.; Nutt W.; Silvestri F.; Vella F. (last author)

2022-01-01

Date of publication: 2022
Proceedings title: Proceedings of IA3 2022: Workshop on Irregular Applications: Architectures and Algorithms, Held in conjunction with SC 2022: The International Conference for High Performance Computing, Networking, Storage and Analysis
Place of publication: 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, CA 90720-1264 USA
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN: 978-1-6654-7506-8
Scopus identifier: 2-s2.0-85148075284
WOS identifier: WOS:000965062100003
All authors: Labini, P. S.; Bernaschi, M.; Nutt, W.; Silvestri, F.; Vella, F.
Appears in categories: 04.1 Paper in Proceedings

Files in this record:

File	Access	Type	License	Size	Format
Blocking_Sparse_Matrices_to_Leverage_Dense-Specific_Multiplication.pdf	Archive managers only	Publisher's layout	All rights reserved	847.12 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/372887
