Non-Uniform Partitioning for Collaborative Execution on Heterogeneous Architectures

Freytag, G.; Serpa, M. S.; Lima, J. V. F.; Rech, P.; Navaux, P. O. A.

doi:10.1109/SBAC-PAD.2019.00031

As the demand for computing power increases, new architectures emerge to deliver better performance. An important class of integrated devices is heterogeneous architectures, which combine different specialized hardware on a single chip, forming a System on Chip (SoC). In this context, effectively splitting tasks between the different architectures is essential to obtain efficiency and performance. In this work, we evaluate two heterogeneous architectures: one composed of a general-purpose CPU and a graphics processing unit (GPU) integrated into a single chip (AMD Kaveri SoC), and another composed of a general-purpose CPU and a Field Programmable Gate Array (FPGA) integrated into a single chip (Intel Arria 10 SoC). We investigate how data partitioning affects the performance of each device in a collaborative execution through the decomposition of the data domain. As a case study, we apply the technique to the well-known Lattice Boltzmann Method (LBM), analyzing the performance of five kernels on both architectures. Our experimental results show that non-uniform partitioning improves the performance of LBM kernels by up to 11.40% and 15.15% on AMD Kaveri and Intel Arria 10, respectively.
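The idea of non-uniform partitioning can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`split_rows`, `makespan`) and the throughput figures are hypothetical, and the sketch assumes that each device processes lattice rows at a constant, previously measured rate. Under that assumption, giving each device a share of the domain proportional to its throughput lets both devices finish at about the same time, whereas a uniform 50/50 split leaves the faster device idle.

```python
def split_rows(total_rows, throughput_a, throughput_b):
    """Split a row-decomposed domain between two devices.

    Each device receives a share proportional to its throughput
    (rows per second). All names and rates here are illustrative
    assumptions, not values taken from the paper.
    """
    if total_rows < 0 or throughput_a <= 0 or throughput_b <= 0:
        raise ValueError("rows must be >= 0 and throughputs must be > 0")
    share_a = throughput_a / (throughput_a + throughput_b)
    rows_a = round(total_rows * share_a)
    # The second device takes whatever remains, so no row is lost.
    return rows_a, total_rows - rows_a


def makespan(rows_a, rows_b, throughput_a, throughput_b):
    """Time of a collaborative step: the slower device sets the pace."""
    return max(rows_a / throughput_a, rows_b / throughput_b)


if __name__ == "__main__":
    # Hypothetical rates: device A is three times faster than device B.
    rate_a, rate_b = 300.0, 100.0
    rows = 1000

    uniform = (rows // 2, rows - rows // 2)
    non_uniform = split_rows(rows, rate_a, rate_b)

    print("uniform split:", uniform, makespan(*uniform, rate_a, rate_b))
    print("non-uniform split:", non_uniform, makespan(*non_uniform, rate_a, rate_b))
```

With these assumed rates, the proportional split assigns 750 rows to the faster device and 250 to the slower one. In practice the best ratio would depend on the kernel and on data-transfer costs, which this sketch ignores.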

Non-Uniform Partitioning for Collaborative Execution on Heterogeneous Architectures / Freytag, G.; Serpa, M. S.; Lima, J. V. F.; Rech, P.; Navaux, P. O. A. - (2019), pp. 128-135. (31st International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2019, Campo Grande, Brazil, 15-18 October 2019) [10.1109/SBAC-PAD.2019.00031].


2019-01-01



Year of publication: 2019
Proceedings title: 2019 31st International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)
Place of publication: New York, USA
Publisher: IEEE Computer Society
ISBN: 978-1-7281-4194-7
Scopus identifier: 2-s2.0-85076888672
WOS identifier: WOS:000521762000018
Publication type: 04.1 Paper in Proceedings

Files in this record:

File	Size	Format
Non-uniform_Partitioning_for_Collaborative_Execution_on_Heterogeneous_Architectures.pdf (access: archive managers only; description: IEEE Xplore conference paper; type: publisher's version, publisher's layout; license: all rights reserved)	302.83 kB	Adobe PDF