Identifying the Most Reliable Collaborative Workload Distribution in Heterogeneous Devices


The constant need for higher performance and reduced power consumption has led vendors to design heterogeneous devices that embed a traditional CPU and an accelerator, such as a GPU or FPGA. When the CPU and the accelerator are used collaboratively, the device's computational performance reaches its peak. However, the larger amount of resources employed for computation can have the side effect of increasing the soft error rate. In this paper we evaluate the reliability of AMD Kaveri Accelerated Processing Units executing a set of heterogeneous applications. We distribute the workload between the CPU and the GPU and evaluate which configuration provides the lowest error rate or allows the largest amount of data to be computed before a failure occurs. We show that, in most cases, the most reliable workload distribution is the one that delivers the highest performance. Our experiments show that choosing the correct workload distribution can increase device reliability by up to 9x.

Identifying the Most Reliable Collaborative Workload Distribution in Heterogeneous Devices / Davila, G. P.; Oliveira, D.; Navaux, P.; Rech, P. - (2019), pp. 1325-1330. (22nd Design, Automation and Test in Europe Conference and Exhibition, DATE 2019, Firenze Fiera, Italy, 2019) [10.23919/DATE.2019.8715107].


Davila G. P.; Oliveira D.; Navaux P.; Rech P.

2019-01-01



Date of publication: 2019

Proceedings title: Proceedings of the 2019 Design, Automation and Test in Europe Conference and Exhibition, DATE 2019

Place of publication: United States

Publisher: Institute of Electrical and Electronics Engineers Inc.

ISBN: 978-3-9819263-2-3

Scopus identifier: 2-s2.0-85066613990

WOS identifier: WOS:000470666100246

All authors: Davila, G. P.; Oliveira, D.; Navaux, P.; Rech, P.


Use this identifier to cite or link to this document: https://hdl.handle.net/11572/403750
