The high computing efficiency of graphics processing units (GPUs) makes them attractive for both high-performance computing and safety-critical applications, such as those in the automotive and aerospace domains. In both application domains, reliability is a major concern. This paper aims to provide guidelines for improving the reliability of GPU register files without jeopardizing the device's computing efficiency. We advance the knowledge of GPU reliability by investigating register file criticality, that is, the probability that a fault in a register propagates and affects the computation. We then propose and validate selective fault-tolerance techniques for GPU register files that can be applied at the hardware or software level. Results show that both implementations are well suited to detecting faults that affect the computation. However, although hardware-implemented techniques are able to detect faults that trigger a crash, software-implemented techniques may not provide sufficient coverage for crashes.
Selective Fault Tolerance for Register Files of Graphics Processing Units / Goncalves, M.; Fernandes, F.; Lamb, I.; Rech, P.; Azambuja, J. R. - In: IEEE TRANSACTIONS ON NUCLEAR SCIENCE. - ISSN 0018-9499. - 66:7(2019), pp. 1449-1456. [10.1109/TNS.2019.2903027]
Title: | Selective Fault Tolerance for Register Files of Graphics Processing Units |
Authors: | Goncalves, M.; Fernandes, F.; Lamb, I.; Rech, P.; Azambuja, J. R. |
Journal title: | IEEE TRANSACTIONS ON NUCLEAR SCIENCE |
Year of publication: | 2019 |
Issue number and part: | 7 |
Scopus identifier: | 2-s2.0-85069465931 |
Digital Object Identifier (DOI): | http://dx.doi.org/10.1109/TNS.2019.2903027 |
Handle: | http://hdl.handle.net/11572/346729 |
Appears in categories: | 03.1 Journal article |