Automated Fault Tree Learning from Continuous-valued Sensor Data: A Case Study on Domestic Heaters

Verkuil, B.; Budde, C. E.; Bucur, D.

doi:10.36001/IJPHM.2022.v13i2.3160

Many industrial sectors have been collecting big sensor data. With recent technologies for processing big data, companies can exploit this for automatic failure detection and prevention. We propose the first completely automated method for failure analysis, machine-learning fault trees from raw observational data with continuous variables. Our method scales well and is tested on a real-world, five-year dataset of domestic heater operations in The Netherlands, with 31 million unique heater-day readings, each containing 27 sensor and 11 failure variables. Our method builds on two previous procedures: the C4.5 decision-tree learning algorithm, and the LIFT fault tree learning algorithm from Boolean data. C4.5 pre-processes each continuous variable: it learns an optimal numerical threshold which distinguishes between faulty and normal operation of the top-level system. These thresholds discretise the variables, thus allowing LIFT to learn fault trees which model the root failure mechanisms of the system and are explainable. We obtain fault trees for the 11 failure variables, and evaluate them in two ways: quantitatively, with a significance score, and qualitatively, with domain specialists. Some of the fault trees learnt have almost maximum significance (above 0.95), while others have medium-to-low significance (around 0.30), reflecting the difficulty of learning from big, noisy, real-world sensor data. The domain specialists confirm that the fault trees model meaningful relationships among the variables.
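The thresholding step described in the abstract can be illustrated with a minimal sketch. It assumes a single continuous sensor variable and a Boolean failure label per reading, and it picks the cut point that maximises plain information gain over the midpoints between consecutive sorted values. C4.5 itself ranks splits by gain ratio, and the exact procedure used in the paper is not given in this record, so the function names (`best_threshold`, `discretise`) and the toy data are hypothetical and serve only to show the idea.

```python
import math


def entropy(labels):
    """Binary Shannon entropy (in bits) of a list of 0/1 failure labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    p = sum(labels) / n
    if p in (0.0, 1.0):
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def best_threshold(values, failed):
    """Return (threshold, information gain) for the best binary split.

    Candidate thresholds are the midpoints between consecutive distinct
    sorted values. This simple version is quadratic in the number of
    readings; it is a sketch, not a scalable implementation.
    """
    pairs = sorted(zip(values, failed))
    base = entropy([f for _, f in pairs])
    best_gain, best_t = 0.0, None
    for i in range(1, len(pairs)):
        lo, hi = pairs[i - 1][0], pairs[i][0]
        if lo == hi:
            continue  # no cut point between equal values
        t = (lo + hi) / 2
        left = [f for v, f in pairs if v <= t]
        right = [f for v, f in pairs if v > t]
        weighted = (len(left) * entropy(left) + len(right) * entropy(right)) / len(pairs)
        gain = base - weighted
        if gain > best_gain:
            best_gain, best_t = gain, t
    return best_t, best_gain


def discretise(values, threshold):
    """Turn a continuous variable into a Boolean one: True if above the threshold."""
    return [v > threshold for v in values]


# Hypothetical toy data: a sensor value per reading and whether the system failed.
sensor = [1.0, 1.2, 1.1, 2.9, 3.1, 3.0]
failure = [0, 0, 0, 1, 1, 1]
threshold, gain = best_threshold(sensor, failure)
boolean_sensor = discretise(sensor, threshold)
```

Each Boolean column produced this way could then be passed to a fault tree learner that expects Boolean data, which is the role LIFT plays in the method described above.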

Automated Fault Tree Learning from Continuous-valued Sensor Data: A Case Study on Domestic Heaters / Verkuil, B., Budde, C.E., Bucur, D. - In: INTERNATIONAL JOURNAL OF PROGNOSTICS AND HEALTH MANAGEMENT. - ISSN 2153-2648. - Electronic. - 13:2(2022), pp. 1-12. [10.36001/IJPHM.2022.v13i2.3160]

2022-01-01

Date of publication: 2022
Journal title: INTERNATIONAL JOURNAL OF PROGNOSTICS AND HEALTH MANAGEMENT
Issue number and part: 2
DOI: https://dx.doi.org/10.36001/IJPHM.2022.v13i2.3160
Scopus identifier: 2-s2.0-85135139918
WOS identifier: WOS:000930509000001
All authors: Verkuil, B.; Budde, C. E.; Bucur, D.
Appears in the categories: 03.1 Journal article

Files in this record:

File	Size	Format
3160-Full-Length Manuscripts-11229-1-10-20220731-1.pdf (open access; type: publisher's layout; licence: Creative Commons)	1.39 MB	Adobe PDF