Quality–diversity optimization of decision trees for interpretable reinforcement learning / Ferigo, Andrea; Custode, Leonardo Lucio; Iacca, Giovanni. - In: NEURAL COMPUTING & APPLICATIONS. - ISSN 0941-0643. - 2023 (2023). [DOI: 10.1007/s00521-023-09124-5]
Quality–diversity optimization of decision trees for interpretable reinforcement learning
Ferigo, Andrea; Custode, Leonardo Lucio; Iacca, Giovanni
2023-01-01
Abstract
In the current Artificial Intelligence (AI) landscape, addressing explainability and interpretability in Machine Learning (ML) is of critical importance. In fact, the vast majority of works on AI focus on Deep Neural Networks (DNNs), which are not interpretable: they are extremely hard for humans to inspect and understand. This is a crucial disadvantage of these methods, which makes them hard to trust in high-stakes scenarios. Interpretable models, on the other hand, are considerably easier to inspect, which allows humans to test them exhaustively and, thus, to trust them. While the fields of eXplainable Artificial Intelligence (XAI) and Interpretable Artificial Intelligence (IAI) are progressing in supervised settings, the field of Interpretable Reinforcement Learning (IRL) is falling behind. Several approaches leveraging Decision Trees (DTs) for IRL have been proposed in recent years. However, all of them use goal-directed optimization methods, which may have limited exploration capabilities. In this work, we extend a previous study on the applicability of Quality–Diversity (QD) algorithms to the optimization of DTs for IRL. We test the methods on two well-known Reinforcement Learning (RL) benchmark tasks from OpenAI Gym, comparing their results in terms of score and "illumination" patterns. We show that using QD algorithms is an effective way to explore the search space of IRL models. Moreover, we find that, in the context of DTs for IRL, QD approaches based on MAP-Elites (ME) and its variant Covariance Matrix Adaptation MAP-Elites (CMA-ME) can significantly improve convergence speed over goal-directed approaches.
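To make the optimization approach concrete, the sketch below shows, in broad strokes, how a MAP-Elites loop can evolve small decision-tree policies for a Gym control task. It is an illustrative, self-contained example, not the authors' implementation: the environment (CartPole-v1), the tree encoding, the mutation operator, the behaviour descriptors (action bias and mean cart position), and all hyper-parameters are assumptions made only for this sketch.

```python
"""Minimal MAP-Elites sketch for evolving small decision-tree policies.

Illustrative only: the environment (CartPole-v1), the tree encoding, the
mutation operator and the behaviour descriptors are assumptions made for
this sketch, not the exact setup used in the paper.
"""
import random

import numpy as np
import gymnasium as gym  # Gymnasium API: reset() -> (obs, info), step() -> 5-tuple


def random_tree(n_features, n_actions, depth=2):
    """Sample a full binary decision tree encoded as nested dicts."""
    if depth == 0:
        return {"action": random.randrange(n_actions)}
    return {
        "feature": random.randrange(n_features),
        "threshold": random.uniform(-1.0, 1.0),
        "left": random_tree(n_features, n_actions, depth - 1),
        "right": random_tree(n_features, n_actions, depth - 1),
    }


def act(tree, obs):
    """Route an observation down the tree and return the leaf action."""
    while "action" not in tree:
        tree = tree["left"] if obs[tree["feature"]] < tree["threshold"] else tree["right"]
    return tree["action"]


def mutate(tree, sigma=0.3):
    """Perturb split thresholds; occasionally flip a leaf action (binary actions assumed)."""
    if "action" in tree:
        return {"action": 1 - tree["action"]} if random.random() < 0.1 else tree
    return {
        "feature": tree["feature"],
        "threshold": tree["threshold"] + random.gauss(0.0, sigma),
        "left": mutate(tree["left"], sigma),
        "right": mutate(tree["right"], sigma),
    }


def evaluate(tree, env, episodes=3):
    """Return (fitness, descriptor): mean episodic return plus a 2-D behaviour measure."""
    returns, actions, positions = [], [], []
    for _ in range(episodes):
        obs, _ = env.reset()
        done, total = False, 0.0
        while not done:
            action = act(tree, obs)
            obs, reward, terminated, truncated, _ = env.step(action)
            done = terminated or truncated
            total += reward
            actions.append(action)
            positions.append(obs[0])
        returns.append(total)
    # Illustrative descriptors: action bias in [0, 1] and mean cart position in [-1, 1].
    descriptor = (float(np.mean(actions)), float(np.clip(np.mean(positions), -1.0, 1.0)))
    return float(np.mean(returns)), descriptor


def map_elites(iterations=2000, bins=10, bootstrap=100):
    """Fill a bins x bins archive with the best tree found for each behaviour cell."""
    env = gym.make("CartPole-v1")
    archive = {}  # cell coordinates -> (fitness, tree)
    for it in range(iterations):
        if archive and it >= bootstrap:
            _, parent = random.choice(list(archive.values()))
            candidate = mutate(parent)
        else:
            candidate = random_tree(env.observation_space.shape[0], env.action_space.n)
        fitness, (d0, d1) = evaluate(candidate, env)
        cell = (min(bins - 1, int(d0 * bins)),               # action bias, already in [0, 1]
                min(bins - 1, int((d1 + 1.0) / 2 * bins)))   # rescale position to [0, 1]
        if cell not in archive or fitness > archive[cell][0]:
            archive[cell] = (fitness, candidate)  # MAP-Elites replacement rule
    return archive


if __name__ == "__main__":
    archive = map_elites(iterations=500)
    best = max(fitness for fitness, _ in archive.values())
    print(f"filled cells: {len(archive)}, best mean return: {best:.1f}")
```

The essential QD ingredient is the archive: every candidate is binned by its behaviour descriptor and replaces the cell's current elite only if its return is higher, so the search "illuminates" the descriptor space instead of converging to a single solution.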
| File | Access | Description | Type | License | Size | Format |
|---|---|---|---|---|---|---|
| s00521-023-09124-5.pdf | Open access | First online | Publisher's version (publisher's layout) | Creative Commons | 651.74 kB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.