A Coach-Based Quality-Diversity Approach for Multi-agent Interpretable Reinforcement Learning

Nielsen, Erik; Ferigo, Andrea; Iacca, Giovanni

doi:10.1007/978-3-031-90062-4_25

Thanks to the advances in deep Reinforcement Learning (RL) and its demonstrated capabilities to perform complex tasks, the field of Multi-Agent RL (MARL) has recently undergone major developments. However, current MARL approaches based on deep learning still suffer from a general lack of interpretability. Recently, hybrid models combining Decision Trees (DTs) with simple leaves running Q-Learning have been proposed as an alternative to achieve high performance while preserving interpretability. However, efficient search strategies are needed to optimize such models. In this paper, we address this challenge by proposing a novel Quality-Diversity evolutionary optimization approach, based on MAP-Elites. We test the method on a team-based game, on which we introduce a coach agent, also optimized via evolutionary search, to optimize the team creation during training. The proposed strategy is tested in conjunction with three different evolutionary selection methods and two different mappings...

Thanks to the advances in deep Reinforcement Learning (RL) and its demonstrated capabilities to perform complex tasks, the field of Multi-Agent RL (MARL) has recently undergone major developments. However, current MARL approaches based on deep learning still suffer from a general lack of interpretability. Recently, hybrid models combining Decision Trees (DTs) with simple leaves running Q-Learning have been proposed as an alternative to achieve high performance while preserving interpretability. However, efficient search strategies are needed to optimize such models. In this paper, we address this challenge by proposing a novel Quality-Diversity evolutionary optimization approach, based on MAP-Elites. We test the method on a team-based game, on which we introduce a coach agent, also optimized via evolutionary search, to optimize the team creation during training. The proposed strategy is tested in conjunction with three different evolutionary selection methods and two different mappings between MAP-Elites archives and team members. Results demonstrate how the proposed approach can effectively find high-performing policies to accomplish the given task, while the coach pushes even further the team optimization, hence improving the algorithm’s overall performance.

A Coach-Based Quality-Diversity Approach for Multi-agent Interpretable Reinforcement Learning / Nielsen, Erik; Ferigo, Andrea; Iacca, Giovanni. - 15612:(2025), pp. 402-418. ( 28th European Conference on Applications of Evolutionary Computation, EvoApplications 2025, held as part of EvoStar 2025 Trieste 23rd April-25th April 2025) [10.1007/978-3-031-90062-4_25].

A Coach-Based Quality-Diversity Approach for Multi-agent Interpretable Reinforcement Learning

Erik Nielsen;Andrea Ferigo;Giovanni Iacca

2025-01-01

Abstract

Thanks to the advances in deep Reinforcement Learning (RL) and its demonstrated capabilities to perform complex tasks, the field of Multi-Agent RL (MARL) has recently undergone major developments. However, current MARL approaches based on deep learning still suffer from a general lack of interpretability. Recently, hybrid models combining Decision Trees (DTs) with simple leaves running Q-Learning have been proposed as an alternative to achieve high performance while preserving interpretability. However, efficient search strategies are needed to optimize such models. In this paper, we address this challenge by proposing a novel Quality-Diversity evolutionary optimization approach, based on MAP-Elites. We test the method on a team-based game, on which we introduce a coach agent, also optimized via evolutionary search, to optimize the team creation during training. The proposed strategy is tested in conjunction with three different evolutionary selection methods and two different mappings...

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2025
			
	Titolo del volume (Proceedings title)
	
				Applications of Evolutionary Computation. EvoApplications 2025
			
	Luogo di edizione (Place of publication)
	
				Cham
			
	Casa editrice (Publisher)
	
				Springer Science and Business Media Deutschland GmbH
			
	ISBN
	
				9783031900617
9783031900624
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-105003903026
			
	Tutti gli autori
	
						Nielsen, Erik; Ferigo, Andrea; Iacca, Giovanni
					
	Citazione
	
				A Coach-Based Quality-Diversity Approach for Multi-agent Interpretable Reinforcement Learning / Nielsen, Erik; Ferigo, Andrea; Iacca, Giovanni. - 15612:(2025), pp. 402-418. ( 28th European Conference on Applications of Evolutionary Computation, EvoApplications 2025, held as part of EvoStar 2025 Trieste 23rd April-25th April 2025) [10.1007/978-3-031-90062-4_25].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
paper_3.pdf embargo fino al 17/04/2026 Tipologia: Post-print referato (Refereed author’s manuscript) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 578.42 kB Formato Adobe PDF Visualizza/Apri	578.42 kB	Adobe PDF	Visualizza/Apri
A Coach-Based Quality-Diversity Approach for Multi-agent Interpretable Reinforcement Learning.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 1.4 MB Formato Adobe PDF Visualizza/Apri	1.4 MB	Adobe PDF	Visualizza/Apri