Thanks to the advances in deep Reinforcement Learning (RL) and its demonstrated capabilities to perform complex tasks, the field of Multi-Agent RL (MARL) has recently undergone major developments. However, current MARL approaches based on deep learning still suffer from a general lack of interpretability. Recently, hybrid models combining Decision Trees (DTs) with simple leaves running Q-Learning have been proposed as an alternative to achieve high performance while preserving interpretability. However, efficient search strategies are needed to optimize such models. In this paper, we address this challenge by proposing a novel Quality-Diversity evolutionary optimization approach, based on MAP-Elites. We test the method on a team-based game, on which we introduce a coach agent, also optimized via evolutionary search, to optimize the team creation during training. The proposed strategy is tested in conjunction with three different evolutionary selection methods and two different mappings...

Thanks to the advances in deep Reinforcement Learning (RL) and its demonstrated capabilities to perform complex tasks, the field of Multi-Agent RL (MARL) has recently undergone major developments. However, current MARL approaches based on deep learning still suffer from a general lack of interpretability. Recently, hybrid models combining Decision Trees (DTs) with simple leaves running Q-Learning have been proposed as an alternative to achieve high performance while preserving interpretability. However, efficient search strategies are needed to optimize such models. In this paper, we address this challenge by proposing a novel Quality-Diversity evolutionary optimization approach, based on MAP-Elites. We test the method on a team-based game, on which we introduce a coach agent, also optimized via evolutionary search, to optimize the team creation during training. The proposed strategy is tested in conjunction with three different evolutionary selection methods and two different mappings between MAP-Elites archives and team members. Results demonstrate how the proposed approach can effectively find high-performing policies to accomplish the given task, while the coach pushes even further the team optimization, hence improving the algorithm’s overall performance.

A Coach-Based Quality-Diversity Approach for Multi-agent Interpretable Reinforcement Learning / Nielsen, Erik; Ferigo, Andrea; Iacca, Giovanni. - 15612:(2025), pp. 402-418. ( 28th European Conference on Applications of Evolutionary Computation, EvoApplications 2025, held as part of EvoStar 2025 Trieste 23rd April-25th April 2025) [10.1007/978-3-031-90062-4_25].

A Coach-Based Quality-Diversity Approach for Multi-agent Interpretable Reinforcement Learning

Erik Nielsen;Andrea Ferigo;Giovanni Iacca
2025-01-01

Abstract

Thanks to the advances in deep Reinforcement Learning (RL) and its demonstrated capabilities to perform complex tasks, the field of Multi-Agent RL (MARL) has recently undergone major developments. However, current MARL approaches based on deep learning still suffer from a general lack of interpretability. Recently, hybrid models combining Decision Trees (DTs) with simple leaves running Q-Learning have been proposed as an alternative to achieve high performance while preserving interpretability. However, efficient search strategies are needed to optimize such models. In this paper, we address this challenge by proposing a novel Quality-Diversity evolutionary optimization approach, based on MAP-Elites. We test the method on a team-based game, on which we introduce a coach agent, also optimized via evolutionary search, to optimize the team creation during training. The proposed strategy is tested in conjunction with three different evolutionary selection methods and two different mappings...
2025
Applications of Evolutionary Computation. EvoApplications 2025
Cham
Springer Science and Business Media Deutschland GmbH
9783031900617
9783031900624
Nielsen, Erik; Ferigo, Andrea; Iacca, Giovanni
A Coach-Based Quality-Diversity Approach for Multi-agent Interpretable Reinforcement Learning / Nielsen, Erik; Ferigo, Andrea; Iacca, Giovanni. - 15612:(2025), pp. 402-418. ( 28th European Conference on Applications of Evolutionary Computation, EvoApplications 2025, held as part of EvoStar 2025 Trieste 23rd April-25th April 2025) [10.1007/978-3-031-90062-4_25].
File in questo prodotto:
File Dimensione Formato  
paper_3.pdf

embargo fino al 17/04/2026

Tipologia: Post-print referato (Refereed author’s manuscript)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 578.42 kB
Formato Adobe PDF
578.42 kB Adobe PDF   Visualizza/Apri
A Coach-Based Quality-Diversity Approach for Multi-agent Interpretable Reinforcement Learning.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.4 MB
Formato Adobe PDF
1.4 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/452212
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex 0
social impact