FAIRGAME: A Framework for AI Agents Bias Recognition Using Game Theory / Buscemi, Alessio; Proverbio, Daniele; Di Stefano, Alessandro; Han, The Anh; Castignani, German; Liò, Pietro. - 2025, 413:(2025), pp. 4097-4104. ( ECAI 2025 Bologna 25-30 October 2025) [10.3233/FAIA251300].

FAIRGAME: A Framework for AI Agents Bias Recognition Using Game Theory

Proverbio, Daniele
Co-first author
2025-01-01

Abstract

Letting AI agents interact in multi-agent applications adds a layer of complexity to the interpretability and prediction of AI outcomes, with profound implications for their trustworthy adoption in research and society. Game theory offers powerful models to capture and interpret strategic interaction among agents, but requires the support of reproducible, standardized and user-friendly IT frameworks to enable comparison and interpretation of results. To this end, we present FAIRGAME, a Framework for AI Agents Bias Recognition using Game Theory. We describe its implementation and usage, and we employ it to uncover biased outcomes in popular games among AI agents, depending on the Large Language Model (LLM) and the language used, as well as on the agents' personality traits or strategic knowledge. Overall, FAIRGAME allows users to reliably and easily simulate their desired games and scenarios and to compare the results across simulation campaigns and with game-theoretic predictions. This enables the systematic discovery of biases, the anticipation of emergent behavior arising from strategic interplay, and further research into strategic decision-making using LLM agents.
2025
Frontiers in Artificial Intelligence and Applications
Amsterdam
IOS Press Ebooks
Buscemi, Alessio; Proverbio, Daniele; Di Stefano, Alessandro; Han, The Anh; Castignani, German; Liò, Pietro
Files in this item:
File: 2025_ECAI_Fairgame.pdf
Access: open access
Description: ECAI 2025
Type: Publisher's version (Publisher's layout)
License: Creative Commons
Size: 734.39 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/467861