Written Multi-Party Conversations (WMPCs) are widely studied across disciplines, with social media as a primary data source due to their accessibility. However, these datasets raise privacy concerns and often reflect platform-specific properties. For example, interactions between speakers may be limited due to rigid platform structures (e.g., threads, tree-like discussions), which yield overly simplistic interaction patterns (e.g., one-to-one ``reply-to'' links). This work explores the feasibility of generating synthetic WMPCs with instruction-tuned Large Language Models (LLMs) by providing deterministic constraints such as dialogue structure and participants’ stance. We investigate two complementary strategies of leveraging LLMs in this context: (i.) LLMs as WMPC generators, where we task the LLM to generate a whole WMPC at once and (ii.) LLMs as WMPC parties, where the LLM generates one turn of the conversation at a time (made of speaker, addressee and message), provided the conversation history. We next introduce an analytical framework to evaluate compliance with the constraints, content quality, and interaction complexity for both strategies. Finally, we assess the level of obtained WMPCs via human and LLM-as-a-judge evaluations. We find stark differences among LLMs, with only some being able to generate high-quality WMPCs. We also find that turn-by-turn generation yields better conformance to constraints and higher linguistic variability than generating WMPCs in one pass. Nonetheless, our structural and qualitative evaluation indicates that both generation strategies can yield high-quality WMPCs.

Don’t Stop the Multi-Party! On Generating Synthetic Written Multi-Party Conversations with Constraints / Penzo, Nicolò; Guerini, Marco; Lepri, Bruno; Glavaš, Goran; Tonelli, Sara. - ELETTRONICO. - 40:39(2026), pp. 32701-32709. ( The Fortieth AAAI Conference on Artificial Intelligence (AAAI-26) Singapore 20th May–27th May, 2026) [10.1609/aaai.v40i39.40548].

Don’t Stop the Multi-Party! On Generating Synthetic Written Multi-Party Conversations with Constraints

Penzo, Nicolò;Guerini, Marco;Lepri, Bruno;Tonelli, Sara
2026-01-01

Abstract

Written Multi-Party Conversations (WMPCs) are widely studied across disciplines, with social media as a primary data source due to their accessibility. However, these datasets raise privacy concerns and often reflect platform-specific properties. For example, interactions between speakers may be limited due to rigid platform structures (e.g., threads, tree-like discussions), which yield overly simplistic interaction patterns (e.g., one-to-one ``reply-to'' links). This work explores the feasibility of generating synthetic WMPCs with instruction-tuned Large Language Models (LLMs) by providing deterministic constraints such as dialogue structure and participants’ stance. We investigate two complementary strategies of leveraging LLMs in this context: (i.) LLMs as WMPC generators, where we task the LLM to generate a whole WMPC at once and (ii.) LLMs as WMPC parties, where the LLM generates one turn of the conversation at a time (made of speaker, addressee and message), provided the conversation history. We next introduce an analytical framework to evaluate compliance with the constraints, content quality, and interaction complexity for both strategies. Finally, we assess the level of obtained WMPCs via human and LLM-as-a-judge evaluations. We find stark differences among LLMs, with only some being able to generate high-quality WMPCs. We also find that turn-by-turn generation yields better conformance to constraints and higher linguistic variability than generating WMPCs in one pass. Nonetheless, our structural and qualitative evaluation indicates that both generation strategies can yield high-quality WMPCs.
2026
Proceedings of the AAAI Conference on Artificial Intelligence
Washington, DC
AAAI Press
978-1-57735-906-7
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
Settore INF/01 - Informatica
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
Settore INFO-01/A - Informatica
Penzo, Nicolò; Guerini, Marco; Lepri, Bruno; Glavaš, Goran; Tonelli, Sara
Don’t Stop the Multi-Party! On Generating Synthetic Written Multi-Party Conversations with Constraints / Penzo, Nicolò; Guerini, Marco; Lepri, Bruno; Glavaš, Goran; Tonelli, Sara. - ELETTRONICO. - 40:39(2026), pp. 32701-32709. ( The Fortieth AAAI Conference on Artificial Intelligence (AAAI-26) Singapore 20th May–27th May, 2026) [10.1609/aaai.v40i39.40548].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/483770
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact