Large Language Models (LLMs) often have implicit knowledge of domain-specific rules, such as age requirements for obtaining a driver’s license, but may not consistently apply this knowledge in conversations. In this paper, we explore a method for fine-tuning LLMs using datasets generated by the LLM itself. The goal is to explicitly enforce specific rules, such as declaring ineligibility if the age requirement is not met, within a defined context. We evaluate whether this fine-tuning approach enables the model to recognize the need to apply relevant knowledge in other contexts, such as marriage eligibility, where the LLM already has knowledge of the underlying criteria. Our results show that after fine-tuning, the LLM not only applies the rule in the training contexts, but also generalizes this behavior to enforce the rule in different domains. This suggests that fine-tuning, even with self-generated datasets, can improve the ability of the LLM to apply its knowledge more consistently, leading to more reliable performance in rule-based scenarios.

Rule enforcement in LLMs: a parameter efficient fine-tuning approach with self-generated training dataset / Franch, D., Roberti, P., Blanzieri, E.. - 3903:(2024), pp. 17-32. (3rd Workshop on Artificial Intelligence for Human-Machine Interaction, AIxHMI 2024 ita 2024).

Rule enforcement in LLMs: a parameter efficient fine-tuning approach with self-generated training dataset

Roberti P.;Blanzieri E.
2024-01-01

Abstract

Large Language Models (LLMs) often have implicit knowledge of domain-specific rules, such as age requirements for obtaining a driver’s license, but may not consistently apply this knowledge in conversations. In this paper, we explore a method for fine-tuning LLMs using datasets generated by the LLM itself. The goal is to explicitly enforce specific rules, such as declaring ineligibility if the age requirement is not met, within a defined context. We evaluate whether this fine-tuning approach enables the model to recognize the need to apply relevant knowledge in other contexts, such as marriage eligibility, where the LLM already has knowledge of the underlying criteria. Our results show that after fine-tuning, the LLM not only applies the rule in the training contexts, but also generalizes this behavior to enforce the rule in different domains. This suggests that fine-tuning, even with self-generated datasets, can improve the ability of the LLM to apply its knowledge more consistently, leading to more reliable performance in rule-based scenarios.
2024
CEUR Workshop Proceedings
Germany
CEUR-WS
Franch, D.; Roberti, P.; Blanzieri, E.
Rule enforcement in LLMs: a parameter efficient fine-tuning approach with self-generated training dataset / Franch, D., Roberti, P., Blanzieri, E.. - 3903:(2024), pp. 17-32. (3rd Workshop on Artificial Intelligence for Human-Machine Interaction, AIxHMI 2024 ita 2024).
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/490412
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact