Exploring Paraphrasing Strategies for CEFR A1-Level Constraints in LLMs

Marzona, Eugenio; Goikhman, Mariia; Aprosio, Alessio Palmero; Zancanaro, Massimo

doi:10.18653/v1/2025.findings-emnlp.828

Large language models are increasingly used for teaching and self-learning foreign languages. However, their capability to meet specific linguistic constraints is still underexplored. This study compares the effectiveness of prompt engineering in guiding ChatGPT (4o and 4o-mini), and Llama 3 to rephrase general-domain texts to meet CEFR A1-level constraints in English and Italian, making them suitable for beginner learners. It compares 4 prompt engineering approaches, built upon iterative paraphrasing method that gradually refines original texts for CEFR compliance. The approaches compared include paraphrasing with or without Chain-of-Thought, as well as grammar and vocabulary simplification performed either simultaneously or as separate steps. The findings suggest that for English the best approach is combining COT with separate grammar and vocabulary simplification, while for Italian one-step strategies have better effect on grammar, and two-step strategies work better for covering the vocabulary. The paraphrasing approach can approve compliance, although at this point it is not cost-effective. We release a dataset of pairs original sentence-beginner level paraphrase (both in Italian and in English) on which further work could be based.

Exploring Paraphrasing Strategies for CEFR A1-Level Constraints in LLMs / Marzona, E., Goikhman, M., Aprosio, A.P., Zancanaro, M.. - (2025), pp. 15305-15318. (EMNLP Suzhou, China 4-9 November 2025) [10.18653/v1/2025.findings-emnlp.828].

Exploring Paraphrasing Strategies for CEFR A1-Level Constraints in LLMs

Marzona, Eugenio;Goikhman, Mariia;Aprosio, Alessio Palmero;Zancanaro, Massimo

2025-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2025
			
	Titolo del volume (Proceedings title)
	
				Findings of the Association for Computational Linguistics: EMNLP 2025
			
	Luogo di edizione (Place of publication)
	
				Suzhou, China
			
	Casa editrice (Publisher)
	
				ACL
			
	Settori scientifico-disciplinari (validi fino a 24/06/2024) - Reference SSD (valid until 24/06/2024)
	
				Settore INF/01 - Informatica
			
	Settori scientifico-disciplinari (validi dal 09/05/2024) - Reference SSD (valid from 09/05/2024)
	
				Settore INFO-01/A - Informatica
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-105028940641
			
	Tutti gli autori
	
						Marzona, Eugenio; Goikhman, Mariia; Aprosio, Alessio Palmero; Zancanaro, Massimo
					
	Citazione
	
				Exploring Paraphrasing Strategies for CEFR A1-Level Constraints in LLMs / Marzona, E., Goikhman, M., Aprosio, A.P., Zancanaro, M.. - (2025), pp. 15305-15318. (EMNLP Suzhou, China 4-9 November 2025) [10.18653/v1/2025.findings-emnlp.828].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
2025.findings-emnlp.828.pdf accesso aperto Tipologia: Versione editoriale (Publisher’s layout) Licenza: Creative commons Dimensione 216.28 kB Formato Adobe PDF Visualizza/Apri	216.28 kB	Adobe PDF	Visualizza/Apri