Language Models Use Monotonicity to Assess NPI Licensing

Jumelet, J.; Denic, M.; Szymanik, J.; Hupkes, D.; Steinert-Threlkeld, S.

doi:10.18653/v1/2021.findings-acl.439

We investigate the semantic knowledge of language models (LMs), focusing on (1) whether these LMs create categories of linguistic environments based on their semantic monotonicity properties, and (2) whether these categories play a similar role in LMs as in human language understanding, using negative polarity item licensing as a case study. We introduce a series of experiments consisting of probing with diagnostic classifiers (DCs), linguistic acceptability tasks, as well as a novel DC ranking method that tightly connects the probing results to the inner workings of the LM. By applying our experimental pipeline to LMs trained on various filtered corpora, we are able to gain stronger insights into the semantic generalizations that are acquired by these models.

Language Models Use Monotonicity to Assess NPI Licensing / Jumelet, J.; Denic, M.; Szymanik, J.; Hupkes, D.; Steinert-Threlkeld, S. - (2021), pp. 4958-4969. (Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 Bangkok 2021) [10.18653/v1/2021.findings-acl.439].

2021-01-01

Date of publication: 2021
Proceedings title: Findings of the Association for Computational Linguistics
Place of publication: USA
Publisher: Association for Computational Linguistics
ISBN: 9781954085541
Scopus identifier: 2-s2.0-85117033075
WOS identifier: WOS:001181734704031
All authors: Jumelet, J.; Denic, M.; Szymanik, J.; Hupkes, D.; Steinert-Threlkeld, S.
Appears in types: 04.1 Paper in Proceedings

Files in this item:

File	Access	Type	License	Size	Format
2105.13818.pdf	Open access	Refereed author's manuscript (post-print)	All rights reserved	1.24 MB	Adobe PDF
2021.findings-acl.439.pdf	Open access	Publisher's layout	Creative Commons	1.23 MB	Adobe PDF