Identifying and Interpreting Non-Aligned Human Conceptual Representations using Language Modeling

Bao, Wanqian; Hasson, Uri

doi:10.48550/arXiv.2403.06204

The question of whether people’s experience in the world shapes conceptual representation and lexical semantics is longstanding. Word-association, feature-listing and similarity-rating tasks aim to address this question but ultimately require a subjective interpretation of the latent dimensions or clusters identified. In this study, we introduce a supervised representational-alignment method that (i) determines whether two groups of individuals share the same basis of a certain category, and (ii) explains in what respects they differ. In applying this method, we show that congenital blindness induces conceptual reorganization in both amodal and sensory-related verbal domains, and we identify the associated semantic shifts. We first apply supervised feature-pruning to a language model (GloVe) to optimize prediction accuracy of human similarity judgments from word embeddings. Pruning identifies one subset of retained GloVe features that optimizes prediction of judgments made by sighted individuals and another subset that optimizes prediction of judgments made by blind individuals. A linear probing analysis then interprets the latent semantics of these feature subsets by learning a mapping from the retained GloVe features to 65 interpretable semantic dimensions. We apply this approach to seven semantic domains, including verbs related to motion, sight, touch, and amodal verbs related to knowledge acquisition. We find that blind individuals more strongly associate social and cognitive meanings with verbs related to motion or those communicating non-speech vocal utterances (e.g., whimper, moan). Conversely, for amodal verbs, they demonstrate much sparser information. Finally, for some verbs, representations of blind and sighted individuals are highly similar. The study presents a formal approach for studying interindividual differences in word meaning, and the first demonstration of how blindness impacts conceptual representation of everyday verbs.
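The two steps named in the abstract (supervised feature pruning, then linear probing) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not state the pruning criterion, so the sketch uses greedy backward elimination scored by the Pearson correlation between cosine similarities and human similarity ratings; random vectors stand in for GloVe embeddings; and the names `group_a`, `group_b`, `prune_features` and `linear_probe` are hypothetical. It should not be read as the authors' implementation.

```python
import numpy as np

def pairwise_cosine(X):
    """Cosine similarity for every word pair (upper triangle, flattened)."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    Xn = X / np.clip(norms, 1e-12, None)
    sims = Xn @ Xn.T
    return sims[np.triu_indices(X.shape[0], k=1)]

def fit_score(X, human_sims, keep):
    """Pearson correlation between model and human pairwise similarities."""
    pred = pairwise_cosine(X[:, keep])
    if pred.std() == 0:
        return -1.0
    return float(np.corrcoef(pred, human_sims)[0, 1])

def prune_features(X, human_sims, min_features=2):
    """Assumed criterion: drop one feature at a time while the fit improves."""
    keep = list(range(X.shape[1]))
    best = fit_score(X, human_sims, keep)
    while len(keep) > min_features:
        trials = [(fit_score(X, human_sims, keep[:i] + keep[i + 1:]), i)
                  for i in range(len(keep))]
        score, i = max(trials)
        if score <= best:  # no single removal helps any more
            break
        best = score
        del keep[i]
    return keep, best

def linear_probe(X_kept, Y):
    """Least-squares map from retained features to interpretable dimensions."""
    W, *_ = np.linalg.lstsq(X_kept, Y, rcond=None)
    return W

# Synthetic stand-ins: 20 "words", 10 embedding features, 4 probe dimensions.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 10))
Y = rng.normal(size=(20, 4))
n_pairs = 20 * 19 // 2

# Two hypothetical groups whose judgments depend on different features.
sims_a = pairwise_cosine(X[:, [0, 1, 2]]) + 0.05 * rng.normal(size=n_pairs)
sims_b = pairwise_cosine(X[:, [3, 4, 5]]) + 0.05 * rng.normal(size=n_pairs)

keep_a, score_a = prune_features(X, sims_a)  # subset fitted to group_a
keep_b, score_b = prune_features(X, sims_b)  # subset fitted to group_b
W_a = linear_probe(X[:, keep_a], Y)
W_b = linear_probe(X[:, keep_b], Y)
```

Under these assumptions, comparing `keep_a` with `keep_b` corresponds to asking whether the two groups rely on the same features, and comparing `W_a` with `W_b` corresponds to asking how the retained features differ in interpretable terms.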

Identifying and Interpreting Non-Aligned Human Conceptual Representations using Language Modeling / Bao, W., Hasson, U. (2024). (ICLR 2024 Workshop Re-Align, Vienna, 11 May 2024) [10.48550/arXiv.2403.06204].

2024-01-01

Date of publication: 2024
Proceedings title: ICLR 2024 Workshop on Representational Alignment
Place of publication: Online
Publisher: Online
All authors: Bao, Wanqian; Hasson, Uri
Citation: Identifying and Interpreting Non-Aligned Human Conceptual Representations using Language Modeling / Bao, W., Hasson, U. (2024). (ICLR 2024 Workshop Re-Align, Vienna, 11 May 2024) [10.48550/arXiv.2403.06204].
Publication type: 04.1 Paper in Proceedings

Files in this record:

File	Size	Format
2403.06204.pdf (open access; description: arXiv camera-ready version; type: refereed author’s manuscript, post-print; license: Creative Commons)	1.29 MB	Adobe PDF