Iterative Adversarial Removal of Gender Bias in Pretrained Word Embeddings

Gaci, Y; Benatallah, B; Casati, F; Benabdeslem, K

doi:10.1145/3477314.3507274

Recent advances in Representation Learning have discovered a strong inclination for pre-trained word embeddings to demonstrate unfair and discriminatory gender stereotypes. These usually come in the shape of unjustified associations between representations of group words (e.g., male or female) and attribute words (e.g. driving, cooking, doctor, nurse, etc.) In this paper, we propose an iterative and adversarial procedure to reduce gender bias in word vectors. We aim to remove gender influence from word representations that should otherwise be free of it, while retaining meaningful gender information in words that are inherently charged with gender polarity (male or female). We confine these gender signals in a sub-vector of word embeddings to make them more interpretable. Quantitative and qualitative experiments confirm that our method successfully reduces gender bias in pre-trained word embeddings with minimal semantic offset.

Iterative Adversarial Removal of Gender Bias in Pretrained Word Embeddings / Gaci, Y; Benatallah, B; Casati, F; Benabdeslem, K. - (2022), pp. 829-836. (Intervento presentato al convegno 37th ACM/SIGAPP Symposium on Applied Computing, SAC 2022 tenutosi a Online nel 25 - 29 April 2022) [10.1145/3477314.3507274].

Iterative Adversarial Removal of Gender Bias in Pretrained Word Embeddings

Gaci, Y;Benatallah, B;Casati, F;Benabdeslem, K

2022-01-01

Abstract

Recent advances in Representation Learning have discovered a strong inclination for pre-trained word embeddings to demonstrate unfair and discriminatory gender stereotypes. These usually come in the shape of unjustified associations between representations of group words (e.g., male or female) and attribute words (e.g. driving, cooking, doctor, nurse, etc.) In this paper, we propose an iterative and adversarial procedure to reduce gender bias in word vectors. We aim to remove gender influence from word representations that should otherwise be free of it, while retaining meaningful gender information in words that are inherently charged with gender polarity (male or female). We confine these gender signals in a sub-vector of word embeddings to make them more interpretable. Quantitative and qualitative experiments confirm that our method successfully reduces gender bias in pre-trained word embeddings with minimal semantic offset.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2022
			
	Titolo del volume (Proceedings title)
	
				SAC '22: Proceedings of the 37th ACM/SIGAPP Symposium on Applied Computing
			
	Luogo di edizione (Place of publication)
	
				1601 Broadway, 10th Floor, NEW YORK, NY, UNITED STATES
			
	Casa editrice (Publisher)
	
				ASSOC COMPUTING MACHINERY
			
	ISBN
	
				9781450387132
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85130381995
			
	Codice WOS (WOS identifier)
	
				WOS:000946564100113
			
	Tutti gli autori
	
						Gaci, Y; Benatallah, B; Casati, F; Benabdeslem, K
					
	Citazione
	
				Iterative Adversarial Removal of Gender Bias in Pretrained Word Embeddings / Gaci, Y; Benatallah, B; Casati, F; Benabdeslem, K. - (2022), pp. 829-836. (Intervento presentato al  convegno 37th ACM/SIGAPP Symposium on Applied Computing, SAC 2022 tenutosi a Online nel 25 - 29 April 2022) [10.1145/3477314.3507274].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
SAC___Iterative_Adversarial_removal_of_gender_bias (1).pdf accesso aperto Tipologia: Post-print referato (Refereed author’s manuscript) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 943.83 kB Formato Adobe PDF Visualizza/Apri	943.83 kB	Adobe PDF	Visualizza/Apri
3477314.3507274.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 1.32 MB Formato Adobe PDF Visualizza/Apri	1.32 MB	Adobe PDF	Visualizza/Apri