Skip to Main Content (Press Enter)

Logo UNIBS
  • ×
  • Home
  • Persone
  • Strutture
  • Competenze
  • Pubblicazioni
  • Professioni
  • Corsi
  • Insegnamenti
  • Terza Missione

Competenze & Professionalità
Logo UNIBS

|

Competenze & Professionalità

unibs.it
  • ×
  • Home
  • Persone
  • Strutture
  • Competenze
  • Pubblicazioni
  • Professioni
  • Corsi
  • Insegnamenti
  • Terza Missione
  1. Pubblicazioni

Graphical Identification of Gender Bias in BERT with a Weakly Supervised Approach

Contributo in Atti di convegno
Data di Pubblicazione:
2022
Abstract:
Transformer-based algorithms such as BERT are typically trained with large corpora of documents, extracted directly from the Internet. As reported by several studies, these data can contain biases, stereotypes and other properties which are transferred also to the machine learning models, potentially leading them to a discriminatory behaviour which should be identified and corrected. A very intuitive technique for bias identification in NLP models is the visualization of word embeddings. Exploiting the concept of that a short distance between two word vectors means a semantic similarity between these two words; for instance, a closeness between the terms nurse and woman could be an indicator of gender bias in the model. These techniques however were designed for static word embedding algorithms such as Word2Vec. Instead, BERT does not guarantee the same relation between semantic similarity and short distance, making the visualization techniques more difficult to apply. In this work, we propose a weakly supervised approach, which only requires a list of gendered words that can be easily found in online lexical resources, for visualizing the gender bias present in the English base model of BERT. Our approach is based on a Linear Support Vector Classifier and Principal Component Analysis (PCA) and obtains better results with respect to standard PCA.
Tipologia CRIS:
4.1 Contributo in Atti di convegno
Keywords:
BERT; Ethics; Fairness; Gender Bias; Model Interpretability
Elenco autori:
Dusi, M.; Arici, N.; Gerevini, A. E.; Putelli, L.; Serina, I.
Autori di Ateneo:
GEREVINI Alfonso Emilio
PUTELLI LUCA
SERINA Ivan
Link alla scheda completa:
https://iris.unibs.it/handle/11379/579368
Titolo del libro:
CEUR Workshop Proceedings
Pubblicato in:
CEUR WORKSHOP PROCEEDINGS
Journal
CEUR WORKSHOP PROCEEDINGS
Series
  • Assistenza
  • Privacy
  • Utilizzo dei cookie
  • Note legali

Realizzato con VIVO | Designed by Cineca | 26.5.2.0