Biases can arise and be introduced during each phase of a supervised learning pipeline, eventually leading to harm. Within the task of automatic abusive language detection, this matter becomes particularly severe since unintended bias towards sensitive topics such as gender, sexual orientation, or ethnicity can harm underrepresented groups. The role of the datasets used to train these models is crucial to address these challenges. In this contribution, we investigate whether explainability methods can expose racial dialect bias attested within a popular dataset for abusive language detection. Through preliminary experiments, we found that pure explainability techniques cannot effectively uncover biases within the dataset under analysis: the rooted stereotypes are often more implicit and complex to retrieve.

Exposing racial dialect bias in abusive language detection: can explainability play a role?

Morini V
2023

Abstract

Biases can arise and be introduced during each phase of a supervised learning pipeline, eventually leading to harm. Within the task of automatic abusive language detection, this matter becomes particularly severe since unintended bias towards sensitive topics such as gender, sexual orientation, or ethnicity can harm underrepresented groups. The role of the datasets used to train these models is crucial to address these challenges. In this contribution, we investigate whether explainability methods can expose racial dialect bias attested within a popular dataset for abusive language detection. Through preliminary experiments, we found that pure explainability techniques cannot effectively uncover biases within the dataset under analysis: the rooted stereotypes are often more implicit and complex to retrieve.
2023
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Koprinska I. et al.
Machine Learning and Principles and Practice of Knowledge Discovery in Databases
ECML PKDD 2022 - Joint European Conference on Machine Learning and Knowledge Discovery in Databases
483
497
978-3-031-23617-4
https://link.springer.com/chapter/10.1007/978-3-031-23618-1_32
19-23/09/2022
Grenoble, France
ML
NLP
Explainability
Interpretability
ML Evaluation
Fairness in ML
Algorithmic bias
Bias discovery
Algorithmic auditing
Data awareness
Discrimination
1
open
Manerba M.M.; Morini V.
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
   SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics
   SoBigData-PlusPlus
   H2020
   871042

   HumanE AI Network
   HumanE-AI-Net
   H2020
   952026

   Foundations of Trustworthy AI - Integrating Reasoning, Learning and Optimization
   TAILOR
   H2020
   952215
File in questo prodotto:
File Dimensione Formato  
prod_479349-doc_196638.pdf

accesso aperto

Descrizione: Preprint - Exposing racial dialect bias in abusive language detection: can explainability play a role?
Tipologia: Versione Editoriale (PDF)
Dimensione 840.75 kB
Formato Adobe PDF
840.75 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/439220
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 0
social impact