Exposing racial dialect bias in abusive language detection: can explainability play a role?

Manerba, Mm; Morini, V

doi:10.1007/978-3-031-23618-1_32

Biases can arise and be introduced during each phase of a supervised learning pipeline, eventually leading to harm. Within the task of automatic abusive language detection, this matter becomes particularly severe since unintended bias towards sensitive topics such as gender, sexual orientation, or ethnicity can harm underrepresented groups. The role of the datasets used to train these models is crucial to address these challenges. In this contribution, we investigate whether explainability methods can expose racial dialect bias attested within a popular dataset for abusive language detection. Through preliminary experiments, we found that pure explainability techniques cannot effectively uncover biases within the dataset under analysis: the rooted stereotypes are often more implicit and complex to retrieve.

Exposing racial dialect bias in abusive language detection: can explainability play a role?

Manerba MM;Morini V

2023

Abstract

Biases can arise and be introduced during each phase of a supervised learning pipeline, eventually leading to harm. Within the task of automatic abusive language detection, this matter becomes particularly severe since unintended bias towards sensitive topics such as gender, sexual orientation, or ethnicity can harm underrepresented groups. The role of the datasets used to train these models is crucial to address these challenges. In this contribution, we investigate whether explainability methods can expose racial dialect bias attested within a popular dataset for abusive language detection. Through preliminary experiments, we found that pure explainability techniques cannot effectively uncover biases within the dataset under analysis: the rooted stereotypes are often more implicit and complex to retrieve.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	Strutture organizzative
	
				Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
			
	Lingua/e
	
				Inglese
			
	Supervisori e coordinatori esterni
	
				Koprinska I. et al.
			
	Titolo del Volume
	
				Machine Learning and Principles and Practice of Knowledge Discovery in Databases
			
	Serie
	
				COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE (PRINT)
			
	Titolo del convegno
	
				ECML PKDD 2022 - Joint European Conference on Machine Learning and Knowledge Discovery in Databases
			
	Da pagina
	
				483
			
	A pagina
	
				497
			
	Codice ISBN
	
				978-3-031-23617-4
			
	Codice DOI
	
				https://dx.doi.org/10.1007/978-3-031-23618-1_32
			
	URL
	
				https://link.springer.com/chapter/10.1007/978-3-031-23618-1_32
			
	Periodo del Convegno
	
				19-23/09/2022
			
	Luogo del Convegno
	
				Grenoble, France
			
	Parole chiave
	
				ML
NLP
Explainability
Interpretability
ML Evaluation
Fairness in ML
Algorithmic bias
Bias discovery
Algorithmic auditing
Data awareness
Discrimination
			
	Codice Scopus
	
				2-s2.0-85149840160
			
	Codice Web of Science
	
				WOS:000967751800032
			
	Numero autori
	
				1
			
	Fulltext
	
				open
			
	Tutti gli autori
	
						Manerba M.M.; Morini V.
					
	Tipologia Login Miur
	
				273
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Tipologia
	
				04 Contributo in convegno::04.01 Contributo in Atti di convegno
			
	Identificativo progetto
	
	Titolo Progetto
	
									SoBigData++: European Integrated Infrastructure for Social Mining and Big Data Analytics
								
	Acronimo
	
									SoBigData-PlusPlus
								
	Finanziamento
	
									H2020
								
	N. Contratto
	
									871042
								
	Titolo Progetto
	
									HumanE AI Network
								
	Acronimo
	
									HumanE-AI-Net
								
	Finanziamento
	
									H2020
								
	N. Contratto
	
									952026
								
	Titolo Progetto
	
									Foundations of Trustworthy AI - Integrating Reasoning, Learning and Optimization
								
	Acronimo
	
									TAILOR
								
	Finanziamento
	
									H2020
								
	N. Contratto
	
									952215
								
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
prod_479349-doc_196638.pdf accesso aperto Descrizione: Preprint - Exposing racial dialect bias in abusive language detection: can explainability play a role? Tipologia: Versione Editoriale (PDF) Dimensione 840.75 kB Formato Adobe PDF Visualizza/Apri	840.75 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/439220

Citazioni

ND

2

0

CNR Institutional Research Information System

Exposing racial dialect bias in abusive language detection: can explainability play a role?

Manerba MM;Morini V

2023

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

CNR Institutional Research Information System

Exposing racial dialect bias in abusive language detection: can explainability play a role?

Manerba MM;Morini V

2023

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)