Underrepresentation of Dark Skin Tone in Skin Lesion Datasets: The Role of the Explainable Techniques in Assessing the Bias

Zumpano, Ester; Vocaturo, Eugenio; Caroprese, Luciano
2025

Abstract

Advanced artificial intelligence models for skin lesion classification often suffer from performance disparities when applied to images of patients with darker skin tones, largely due to the underrepresentation of dark skin tone images in training datasets. In this study, we investigate this issue by evaluating a previously proposed explainable framework, MultiExCAM, trained on the widely used ISIC2018 dataset. We test its performance on Pipsqueak, a previously proposed dataset composed of skin lesion images from patients with dark skin tones. As expected, we observe a significant drop in classification performance when the model is applied to Pipsqueak. To better understand the source of these failures, we employ explainable artificial intelligence techniques to visualize and analyze the model's decision-making process on both datasets. Our results highlight clear differences in attention patterns and decision rationale, revealing how the lack of dark skin tone representation in the training data leads to poor generalization and biased behavior. This work emphasizes the critical role of explainable analysis in exposing and understanding model bias in clinical applications, and the necessity of inclusive datasets for fair and reliable skin lesion classification.
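
MultiExCAM itself is not reproduced in this record; purely as orientation, the sketch below shows a minimal Grad-CAM computation in PyTorch, the class-activation-map family on which such attention-pattern analyses are typically built. The ResNet-50 backbone, the choice of layer4 as target layer, and the grad_cam helper are illustrative assumptions, not the authors' implementation.

    # Minimal Grad-CAM sketch (PyTorch). Illustrative only: the backbone,
    # target layer, and helper names are assumptions, not the authors' code.
    import torch
    import torch.nn.functional as F
    from torchvision import models

    model = models.resnet50(weights=models.ResNet50_Weights.DEFAULT).eval()
    activations, gradients = {}, {}

    def fwd_hook(module, inputs, output):
        activations["feat"] = output.detach()

    def bwd_hook(module, grad_input, grad_output):
        gradients["feat"] = grad_output[0].detach()

    # Hook the last convolutional block, where features are still spatial.
    model.layer4.register_forward_hook(fwd_hook)
    model.layer4.register_full_backward_hook(bwd_hook)

    def grad_cam(image_batch, class_idx=None):
        """Return per-image heatmaps in [0, 1] showing where the model looked."""
        scores = model(image_batch)                   # (N, num_classes)
        if class_idx is None:
            class_idx = scores.argmax(dim=1)          # explain the predicted class
        class_idx = torch.as_tensor(class_idx, device=scores.device).view(-1, 1)
        model.zero_grad()
        scores.gather(1, class_idx).sum().backward()
        # Channel weights = global-average-pooled gradients; CAM = weighted sum.
        weights = gradients["feat"].mean(dim=(2, 3), keepdim=True)
        cam = F.relu((weights * activations["feat"]).sum(dim=1, keepdim=True))
        cam = F.interpolate(cam, size=image_batch.shape[2:], mode="bilinear",
                            align_corners=False)
        lo = cam.amin(dim=(2, 3), keepdim=True)
        hi = cam.amax(dim=(2, 3), keepdim=True)
        return (cam - lo) / (hi - lo + 1e-8)

Overlaying such heatmaps on lesion images from both datasets is the kind of side-by-side inspection the abstract describes for comparing attention patterns across skin tones.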
Istituto di Nanotecnologia - NANOTEC - Sede Secondaria Rende (CS)
ISBN: 9783032057266; 9783032057273
Keywords: Melanoma Classification; Dataset Bias; Explainable AI; Skin Tone Diversity
Files in this product:
ADBIS2025_Melanoma.pdf (Adobe PDF, 1.24 MB)
Access: authorized users only; a copy may be requested
License: Public domain
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/554380