Convolutional Neural Networks (CNNs) have become indispensable tools in skin cancer classification, aiding clinical experts to achieve earlier and more accurate diagnoses, improving treatment outcomes, and driving advancements in medical research. Despite their pivotal role, the most popular CNN architecture families exhibit a critical issue related to the distribution and quantity of available data, potentially compromising their performance and generalization abilities. This challenge is commonly overlooked in most skin lesion classification papers, which mainly rely on weighted classification techniques. Directly using appropriately dataset balancing or Transfer Learning (TL) methods, as suggested in recent studies, has the potential to deliver more satisfactory results, providing a more effective approach to addressing this issue. In the effort to tackle this problem, we provide a comprehensive quantitative evaluation aimed at identifying the most critical new emerging computational aspects and the related effective techniques. Specifically, we propose twelve Computational Models (CMs) based on four prominent CNN models with increasing structural complexity. We assess their effectiveness in both pretrained and unpretrained versions, incorporating TL strategies as well. Our experiments focus on the ISIC 2018 image dataset, a benchmark widely recognized for its extensive application in skin cancer research yet challenged by significant class imbalance issues. To mitigate this, we also randomly extracted a balanced image subset from ISIC 2018 for evaluation purposes. The experimental results, regarding four different scenarios, provide valuable insights into the design and utilization of CNNs for skin lesion classification, laying the groundwork for further investigations.

CNN Issues in Skin Lesion Classification: Data Distribution and Quantity

Ramella G.
;
Serino L.
2025

Abstract

Convolutional Neural Networks (CNNs) have become indispensable tools in skin cancer classification, aiding clinical experts to achieve earlier and more accurate diagnoses, improving treatment outcomes, and driving advancements in medical research. Despite their pivotal role, the most popular CNN architecture families exhibit a critical issue related to the distribution and quantity of available data, potentially compromising their performance and generalization abilities. This challenge is commonly overlooked in most skin lesion classification papers, which mainly rely on weighted classification techniques. Directly using appropriately dataset balancing or Transfer Learning (TL) methods, as suggested in recent studies, has the potential to deliver more satisfactory results, providing a more effective approach to addressing this issue. In the effort to tackle this problem, we provide a comprehensive quantitative evaluation aimed at identifying the most critical new emerging computational aspects and the related effective techniques. Specifically, we propose twelve Computational Models (CMs) based on four prominent CNN models with increasing structural complexity. We assess their effectiveness in both pretrained and unpretrained versions, incorporating TL strategies as well. Our experiments focus on the ISIC 2018 image dataset, a benchmark widely recognized for its extensive application in skin cancer research yet challenged by significant class imbalance issues. To mitigate this, we also randomly extracted a balanced image subset from ISIC 2018 for evaluation purposes. The experimental results, regarding four different scenarios, provide valuable insights into the design and utilization of CNNs for skin lesion classification, laying the groundwork for further investigations.
2025
Istituto per le applicazioni del calcolo - IAC - Sede Secondaria Napoli
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR - Sede Secondaria Napoli
Convolutional neural networks
Balanced image dataset
Dermoscopic image
Skin lesion classification
Transfer learning
Unbalanced image dataset
File in questo prodotto:
File Dimensione Formato  
Reprint_Access_2025.pdf

accesso aperto

Descrizione: Article
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 2 MB
Formato Adobe PDF
2 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/544884
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact