Breast cancer is a major disease for women, and mammographic screening proved impressive benefits in reducing mortality risk. However, false positives and false negatives still occur due to human perception, differences in breast density, and the complexity of cancer itself. Convolutional Neural Networks (CNNs) have shown promise in medical imaging issues, but they struggle with understanding long-range spatial interactions in various image patches. Vision Transformers (ViTs) have emerged as a solution to this problem. This work used a subset of the Curated Breast Imaging Subset of the Digital Database for Screening Mammography (CBIS-DDSM) to implement a ViT-based framework for breast cancer classification. Geometric and Diffuser-based data augmentation (DA) methods were applied and compared to evaluate the resulting performance improvement. The obtained results show how diffuser-based DA improves the performance of geometric DA. However, their combination allows for higher performance (accuracy = 77.01%, sensitivity = 88.89%, specificity = 68.63%) and demonstrates the feasibility and effectiveness of this approach in enhancing the model’s capabilities for breast cancer classification.

ViT-Based Classification of Mammogram Images: Impact of Data Augmentation Techniques

Militello, Carmelo;Vitabile, Salvatore
2025

Abstract

Breast cancer is a major disease for women, and mammographic screening proved impressive benefits in reducing mortality risk. However, false positives and false negatives still occur due to human perception, differences in breast density, and the complexity of cancer itself. Convolutional Neural Networks (CNNs) have shown promise in medical imaging issues, but they struggle with understanding long-range spatial interactions in various image patches. Vision Transformers (ViTs) have emerged as a solution to this problem. This work used a subset of the Curated Breast Imaging Subset of the Digital Database for Screening Mammography (CBIS-DDSM) to implement a ViT-based framework for breast cancer classification. Geometric and Diffuser-based data augmentation (DA) methods were applied and compared to evaluate the resulting performance improvement. The obtained results show how diffuser-based DA improves the performance of geometric DA. However, their combination allows for higher performance (accuracy = 77.01%, sensitivity = 88.89%, specificity = 68.63%) and demonstrates the feasibility and effectiveness of this approach in enhancing the model’s capabilities for breast cancer classification.
2025
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR - Sede Secondaria Palermo
9789819609932
9789819609949
Breast cancer
Cbis-ddsm dataset
Data augmentation impact
Diffuser
Mammographic image classification
Visual transformer
File in questo prodotto:
File Dimensione Formato  
978-981-96-0994-9_21.pdf

solo utenti autorizzati

Descrizione: manoscritto
Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 471.69 kB
Formato Adobe PDF
471.69 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/547566
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact