To build more accurate and trustworthy artificial intelligence algorithms in deep learning, it is essential to understand the mechanisms driving classification systems to identify their targets. Typically, post hoc methods are used to provide insights into this process. By contrast, in this work, we investigate the possibility of using class activation maps in combination with contrastive loss to enhance the reliability of train- ing of a deep learning model. MNIST and Fashion MNIST datasets are considered in our investigation since they have already proven a prac- tical starting point for assessing an almost tautologic training strategy for deep learning algorithms, given that classification targets are the pri- mary significant content of the images in these datasets. Starting from the raw comparison of accuracy and system complexity of the proposed approach, a further investigation of the technique’s feasibility in a deep learning study is conducted over six random seed splits of the training data and model performance. A modern deep learning network, such as ConvNeXT, determines whether a more robust architecture trained with the proposed mechanics provides better insights than a simple convolu- tional neural network. This investigation also addresses the importance of skip connections, structured learning layers, and feature map dimen- sions in the learning process.

You’ve got the wrong outfit: evaluating deep learning paradigms on digit and fashion recognition

Ignesti G.
;
Martinelli M.;Moroni D.
2025

Abstract

To build more accurate and trustworthy artificial intelligence algorithms in deep learning, it is essential to understand the mechanisms driving classification systems to identify their targets. Typically, post hoc methods are used to provide insights into this process. By contrast, in this work, we investigate the possibility of using class activation maps in combination with contrastive loss to enhance the reliability of train- ing of a deep learning model. MNIST and Fashion MNIST datasets are considered in our investigation since they have already proven a prac- tical starting point for assessing an almost tautologic training strategy for deep learning algorithms, given that classification targets are the pri- mary significant content of the images in these datasets. Starting from the raw comparison of accuracy and system complexity of the proposed approach, a further investigation of the technique’s feasibility in a deep learning study is conducted over six random seed splits of the training data and model performance. A modern deep learning network, such as ConvNeXT, determines whether a more robust architecture trained with the proposed mechanics provides better insights than a simple convolu- tional neural network. This investigation also addresses the importance of skip connections, structured learning layers, and feature map dimen- sions in the learning process.
2025
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
978-3-031-87662-2
Artificial Intelligence
Trustworthy Artificial Intelligence (TAI)
eXplainable Artificial Intelligence (XAI)
Loss functions
Image Reconstruction
File in questo prodotto:
File Dimensione Formato  
Ignesti-Martinelli-Moroni_ICPR 2024.pdf

solo utenti autorizzati

Descrizione: You’ve Got the Wrong Outfit: Evaluating Deep Learning Paradigms on Digit and Fashion Recognition
Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 2.04 MB
Formato Adobe PDF
2.04 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/544528
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact