This sounds like that: explainable audio classification via prototypical parts

Fedele, A.; Guidotti, R.; Pedreschi, D.

doi:10.1007/978-3-031-78980-9_22

The demand for understanding machine learning models has led to the development of interpretable-by-design models that provide both outcomes and explanations. In this paper, we extend the concept of Prototypical Part Networks to the audio domain with SonicProtoPNet. This model enables a “this sounds like that” reasoning for audio classification, where a test instance audio is classified based on prototypical parts that most resemble specific areas of specific training instances. Quantitative results from genre and environmental sound classification, as well as musical instrument recognition tasks, demonstrate satisfactory per formance using the Log-Mel transformation of the audio input signal, further supported by backbone pre-training on image-input data. Furthermore, we introduce a high-quality back-soundification method for the learned sonic prototypes, facilitating intuitive interpretation of classification decisions through auditory inspection.

This sounds like that: explainable audio classification via prototypical parts

Fedele A.;Guidotti R.;Pedreschi D.

2025

Abstract

The demand for understanding machine learning models has led to the development of interpretable-by-design models that provide both outcomes and explanations. In this paper, we extend the concept of Prototypical Part Networks to the audio domain with SonicProtoPNet. This model enables a “this sounds like that” reasoning for audio classification, where a test instance audio is classified based on prototypical parts that most resemble specific areas of specific training instances. Quantitative results from genre and environmental sound classification, as well as musical instrument recognition tasks, demonstrate satisfactory per formance using the Log-Mel transformation of the audio input signal, further supported by backbone pre-training on image-input data. Furthermore, we introduce a high-quality back-soundification method for the learned sonic prototypes, facilitating intuitive interpretation of classification decisions through auditory inspection.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Strutture organizzative
	
				Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
			
	Codice ISBN
	
				9783031789793
9783031789809
			
	Parole chiave
	
				Explainable Artificial Intelligence
Explainable Audio Classification
Part Prototypical Interpretability
			
	Appare nelle tipologie:
	
				04.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
Fedele-Guidotti-Pedreschi_Springer 2025.pdf solo utenti autorizzati Descrizione: This Sounds Like That: Explainable Audio Classification via Prototypical Parts Tipologia: Versione Editoriale (PDF) Licenza: NON PUBBLICO - Accesso privato/ristretto Dimensione 1.13 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.13 MB	Adobe PDF	Visualizza/Apri Richiedi una copia
Guidotti et al_This Sounds Like That_Postprint.pdf Open Access dal 29/01/2026 Descrizione: This Sounds Like That Explainable Audio Classification via Prototypical Parts Tipologia: Documento in Post-print Licenza: Altro tipo di licenza Dimensione 6.02 MB Formato Adobe PDF Visualizza/Apri	6.02 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/549102

Citazioni

ND

2

1

CNR Institutional Research Information System

This sounds like that: explainable audio classification via prototypical parts

Fedele A.;Guidotti R.;Pedreschi D.

2025

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

CNR Institutional Research Information System

This sounds like that: explainable audio classification via prototypical parts

Fedele A.;Guidotti R.;Pedreschi D.

2025

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)