CNR Institutional Research Information System

Explaining image classifiers generating exemplars and counter-exemplars from latent representations

Guidotti R.; Monreale A.; Matwin S.; Pedreschi D.

2020

Abstract

We present an approach to explain the decisions of black box image classifiers through synthetic exemplars and counter-exemplars learnt in the latent feature space. Our explanation method exploits the latent representations learned through an adversarial autoencoder to generate a synthetic neighborhood of the image for which an explanation is required. A decision tree is trained on a set of images represented in the latent space, and its decision rules are used to generate exemplar images showing how the original image can be modified to stay within its class. Counterfactual rules are used to generate counter-exemplars showing how the original image can "morph" into another class. The explanation also includes a saliency map highlighting the areas that contribute to its classification, and areas that push it into another class. A wide and deep experimental evaluation shows that the proposed method outperforms existing explainers in terms of fidelity, relevance, coherence, and stability, besides providing the most useful and interpretable explanations.
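The pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation; it only shows the general idea of fitting a local surrogate on a synthetic latent neighborhood. It assumes the image has already been encoded to a 2-D latent vector, replaces the classifier-plus-decoder with a hypothetical `black_box` function, uses a depth-1 tree (a decision stump) instead of a full decision tree, and picks exemplars and counter-exemplars from the sampled neighbors rather than generating and decoding new images. All function names and parameters are assumptions made for illustration.

```python
import random

def black_box(z):
    """Hypothetical stand-in for classifier(decoder(z)) on a 2-D latent point."""
    return 1 if z[0] + 0.5 * z[1] > 1.0 else 0

def sample_neighborhood(z, n=300, scale=0.5, seed=0):
    """Draw synthetic latent points around z by adding Gaussian noise."""
    rng = random.Random(seed)
    return [[v + rng.gauss(0.0, scale) for v in z] for _ in range(n)]

def fit_stump(points, labels):
    """Fit a depth-1 tree: the (feature, threshold, left label) with fewest errors."""
    best = None
    for f in range(len(points[0])):
        for t in sorted({p[f] for p in points}):
            for left_label in (0, 1):
                errors = sum(
                    (left_label if p[f] <= t else 1 - left_label) != y
                    for p, y in zip(points, labels)
                )
                if best is None or errors < best[0]:
                    best = (errors, f, t, left_label)
    return best[1], best[2], best[3]

def stump_predict(stump, z):
    """Apply the single-split rule to a latent point."""
    f, t, left_label = stump
    return left_label if z[f] <= t else 1 - left_label

def explain(z, n=300):
    """Return the surrogate rule, its fidelity, and nearest (counter-)exemplars."""
    neighbors = sample_neighborhood(z, n=n)
    labels = [black_box(p) for p in neighbors]
    stump = fit_stump(neighbors, labels)
    # Fidelity: share of neighbors where the surrogate agrees with the black box.
    fidelity = sum(stump_predict(stump, p) == y for p, y in zip(neighbors, labels)) / n
    target = black_box(z)

    def dist(p):
        return sum((a - b) ** 2 for a, b in zip(p, z))

    # Exemplars keep the class of z; counter-exemplars switch it. In both
    # cases the surrogate rule and the black box must agree on the label.
    same = [p for p, y in zip(neighbors, labels)
            if y == target and stump_predict(stump, p) == target]
    other = [p for p, y in zip(neighbors, labels)
             if y != target and stump_predict(stump, p) != target]
    return {
        "label": target,
        "rule": stump,
        "fidelity": fidelity,
        "exemplar": min(same, key=dist, default=None),
        "counter_exemplar": min(other, key=dist, default=None),
    }

result = explain([0.8, 0.2])
print(result["rule"], result["fidelity"])
```

In a real setting the latent points would be passed through a decoder to obtain images, and the difference between the exemplar and counter-exemplar images could serve as a basis for a saliency map; those steps are omitted here.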

Year: 2020
Organizational units: Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Language(s): English
Volume title: Proceedings of the AAAI Conference on Artificial Intelligence
Conference title: AAAI 2020 - Thirty-Fourth AAAI Conference on Artificial Intelligence
Volume: 34
Issue: 9
First page: 13665
Last page: 13668
Number of pages: 4
ISBN: 9781577358350
DOI: https://dx.doi.org/10.1609/aaai.v34i09.7116
URL: https://ojs.aaai.org/index.php/AAAI/article/view/7116
Conference dates: 7-12 February 2020
Conference location: New York, USA
Keywords: Explainable AI
Scopus ID: 2-s2.0-85102539057
Web of Science ID: WOS:000668126806037
Format: Electronic
International co-authors: Yes
Number of authors: 4
Full text: open
All authors: Guidotti, R.; Monreale, A.; Matwin, S.; Pedreschi, D.
MIUR login type: 273
Type: info:eu-repo/semantics/conferenceObject
Type: 04 Conference contribution::04.01 Contribution in conference proceedings

Project identifiers

Project title: SoBigData Research Infrastructure
Acronym: SoBigData
Funder: European Commission
Funding programme: Horizon 2020 Framework Programme
Contract no.: 654024

Project title: A European AI On Demand Platform and Ecosystem
Acronym: AI4EU
Funder: European Commission
Funding programme: Horizon 2020 Framework Programme
Contract no.: 825619

Project title: Toward AI Systems That Augment and Empower Humans by Understanding Us, our Society and the World Around Us
Acronym: Humane AI
Funder: European Commission
Funding programme: Horizon 2020 Framework Programme
Contract no.: 820437

Project title: PROmoting integrity in the use of RESearch results
Acronym: PRO-RES
Funder: European Commission
Funding programme: Horizon 2020 Framework Programme
Contract no.: 788352

Appears in types: 04.01 Contribution in conference proceedings

Files in this item:

File	Description	Type	License	Access	Size	Format
prod_460367-doc_179462.pdf	abele_aaai	Publisher's version (PDF)	Other license type	Open access	412.94 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/440614
