We present an approach to explain the decisions of black box image classifiers through synthetic exemplar and counterexemplar learnt in the latent feature space. Our explanation method exploits the latent representations learned through an adversarial autoencoder for generating a synthetic neighborhood of the image for which an explanation is required. A decision tree is trained on a set of images represented in the latent space, and its decision rules are used to generate exemplar images showing how the original image can be modified to stay within its class. Counterfactual rules are used to generate counter-exemplars showing how the original image can "morph"into another class. The explanation also comprehends a saliency map highlighting the areas that contribute to its classification, and areas that push it into another class. A wide and deep experimental evaluation proves that the proposed method outperforms existing explainers in terms of fidelity, relevance, coherence, and stability, besides providing the most useful and interpretable explanations.

Explaining image classifiers generating exemplars and counter-exemplars from latent representations

Guidotti R.;
2020

Abstract

We present an approach to explain the decisions of black box image classifiers through synthetic exemplar and counterexemplar learnt in the latent feature space. Our explanation method exploits the latent representations learned through an adversarial autoencoder for generating a synthetic neighborhood of the image for which an explanation is required. A decision tree is trained on a set of images represented in the latent space, and its decision rules are used to generate exemplar images showing how the original image can be modified to stay within its class. Counterfactual rules are used to generate counter-exemplars showing how the original image can "morph"into another class. The explanation also comprehends a saliency map highlighting the areas that contribute to its classification, and areas that push it into another class. A wide and deep experimental evaluation proves that the proposed method outperforms existing explainers in terms of fidelity, relevance, coherence, and stability, besides providing the most useful and interpretable explanations.
2020
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Proceedings of the AAAI Conference on Artificial Intelligence
AAAI 2020 - Thirty-Fourth AAAI Conference on Artificial Intelligence
34
9
13665
13668
4
9781577358350
https://ojs.aaai.org/index.php/AAAI/article/view/7116
07-12/02/2020
New York, USA
Explainable AI
Elettronico
4
open
Guidotti, R.; Monreale, A.; Matwin, S.; Pedreschi, D.
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
   SoBigData Research Infrastructure
   SoBigData
   European Commission
   Horizon 2020 Framework Programme
   654024

   A European AI On Demand Platform and Ecosystem
   AI4EU
   European Commission
   Horizon 2020 Framework Programme
   825619

   Toward AI Systems That Augment and Empower Humans by Understanding Us, our Society and the World Around Us
   Humane AI
   European Commission
   Horizon 2020 Framework Programme
   820437

   PROmoting integrity in the use of RESearch results
   PRO-RES
   European Commission
   Horizon 2020 Framework Programme
   788352
File in questo prodotto:
File Dimensione Formato  
prod_460367-doc_179462.pdf

accesso aperto

Descrizione: abele_aaai
Tipologia: Versione Editoriale (PDF)
Licenza: Altro tipo di licenza
Dimensione 412.94 kB
Formato Adobe PDF
412.94 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/440614
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 18
  • ???jsp.display-item.citation.isi??? 13
social impact