
Exploiting CNN layer activations to improve adversarial image classification

Carrara F.; Falchi F.; Amato G.
2019

Abstract

Neural networks are now used in many sectors of daily life thanks to the efficient solutions they provide for diverse tasks. Delegating choices to artificial intelligence on behalf of humans inevitably exposes these tools to fraudulent attacks. In fact, adversarial examples, intentionally crafted to fool a neural network, can dangerously induce a misclassification while appearing innocuous to a human observer. On this basis, this paper focuses on the problem of image classification and proposes an analysis to gain better insight into what happens inside a convolutional neural network (CNN) when it evaluates an adversarial example. In particular, the activations of the internal network layers are analyzed and exploited to design possible countermeasures that reduce CNN vulnerability. Experimental results confirm that layer activations can be used to detect adversarial inputs.
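The abstract's core idea, that internal layer activations of a CNN separate clean inputs from adversarial ones, can be illustrated with a minimal sketch. Everything below is an illustrative assumption, not the paper's method: the activation vectors are synthetic stand-ins (in practice they would be extracted from real convolutional layers, e.g. via forward hooks), the 0.8 mean shift for adversarial inputs is arbitrary, and the nearest-centroid rule is just one simple detector one could place on top of such features.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins for (pooled) internal CNN layer activations:
# 200 clean and 200 adversarial inputs, each a 64-dim feature vector.
# Adversarial inputs are assumed to shift the activation statistics.
n, d = 200, 64
clean = rng.normal(loc=0.0, scale=1.0, size=(n, d))
adv = rng.normal(loc=0.8, scale=1.0, size=(n, d))

# Fit a nearest-centroid detector on the first half of each set.
mu_clean = clean[:100].mean(axis=0)
mu_adv = adv[:100].mean(axis=0)

def is_adversarial(x):
    """Flag an input as adversarial if its activation vector lies
    closer to the adversarial centroid than to the clean one."""
    return np.linalg.norm(x - mu_adv) < np.linalg.norm(x - mu_clean)

# Evaluate on the held-out half of each set.
test_x = np.vstack([clean[100:], adv[100:]])
labels = np.array([False] * 100 + [True] * 100)
preds = np.array([is_adversarial(x) for x in test_x])
accuracy = (preds == labels).mean()
```

On this synthetic data the two activation distributions are well separated, so even this trivial detector scores near-perfect accuracy; the point is only that activation-space features can carry a detectable adversarial signature, which is the premise the paper investigates on real CNN layers.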
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
English
ICIP 2019 - IEEE International Conference on Image Processing
2019-September
2289-2293
978-1-5386-6249-6
https://ieeexplore.ieee.org/document/8803776
Yes, but type not specified
22-25 September, 2019
Taipei, Taiwan
Adversarial images
neural networks
layer activations
adversarial detection
3
partially_open
Caldelli R.; Becarelli R.; Carrara F.; Falchi F.; Amato G.
273
info:eu-repo/semantics/conferenceObject
04 Conference contribution::04.01 Contribution in conference proceedings
Files in this product:

prod_422758-doc_150374.pdf (not available; request a copy)
Description: Exploiting CNN layer activations to improve adversarial image classification
Type: Published version (PDF)
Size: 469.89 kB
Format: Adobe PDF

prod_422758-doc_160005.pdf (open access; view/open)
Description: Exploiting CNN layer activations to improve adversarial image classification
Type: Published version (PDF)
Size: 451.42 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/379901
Citations
  • PMC: n/a
  • Scopus: 5
  • Web of Science: 5