Detecting adversarial inputs by looking in the black box
Carrara F.; Falchi F.; Amato G.
2019
Abstract
The astonishing and cryptic effectiveness of Deep Neural Networks comes with a critical vulnerability to adversarial inputs: samples maliciously crafted to confuse and hinder machine learning models. Insights into the internal representations learned by deep models can help to explain their decisions and estimate their confidence, which can enable us to trace, characterise, and filter out adversarial attacks.
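The abstract describes detecting adversarial inputs by inspecting a model's internal representations rather than only its final output. As a minimal illustration of that idea (not the authors' exact method), the sketch below extracts an intermediate PyTorch activation with a forward hook and flags inputs whose representation lies far from known-benign features; the choice of layer, the k-NN scoring rule, and the threshold are all assumptions made here for illustration.

```python
import torch
import torchvision.models as models

# Stand-in classifier; the paper's setting is any deep model under attack.
model = models.resnet50(weights=models.ResNet50_Weights.DEFAULT).eval()

# Capture an internal representation with a forward hook on the penultimate layer.
features = {}
def save_features(module, inputs, output):
    features["avgpool"] = torch.flatten(output, 1)  # (N, 2048) feature vectors

model.avgpool.register_forward_hook(save_features)

@torch.no_grad()
def internal_representation(x):
    model(x)
    return features["avgpool"]

def knn_score(query, reference, k=5):
    # Mean L2 distance to the k nearest known-benign representations:
    # large scores suggest the input lies off the benign feature manifold.
    d = torch.cdist(query, reference)
    return d.topk(k, largest=False).values.mean(dim=1)

# Usage sketch with random tensors standing in for real images.
benign_feats = internal_representation(torch.randn(64, 3, 224, 224))
test_feats = internal_representation(torch.randn(8, 3, 224, 224))
scores = knn_score(test_feats, benign_feats)
threshold = 25.0  # illustrative value; calibrate on held-out benign/adversarial data
flagged_as_adversarial = scores > threshold
```

In this framing the detector never touches the classifier's decision rule; it only asks whether the internal activation pattern resembles those produced by legitimate data, which is what lets it trace and filter attacks that fool the output layer.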
Files in this record:

File | Description | Type | Access | Size | Format
---|---|---|---|---|---
prod_404617-doc_150368.pdf | Detecting adversarial inputs by looking in the black box | Publisher's version (PDF) | Open access | 577.72 kB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.