Deep learned models are now largely adopted in different fields, and they generally provide superior performances with respect to classical signal-based approaches. Notwithstanding this, their actual reliability when working in an unprotected environment is far enough to be proven. In this work, we consider a novel deep neural network architecture, named Neural Ordinary Differential Equations (N-ODE), that is getting particular attention due to an attractive property--a test-time tunable trade-off between accuracy and efficiency. This paper analyzes the robustness of N-ODE image classifiers when faced against a strong adversarial attack and how its effectiveness changes when varying such a tunable trade-off. We show that adversarial robustness is increased when the networks operate in different tolerance regimes during test time and training time. On this basis, we propose a novel adversarial detection strategy for N-ODE nets based on the randomization of the adaptive ODE solver tolerance. Our evaluation performed on standard image classification benchmarks shows that our detection technique provides high rejection of adversarial examples while maintaining most of the original samples under white-box attacks and zero-knowledge adversaries.

Defending Neural ODE Image Classifiers from Adversarial Attacks with Tolerance Randomization

Carrara F;Falchi F;Amato G
2021

Abstract

Deep learned models are now largely adopted in different fields, and they generally provide superior performances with respect to classical signal-based approaches. Notwithstanding this, their actual reliability when working in an unprotected environment is far enough to be proven. In this work, we consider a novel deep neural network architecture, named Neural Ordinary Differential Equations (N-ODE), that is getting particular attention due to an attractive property--a test-time tunable trade-off between accuracy and efficiency. This paper analyzes the robustness of N-ODE image classifiers when faced against a strong adversarial attack and how its effectiveness changes when varying such a tunable trade-off. We show that adversarial robustness is increased when the networks operate in different tolerance regimes during test time and training time. On this basis, we propose a novel adversarial detection strategy for N-ODE nets based on the randomization of the adaptive ODE solver tolerance. Our evaluation performed on standard image classification benchmarks shows that our detection technique provides high rejection of adversarial examples while maintaining most of the original samples under white-box attacks and zero-knowledge adversaries.
2021
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
Alberto Del Bimbo, Rita Cucchiara, Stan Sclaroff, Giovanni Maria Farinella, Tao Mei, Marco Bertini, Hugo Jair Escalante, Roberto Vezzani
Pattern Recognition. ICPR International Workshops and Challenges Virtual Event, January 10-15, 2021, Proceedings, Part VI
International Conference on Pattern Recognition ICPR 2021
425
438
978-3-030-68779-3
https://link.springer.com/chapter/10.1007%2F978-3-030-68780-9_35
Sì, ma tipo non specificato
10-15/01/2021
Milano (Virtuale)
Neural ordinary differential equation
Adversarial defense
Image classification
3
partially_open
Carrara F.; Caldelli R.; Falchi F.; Amato G.
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
   A European AI On Demand Platform and Ecosystem
   AI4EU
   H2020
   825619

   A European Excellence Centre for Media, Society and Democracy
   AI4Media
   H2020
   951911
File in questo prodotto:
File Dimensione Formato  
prod_454312-doc_175072.pdf

accesso aperto

Descrizione: Defending Neural ODE Image Classifiers from Adversarial Attacks with Tolerance Randomization
Tipologia: Versione Editoriale (PDF)
Dimensione 1 MB
Formato Adobe PDF
1 MB Adobe PDF Visualizza/Apri
prod_454312-doc_175124.pdf

non disponibili

Descrizione: Defending Neural ODE Image Classifiers from Adversarial Attacks with Tolerance Randomization
Tipologia: Versione Editoriale (PDF)
Dimensione 4.2 MB
Formato Adobe PDF
4.2 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/398281
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact