Detecting deviant execution instances (e.g., related to security attacks, frauds, and faults~\cite{Dumas14}) of a business process is relevant for modern enterprises and organizations. Recent works~\cite{ismis22} demonstrated that learning deep \emph{Deviance Detection Models} (DDMs) out of process traces via (semi-)supervised methods outperforms traditional approaches based on standard Machine Learning solutions. However, deep models require to be trained with a large number of examples, which may not be available in real-life settings, especially in Green-AI applications where restrictions to data access and processing operations could be enforced~\cite{adadi2021}. To better suit such challenging applications, we propose a novel approach to discovering a deep DDM that deals with the scarcity of training data by leveraging a complementary self-supervised learning task. %\cite{mtl}. Specifically, the DDM discovery problem is formalized as a Multi-Task Learning one that simultaneously minimizes the classification loss and the reconstruction error of an auxiliary auto-encoder sub-net. This articulated learning scheme allows us to grasp an additional source of supervision complementary to data labels. For the sake of interpretability and faithful explanation of predictions, our DDM is trained with a tabular representation of process traces~\cite{Dumas14}, on top of which two parallel stacks of feature-representation layers are learned efficiently (and robustly to overfitting) by leveraging residual-like skip connections. The approach effectively deals with the challenging combination of data efficiency and explainability requirements in a case study concerning the execution traces of a real-life process. Future research will investigate alternative multi-objective optimization strategies and scalable uncertainty quantification methods to identify out-of-distribution instances.

Data-Efficient Deep Learning Approach for Explainable Process Deviance Discovery

Francesco Folino;Gianluigi Folino;Massimo Guarascio;Luigi Pontieri
2023

Abstract

Detecting deviant execution instances (e.g., related to security attacks, frauds, and faults~\cite{Dumas14}) of a business process is relevant for modern enterprises and organizations. Recent works~\cite{ismis22} demonstrated that learning deep \emph{Deviance Detection Models} (DDMs) out of process traces via (semi-)supervised methods outperforms traditional approaches based on standard Machine Learning solutions. However, deep models require to be trained with a large number of examples, which may not be available in real-life settings, especially in Green-AI applications where restrictions to data access and processing operations could be enforced~\cite{adadi2021}. To better suit such challenging applications, we propose a novel approach to discovering a deep DDM that deals with the scarcity of training data by leveraging a complementary self-supervised learning task. %\cite{mtl}. Specifically, the DDM discovery problem is formalized as a Multi-Task Learning one that simultaneously minimizes the classification loss and the reconstruction error of an auxiliary auto-encoder sub-net. This articulated learning scheme allows us to grasp an additional source of supervision complementary to data labels. For the sake of interpretability and faithful explanation of predictions, our DDM is trained with a tabular representation of process traces~\cite{Dumas14}, on top of which two parallel stacks of feature-representation layers are learned efficiently (and robustly to overfitting) by leveraging residual-like skip connections. The approach effectively deals with the challenging combination of data efficiency and explainability requirements in a case study concerning the execution traces of a real-life process. Future research will investigate alternative multi-objective optimization strategies and scalable uncertainty quantification methods to identify out-of-distribution instances.
2023
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Process Deviance Mining
Deep Multi-Task Learning
Green AI
File in questo prodotto:
File Dimensione Formato  
prod_483132-doc_199021.pdf

solo utenti autorizzati

Descrizione: 2023_Numta
Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 112.39 kB
Formato Adobe PDF
112.39 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/460299
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact