CNR Institutional Research Information System

Detecting deviant execution instances (e.g., related to security attacks, frauds, and faults~\cite{Dumas14}) of a business process is relevant for modern enterprises and organizations. Recent works~\cite{ismis22} demonstrated that learning deep \emph{Deviance Detection Models} (DDMs) out of process traces via (semi-)supervised methods outperforms traditional approaches based on standard Machine Learning solutions. However, deep models require to be trained with a large number of examples, which may not be available in real-life settings, especially in Green-AI applications where restrictions to data access and processing operations could be enforced~\cite{adadi2021}. To better suit such challenging applications, we propose a novel approach to discovering a deep DDM that deals with the scarcity of training data by leveraging a complementary self-supervised learning task. %\cite{mtl}. Specifically, the DDM discovery problem is formalized as a Multi-Task Learning one that simultaneously minimizes the classification loss and the reconstruction error of an auxiliary auto-encoder sub-net. This articulated learning scheme allows us to grasp an additional source of supervision complementary to data labels. For the sake of interpretability and faithful explanation of predictions, our DDM is trained with a tabular representation of process traces~\cite{Dumas14}, on top of which two parallel stacks of feature-representation layers are learned efficiently (and robustly to overfitting) by leveraging residual-like skip connections. The approach effectively deals with the challenging combination of data efficiency and explainability requirements in a case study concerning the execution traces of a real-life process. Future research will investigate alternative multi-objective optimization strategies and scalable uncertainty quantification methods to identify out-of-distribution instances.

Data-Efficient Deep Learning Approach for Explainable Process Deviance Discovery

Francesco Folino;Gianluigi Folino;Massimo Guarascio;Luigi Pontieri

2023

Abstract

Detecting deviant execution instances (e.g., related to security attacks, frauds, and faults~\cite{Dumas14}) of a business process is relevant for modern enterprises and organizations. Recent works~\cite{ismis22} demonstrated that learning deep \emph{Deviance Detection Models} (DDMs) out of process traces via (semi-)supervised methods outperforms traditional approaches based on standard Machine Learning solutions. However, deep models require to be trained with a large number of examples, which may not be available in real-life settings, especially in Green-AI applications where restrictions to data access and processing operations could be enforced~\cite{adadi2021}. To better suit such challenging applications, we propose a novel approach to discovering a deep DDM that deals with the scarcity of training data by leveraging a complementary self-supervised learning task. %\cite{mtl}. Specifically, the DDM discovery problem is formalized as a Multi-Task Learning one that simultaneously minimizes the classification loss and the reconstruction error of an auxiliary auto-encoder sub-net. This articulated learning scheme allows us to grasp an additional source of supervision complementary to data labels. For the sake of interpretability and faithful explanation of predictions, our DDM is trained with a tabular representation of process traces~\cite{Dumas14}, on top of which two parallel stacks of feature-representation layers are learned efficiently (and robustly to overfitting) by leveraging residual-like skip connections. The approach effectively deals with the challenging combination of data efficiency and explainability requirements in a case study concerning the execution traces of a real-life process. Future research will investigate alternative multi-objective optimization strategies and scalable uncertainty quantification methods to identify out-of-distribution instances.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	Strutture organizzative
	
				Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
			
	Parole chiave
	
				Process Deviance Mining
Deep Multi-Task Learning
Green AI
			
	Appare nelle tipologie:
	
				04.02 Abstract in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
prod_483132-doc_199021.pdf solo utenti autorizzati Descrizione: 2023_Numta Tipologia: Versione Editoriale (PDF) Licenza: NON PUBBLICO - Accesso privato/ristretto Dimensione 112.39 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	112.39 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/460299

Citazioni

ND

ND

ND

social impact