CNR Institutional Research Information System

(Process) Outcome Prediction entails predicting a discrete property of an unfinished process instance from its partial trace. Various outcome predictors discovered via Machine Learning (ML) methods, like rule/tree ensembles and (deep) neural networks, have achieved top accuracy performances. However, their opaqueness makes them unsuitable for scenarios necessitating understandable outcome predictors. Aligning with recent efforts to mine inherently interpretable predictors, we suggest training a sparse Mixture-of-Experts, with the ``gate'' and ``expert'' sub-nets being Logistic Regressors. This ensemble of specialized predictors is trained in a end-to-end way while restricting the number of input features used in the sub-nets, as an alternative to typical multi-step/objective mining pipelines (including, e.g., a global feature selection step followed by an ML one). This enables different experts to focus on varied input features for predicting the outcomes of instances in their competency regions. Test results on benchmark logs confirmed the ability of this approach to reach a compelling trade-off between accuracy and interpretability compared to existing solutions.

Sparse Mixtures of Shallow Linear Experts for Interpretable and Fast Outcome Prediction

Francesco Folino;Luigi Pontieri;Pietro Sabatino

2023

Abstract

(Process) Outcome Prediction entails predicting a discrete property of an unfinished process instance from its partial trace. Various outcome predictors discovered via Machine Learning (ML) methods, like rule/tree ensembles and (deep) neural networks, have achieved top accuracy performances. However, their opaqueness makes them unsuitable for scenarios necessitating understandable outcome predictors. Aligning with recent efforts to mine inherently interpretable predictors, we suggest training a sparse Mixture-of-Experts, with the ``gate'' and ``expert'' sub-nets being Logistic Regressors. This ensemble of specialized predictors is trained in a end-to-end way while restricting the number of input features used in the sub-nets, as an alternative to typical multi-step/objective mining pipelines (including, e.g., a global feature selection step followed by an ML one). This enables different experts to focus on varied input features for predicting the outcomes of instances in their competency regions. Test results on benchmark logs confirmed the ability of this approach to reach a compelling trade-off between accuracy and interpretability compared to existing solutions.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	Strutture organizzative
	
				Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
			
	Parole chiave
	
				Process Mining
Machine Learning
XAI
Green AI

File in questo prodotto:

File	Dimensione	Formato
ICPM2023_WORKSHOP.pdf solo utenti autorizzati Tipologia: Versione Editoriale (PDF) Licenza: NON PUBBLICO - Accesso privato/ristretto Dimensione 866.93 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	866.93 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/452174

Citazioni

ND

0

ND

social impact