(Process) Outcome Prediction entails predicting a discrete property of an unfinished process instance from its partial trace. Various outcome predictors discovered via Machine Learning (ML) methods, like rule/tree ensembles and (deep) neural networks, have achieved top accuracy performances. However, their opaqueness makes them unsuitable for scenarios necessitating understandable outcome predictors. Aligning with recent efforts to mine inherently interpretable predictors, we suggest training a sparse Mixture-of-Experts, with the ``gate'' and ``expert'' sub-nets being Logistic Regressors. This ensemble of specialized predictors is trained in a end-to-end way while restricting the number of input features used in the sub-nets, as an alternative to typical multi-step/objective mining pipelines (including, e.g., a global feature selection step followed by an ML one). This enables different experts to focus on varied input features for predicting the outcomes of instances in their competency regions. Test results on benchmark logs confirmed the ability of this approach to reach a compelling trade-off between accuracy and interpretability compared to existing solutions.

Sparse Mixtures of Shallow Linear Experts for Interpretable and Fast Outcome Prediction

Francesco Folino;Luigi Pontieri;Pietro Sabatino
2023

Abstract

(Process) Outcome Prediction entails predicting a discrete property of an unfinished process instance from its partial trace. Various outcome predictors discovered via Machine Learning (ML) methods, like rule/tree ensembles and (deep) neural networks, have achieved top accuracy performances. However, their opaqueness makes them unsuitable for scenarios necessitating understandable outcome predictors. Aligning with recent efforts to mine inherently interpretable predictors, we suggest training a sparse Mixture-of-Experts, with the ``gate'' and ``expert'' sub-nets being Logistic Regressors. This ensemble of specialized predictors is trained in a end-to-end way while restricting the number of input features used in the sub-nets, as an alternative to typical multi-step/objective mining pipelines (including, e.g., a global feature selection step followed by an ML one). This enables different experts to focus on varied input features for predicting the outcomes of instances in their competency regions. Test results on benchmark logs confirmed the ability of this approach to reach a compelling trade-off between accuracy and interpretability compared to existing solutions.
2023
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Process Mining
Machine Learning
XAI
Green AI
File in questo prodotto:
File Dimensione Formato  
ICPM2023_WORKSHOP.pdf

solo utenti autorizzati

Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 866.93 kB
Formato Adobe PDF
866.93 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/452174
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact