Sparse Mixtures of Shallow Linear Experts for Interpretable and Fast Outcome Prediction
Francesco Folino;Luigi Pontieri;Pietro Sabatino
2023
Abstract
(Process) Outcome Prediction entails predicting a discrete property of an unfinished process instance from its partial trace. Various outcome predictors discovered via Machine Learning (ML) methods, such as rule/tree ensembles and (deep) neural networks, have achieved top accuracy. However, their opaqueness makes them unsuitable for scenarios that require understandable outcome predictors. In line with recent efforts to mine inherently interpretable predictors, we propose training a sparse Mixture-of-Experts in which both the "gate" and the "expert" sub-nets are Logistic Regressors. This ensemble of specialized predictors is trained in an end-to-end way while restricting the number of input features used in the sub-nets, as an alternative to typical multi-step/multi-objective mining pipelines (including, e.g., a global feature selection step followed by an ML one). This enables different experts to focus on different input features when predicting the outcomes of instances in their competency regions. Test results on benchmark logs confirmed the ability of this approach to reach a compelling trade-off between accuracy and interpretability compared to existing solutions.
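To make the architecture in the abstract more concrete, the following is a minimal sketch of how a mixture of logistic-regression experts with a logistic (softmax) gate could score a single encoded trace prefix. It is an illustration under stated assumptions, not the authors' implementation: the outcome is assumed to be binary, the gate is assumed to weight all experts softly, and the per-sub-net feature restriction is approximated by keeping only the `k` largest-magnitude weights. The names `keep_top_k`, `linear`, and `moe_predict_proba` are hypothetical, and training is omitted.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def softmax(scores):
    # Subtract the maximum score before exponentiating for stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def keep_top_k(weights, k):
    # Assumed sparsifier: zero all but the k largest-magnitude weights.
    keep = sorted(range(len(weights)), key=lambda i: -abs(weights[i]))[:k]
    return [w if i in keep else 0.0 for i, w in enumerate(weights)]

def linear(weights, bias, x):
    return sum(w * xi for w, xi in zip(weights, x)) + bias

def moe_predict_proba(x, gate, experts, k):
    # gate and experts are lists of (weights, bias) pairs, one pair per expert.
    gate_probs = softmax([linear(keep_top_k(w, k), b, x) for w, b in gate])
    expert_probs = [sigmoid(linear(keep_top_k(w, k), b, x)) for w, b in experts]
    # Mixture output: gate-weighted average of the experts' probabilities.
    return sum(g * p for g, p in zip(gate_probs, expert_probs))

# Toy example: two experts over three (made-up) input features.
x = [1.0, 0.0, 2.0]
gate = [([0.5, -0.1, 0.0], 0.0), ([-0.5, 0.2, 0.1], 0.0)]
experts = [([1.0, 0.0, -0.3], 0.1), ([0.0, 0.8, 0.4], -0.2)]
p = moe_predict_proba(x, gate, experts, k=2)
```

Because every sub-net is linear and uses at most `k` features, a prediction can be read off as the gate's weights over the experts plus a few weighted features per expert, which is the kind of interpretability the abstract is aiming at.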
File | Type | License | Size | Format
---|---|---|---|---
ICPM2023_WORKSHOP.pdf (authorized users only) | Publisher's version (PDF) | Non-public, private/restricted access | 866.93 kB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.