Recurrent neural networks (RNN) are being extensively exploited in industry to address complex predictive tasks by leveraging on the increased availability of data from processes. However, the rationale behind model response is encoded in an implicit way, which is difficult to be explained by practitioners. If revealed, such mechanisms could provide deeper insights into RNN execution, enhancing conventional performance evaluations. We propose a new approach based on the introduction of a model-based clustering layer, constraining the network to operate on a discrete latent state representation. By processing context-input conditioned transitions between clusters, a Moore Machine characterizing the RNN computations is extracted. The proposed approach is demonstrated on both synthetic experiments from an open benchmark problem and via the application to a pilot industrial plant, by the behavior cloning of the flexible conveyor of a Remanufacturing process. The finite-state RNN attains the prediction accuracy of RNN with continuous state, providing in addition a more interpretable structure.

Learning behavioral models by recurrent neural networks with discrete latent representations with application to a flexible industrial conveyor

Brusaferri A
Primo
;
Spinelli S;Vitali A
2020

Abstract

Recurrent neural networks (RNN) are being extensively exploited in industry to address complex predictive tasks by leveraging on the increased availability of data from processes. However, the rationale behind model response is encoded in an implicit way, which is difficult to be explained by practitioners. If revealed, such mechanisms could provide deeper insights into RNN execution, enhancing conventional performance evaluations. We propose a new approach based on the introduction of a model-based clustering layer, constraining the network to operate on a discrete latent state representation. By processing context-input conditioned transitions between clusters, a Moore Machine characterizing the RNN computations is extracted. The proposed approach is demonstrated on both synthetic experiments from an open benchmark problem and via the application to a pilot industrial plant, by the behavior cloning of the flexible conveyor of a Remanufacturing process. The finite-state RNN attains the prediction accuracy of RNN with continuous state, providing in addition a more interpretable structure.
2020
Istituto di Sistemi e Tecnologie Industriali Intelligenti per il Manifatturiero Avanzato - STIIMA (ex ITIA)
Deep learning
Recurrent neural network
Discrete representation
Finite state machine
Behavior cloning
Industrial cyber physical systems
File in questo prodotto:
File Dimensione Formato  
Learning behavioral models by recurrent neural networks.pdf

accesso aperto

Tipologia: Documento in Post-print
Licenza: Creative commons
Dimensione 7.09 MB
Formato Adobe PDF
7.09 MB Adobe PDF Visualizza/Apri
1-s2.0-S0166361520304978-main.pdf

solo utenti autorizzati

Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 9.45 MB
Formato Adobe PDF
9.45 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/407552
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 5
social impact