A highly available multimicroprocessor system for real-time applications

Ciompi P.
1983

Abstract

This paper describes the implementation of a highly available multimicroprocessor system based on M.A.R.A. (Modular Architecture for Real Time Processing), the multimicroprocessor architecture developed by Selenia. The proposed solution is based on the full duplication of a M.A.R.A. node. Fault detection is performed independently on the two nodes. At any time, only one of the two nodes (the Master Node) controls the application, while the other (the Slave Node) is always ready to replace it. When a fault is detected, the Master Node communicates its unavailability to the Slave Node and disconnects itself from the application. The Slave Node then resumes control of the application, starting from the most recent consistent system state. To avoid any loss of application state, during normal operation the Slave Node continuously maintains and updates a copy of the system state on the basis of information provided by the Master Node. The application software is structured in atomic transactions. An atomic transaction is a set of actions that updates the system state so that either all the actions are executed (transaction commit) or none of them is (transaction abort). Therefore, if a fault occurs in the Master Node, the Slave Node resumes the consistent system state by aborting the transactions not yet committed and restoring them; it also completes the commit phase of any transaction that was committing in the Master Node when the fault occurred. The application software is supported by a run-time environment that makes the duplicated configuration of the system transparent.
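The takeover rule in the abstract can be illustrated with a minimal Python sketch. It is not the M.A.R.A. implementation: the class `SlaveNode`, the notification methods (`on_begin`, `on_commit_started`, `on_commit_done`), the `Phase` enum, and the assumption that the Master ships a transaction's updates when it begins are all hypothetical, chosen only to show how a Slave could complete committing transactions and abort the rest.

```python
from dataclasses import dataclass, field
from enum import Enum


class Phase(Enum):
    ACTIVE = "active"          # updates recorded, nothing applied yet
    COMMITTING = "committing"  # commit started on the Master, possibly unfinished


@dataclass
class Transaction:
    updates: dict              # key -> new value, applied all-or-nothing
    phase: Phase = Phase.ACTIVE


@dataclass
class SlaveNode:
    state: dict = field(default_factory=dict)    # copy of the committed system state
    pending: dict = field(default_factory=dict)  # txn id -> in-flight Transaction

    # Hypothetical notifications received from the Master in normal operation.
    def on_begin(self, txn_id, updates):
        self.pending[txn_id] = Transaction(dict(updates))

    def on_commit_started(self, txn_id):
        self.pending[txn_id].phase = Phase.COMMITTING

    def on_commit_done(self, txn_id):
        self.state.update(self.pending.pop(txn_id).updates)

    def take_over(self):
        """Run when the Master reports that it is unavailable.

        Returns the ids of the aborted transactions so the caller can
        run them again from the consistent state.
        """
        in_flight, self.pending = self.pending, {}
        aborted = []
        for txn_id, txn in in_flight.items():
            if txn.phase is Phase.COMMITTING:
                self.state.update(txn.updates)  # complete the commit phase
            else:
                aborted.append(txn_id)          # abort: none of its updates apply
        return aborted


# Example run: txn 1 committed, txn 2 was committing, txn 3 had only begun.
slave = SlaveNode()
slave.on_begin(1, {"x": 1})
slave.on_commit_started(1)
slave.on_commit_done(1)
slave.on_begin(2, {"x": 2, "y": 5})
slave.on_commit_started(2)
slave.on_begin(3, {"z": 9})
aborted = slave.take_over()
```

In this sketch the Slave never applies a partial transaction: an update set reaches `state` only through a finished or completed commit, which is what keeps the copy consistent at the moment of takeover.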
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Computer applications
microprocessors
multiprocessing systems

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/410867