Self-diagnosis of systems comprising large numbers of processors has been studied extensively in the literature. The APEmille SIMD machine, a project of the National Institute of Nuclear Physics (INFN)of Italy, was offered as a test bed for a self-diagnosis strategy based on a comparison model. Because of the general machine architecture and some design constraints,the standard assumptions of the existing diagnosis models are not completely ful_lled by the diagnosis support built in APEmille. This circumstance led to the development of a specific diagnostic model derived from the PMC and comparison models. The new model introduces the concept of direction-related and direction-independent faults. The consistency of this model with the APEmille architecture is discussed, and possible fault scenarios which are particularly critical for the correctness of the diagnosis are examined. It is shown that the limited hardware redundancy, extended with simple functional tests, is sufficient for obtaining valid diagnosis with the presented model.

Diagnostic model and diagnosis algorithm of a SIMD computer

Chessa S;
1999

Abstract

Self-diagnosis of systems comprising large numbers of processors has been studied extensively in the literature. The APEmille SIMD machine, a project of the National Institute of Nuclear Physics (INFN)of Italy, was offered as a test bed for a self-diagnosis strategy based on a comparison model. Because of the general machine architecture and some design constraints,the standard assumptions of the existing diagnosis models are not completely ful_lled by the diagnosis support built in APEmille. This circumstance led to the development of a specific diagnostic model derived from the PMC and comparison models. The new model introduces the concept of direction-related and direction-independent faults. The consistency of this model with the APEmille architecture is discussed, and possible fault scenarios which are particularly critical for the correctness of the diagnosis are examined. It is shown that the limited hardware redundancy, extended with simple functional tests, is sufficient for obtaining valid diagnosis with the presented model.
1999
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
3-540-66483-1
Fault-tolerance
System-level diagnosis
Reliability
testing and fault tolerance
Performance of systems
File in questo prodotto:
File Dimensione Formato  
prod_407702-doc_142917.pdf

solo utenti autorizzati

Descrizione: Diagnostic model and diagnosis algorithm of a SIMD computer
Tipologia: Versione Editoriale (PDF)
Dimensione 252.14 kB
Formato Adobe PDF
252.14 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/394334
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact