A technique for dense linear system solution is presented which reaches maximum performances on attached processors like FPS-120, 5000 and X64 using the Fortran language with calls to the vector routines. Starting from the Dongarra's LU factorization algorithm the key idea is to carry out a pseudo-transposition of the lower triangular matrix L (including the main diagonal) around the minor diagonal. The pseudo-transposition allows to carry out all the matrix vector operations involved in LU factorization with only stride 1 dot product operations which, using the TM Auxiliary Memory and the TMDOT routine, can be executed in the FPS processor obtaining the maximum speed. Since the algorithm uses only vector instructions it is fully portable on all the FPS 38/64 bit machines and in general on all the vector computers with a similar memory structure. Furthermore the algorithm can be easily translated into the new FORTRAN 8X, which will probably become the standard for future SIMD computers for numerical applications.

How to reach maximum theoretical performance in solving linear equations systems on FPS Architecture 38/64 bits

A Corana;L Marconi;C Martini;G Morgavi;
1987

Abstract

A technique for dense linear system solution is presented which reaches maximum performances on attached processors like FPS-120, 5000 and X64 using the Fortran language with calls to the vector routines. Starting from the Dongarra's LU factorization algorithm the key idea is to carry out a pseudo-transposition of the lower triangular matrix L (including the main diagonal) around the minor diagonal. The pseudo-transposition allows to carry out all the matrix vector operations involved in LU factorization with only stride 1 dot product operations which, using the TM Auxiliary Memory and the TMDOT routine, can be executed in the FPS processor obtaining the maximum speed. Since the algorithm uses only vector instructions it is fully portable on all the FPS 38/64 bit machines and in general on all the vector computers with a similar memory structure. Furthermore the algorithm can be easily translated into the new FORTRAN 8X, which will probably become the standard for future SIMD computers for numerical applications.
Campo DC Valore Lingua
dc.authority.orgunit Istituto di Elettronica e di Ingegneria dell'Informazione e delle Telecomunicazioni - IEIIT -
dc.authority.people A Corana it
dc.authority.people L Marconi it
dc.authority.people C Martini it
dc.authority.people G Morgavi it
dc.authority.people S Ridella it
dc.authority.people C Rolando it
dc.collection.id.s 71c7200a-7c5f-4e83-8d57-d3d2ba88f40d *
dc.collection.name 04.01 Contributo in Atti di convegno *
dc.contributor.appartenenza Istituto di Elettronica e di Ingegneria dell'Informazione e delle Telecomunicazioni - IEIIT *
dc.contributor.appartenenza Istituto di linguistica computazionale "Antonio Zampolli" - ILC *
dc.contributor.appartenenza.mi 877 *
dc.contributor.appartenenza.mi 918 *
dc.date.accessioned 2024/02/20 09:44:56 -
dc.date.available 2024/02/20 09:44:56 -
dc.date.issued 1987 -
dc.description.abstracteng A technique for dense linear system solution is presented which reaches maximum performances on attached processors like FPS-120, 5000 and X64 using the Fortran language with calls to the vector routines. Starting from the Dongarra's LU factorization algorithm the key idea is to carry out a pseudo-transposition of the lower triangular matrix L (including the main diagonal) around the minor diagonal. The pseudo-transposition allows to carry out all the matrix vector operations involved in LU factorization with only stride 1 dot product operations which, using the TM Auxiliary Memory and the TMDOT routine, can be executed in the FPS processor obtaining the maximum speed. Since the algorithm uses only vector instructions it is fully portable on all the FPS 38/64 bit machines and in general on all the vector computers with a similar memory structure. Furthermore the algorithm can be easily translated into the new FORTRAN 8X, which will probably become the standard for future SIMD computers for numerical applications. -
dc.description.affiliations ICE-CNR; ICE-CNR; ICE-CNR; ICE-CNR ;ICE-CNR e DIBE-Università di Genova; ICE-CNR -
dc.description.allpeople Corana, A; Marconi, L; Martini, C; Morgavi, G; Ridella, S; Rolando, C -
dc.description.allpeopleoriginal A. Corana, L. Marconi, C. Martini, G. Morgavi, S. Ridella, C. Rolando -
dc.description.fulltext none en
dc.description.numberofauthors 6 -
dc.identifier.uri https://hdl.handle.net/20.500.14243/201227 -
dc.language.iso eng -
dc.relation.conferencedate 26-29 April 1987 -
dc.relation.conferencename Array Conference -
dc.relation.conferenceplace Montreal (Canada) -
dc.relation.firstpage 173 -
dc.relation.lastpage 179 -
dc.subject.keywords solution of linear equations; optimized algorithms; vector processors; FPS architecttures; performance evaluation -
dc.subject.singlekeyword solution of linear equations *
dc.subject.singlekeyword optimized algorithms *
dc.subject.singlekeyword vector processors *
dc.subject.singlekeyword FPS architecttures *
dc.subject.singlekeyword performance evaluation *
dc.title How to reach maximum theoretical performance in solving linear equations systems on FPS Architecture 38/64 bits en
dc.type.driver info:eu-repo/semantics/conferenceObject -
dc.type.full 04 Contributo in convegno::04.01 Contributo in Atti di convegno it
dc.type.miur 273 -
dc.type.referee Sì, ma tipo non specificato -
dc.ugov.descaux1 247767 -
iris.orcid.lastModifiedDate 2024/04/04 15:42:48 *
iris.orcid.lastModifiedMillisecond 1712238168399 *
iris.sitodocente.maxattempts 1 -
Appare nelle tipologie: 04.01 Contributo in Atti di convegno
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/201227
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact