WP3 takes care of the continuous assessment and analysis of the parallel performance of the MAX flagship codes, pointing up the direction for the development aimed at the effective exploitation of the existing technology. To this aim, we make use of tools for code profiling and optimisation and of the most recent programming models. WP3 activity is functional to many other tasks in different work packages (WP1, WP2, WP4, WP5), in order to provide feedback on the progress obtained in terms of performance enhancement with respect to the relevant metrics and to the parallel efficiency. On the other hand, an important output of this WP is to discover and monitor code bottlenecks, to identify the code or architecture feature responsible for them (memory bandwidth, communication imbalance, latency, bandwidth to GPU, etc), and to propose dedicated solutions also through the engineering of ad-hoc proof-of-concepts. The solutions of these bottlenecks that require code refactoring or replacement of an algorithm will be implemented within WP1 by the code developers. Our activity will therefore be in continuous synergy with the developing teams of the MaX flagship codes. In the following we will describe how we have organised the benchmarking and profiling activity, introducing the tools adopted (Section 3) and presenting some preliminary results (Section 4, 5 and 6). The deployment status of flagship applications on EuroHPC systems will also be reported (Section 7).

D3.1 Interim report on performance analysis of MAX software

P. Delugas;N. Spallanzani;
2023

Abstract

WP3 takes care of the continuous assessment and analysis of the parallel performance of the MAX flagship codes, pointing up the direction for the development aimed at the effective exploitation of the existing technology. To this aim, we make use of tools for code profiling and optimisation and of the most recent programming models. WP3 activity is functional to many other tasks in different work packages (WP1, WP2, WP4, WP5), in order to provide feedback on the progress obtained in terms of performance enhancement with respect to the relevant metrics and to the parallel efficiency. On the other hand, an important output of this WP is to discover and monitor code bottlenecks, to identify the code or architecture feature responsible for them (memory bandwidth, communication imbalance, latency, bandwidth to GPU, etc), and to propose dedicated solutions also through the engineering of ad-hoc proof-of-concepts. The solutions of these bottlenecks that require code refactoring or replacement of an algorithm will be implemented within WP1 by the code developers. Our activity will therefore be in continuous synergy with the developing teams of the MaX flagship codes. In the following we will describe how we have organised the benchmarking and profiling activity, introducing the tools adopted (Section 3) and presenting some preliminary results (Section 4, 5 and 6). The deployment status of flagship applications on EuroHPC systems will also be reported (Section 7).
2023
Istituto Nanoscienze - NANO - Sede Secondaria Modena
Istituto Officina dei Materiali - IOM -
Rapporto intermedio di progetto
MaX
File in questo prodotto:
File Dimensione Formato  
D3.1_Interim report on performance analysis of MAX software.pdf

accesso aperto

Licenza: Dominio pubblico
Dimensione 976.87 kB
Formato Adobe PDF
976.87 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/516083
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact