D3.1 Interim report on performance analysis of MAX software
P. Delugas; N. Spallanzani
2023
Abstract
WP3 carries out the continuous assessment and analysis of the parallel performance of the MaX flagship codes, pointing out the direction that development should take to exploit existing technology effectively. To this end, we use tools for code profiling and optimisation and the most recent programming models. WP3 activity supports many tasks in other work packages (WP1, WP2, WP4, WP5) by providing feedback on the progress achieved in performance, measured against the relevant metrics and parallel efficiency. Another important output of this WP is to discover and monitor code bottlenecks, to identify the code or architecture feature responsible for them (memory bandwidth, communication imbalance, latency, bandwidth to GPU, etc.), and to propose dedicated solutions, including through the engineering of ad hoc proofs of concept. Solutions to these bottlenecks that require code refactoring or the replacement of an algorithm will be implemented within WP1 by the code developers. Our activity will therefore be in continuous synergy with the development teams of the MaX flagship codes. In the following, we describe how we have organised the benchmarking and profiling activity, introduce the tools adopted (Section 3), and present some preliminary results (Sections 4, 5 and 6). The deployment status of the flagship applications on EuroHPC systems is also reported (Section 7).

File | Size | Format
---|---|---
D3.1_Interim report on performance analysis of MAX software.pdf (open access; licence: public domain) | 976.87 kB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.