Current workstations can offer really amazing raw computational power: up to 10 TFlops on a single machine equipped with multiple CPUs and accelerators as the Intel Xeon Phi or GPU devices. Such results can only be achieved with a massive parallelism of computational devices, thus the actual barrier posed by the exploitation of modern heterogeneous HPC resources is the difficulty in development and/or (performance) efficient porting of software on such architectures. In this paper, we present an experimental study about achievable performance of a widely used, computational intensive application the Fourier Transform, i.e. Discrete Fourier Transform (DFT) and Fast Fourier Transform. We propose an evaluation of the benefits obtained exploiting such resources in terms of performance and programming efforts in the development of the code with a emphasis on the programming approach adopted for code parallelization. With the exception of the interesting performance achieved exploiting GPU for the DFT algorithm, the use state-ofthe- art software libraries provide the best solution since they represent a good compromise to balance programming efforts and performance achievements. © 2014 IEEE.

Compute intensive algorithm on heterogeneous system: A case study about fourier transform

A Galizia;E Danovaro;G Ripepi;A Clematis
2014

Abstract

Current workstations can offer really amazing raw computational power: up to 10 TFlops on a single machine equipped with multiple CPUs and accelerators as the Intel Xeon Phi or GPU devices. Such results can only be achieved with a massive parallelism of computational devices, thus the actual barrier posed by the exploitation of modern heterogeneous HPC resources is the difficulty in development and/or (performance) efficient porting of software on such architectures. In this paper, we present an experimental study about achievable performance of a widely used, computational intensive application the Fourier Transform, i.e. Discrete Fourier Transform (DFT) and Fast Fourier Transform. We propose an evaluation of the benefits obtained exploiting such resources in terms of performance and programming efforts in the development of the code with a emphasis on the programming approach adopted for code parallelization. With the exception of the interesting performance achieved exploiting GPU for the DFT algorithm, the use state-ofthe- art software libraries provide the best solution since they represent a good compromise to balance programming efforts and performance achievements. © 2014 IEEE.
2014
Istituto di Matematica Applicata e Tecnologie Informatiche - IMATI -
Inglese
2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing
2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing
223
227
http://www.scopus.com/inward/record.url?eid=2-s2.0-84899445044&partnerID=q2rCbXpz
IEEE COMPUTER SOC
LOS ALAMITOS, CA
STATI UNITI D'AMERICA
Sì, ma tipo non specificato
12-14 Febbraio 2014
Torino, Italia
Complex Heterogeneous System
Fourier Transform
Parallel Programming Model
4
restricted
Galizia, A; Danovaro, E; Ripepi, G; Clematis, A
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_281025-doc_79885.pdf

solo utenti autorizzati

Descrizione: Compute intensive algorithm on heterogeneous system: A case study about fourier transform
Dimensione 165.55 kB
Formato Adobe PDF
165.55 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/244990
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact