This paper presents a novel and efficient method to compute one of the simplest and most useful building block for parallel algorithms: the parallel prefix sum operation. Besides its practical relevance, the problem achieves further interest in parallel-computation theory. We firstly describe step-by-step how parallel prefix sum is performed in parallel on GPUs. Next we propose a more efficient technique properly developed for modern graphics processors and alike processors. Our technique is able to perform the computation in such a way that minimizes both memory conflicts and memory usage. Finally we evaluate theoretically and empirically all the considered solutions in terms of efficiency, space complexity, and computational time. In order to properly conduct the theoretical analysis we used a novel computational model proposed by us in a previous work: K-model. Concerning the experiments, the results show that the proposed solution obtains better performance than the existing ones.

Designing Efficient Parallel Prefix Sum Algorithms for GPUs

2011

Abstract

This paper presents a novel and efficient method to compute one of the simplest and most useful building block for parallel algorithms: the parallel prefix sum operation. Besides its practical relevance, the problem achieves further interest in parallel-computation theory. We firstly describe step-by-step how parallel prefix sum is performed in parallel on GPUs. Next we propose a more efficient technique properly developed for modern graphics processors and alike processors. Our technique is able to perform the computation in such a way that minimizes both memory conflicts and memory usage. Finally we evaluate theoretically and empirically all the considered solutions in terms of efficiency, space complexity, and computational time. In order to properly conduct the theoretical analysis we used a novel computational model proposed by us in a previous work: K-model. Concerning the experiments, the results show that the proposed solution obtains better performance than the existing ones.
2011
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
978-1-4577-0383-6
GPGPU
File in questo prodotto:
File Dimensione Formato  
prod_206165-doc_46289.pdf

solo utenti autorizzati

Descrizione: contributo
Tipologia: Versione Editoriale (PDF)
Dimensione 131.17 kB
Formato Adobe PDF
131.17 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/182955
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? ND
social impact