An increasing number of application domains require high-throughput processing to extract insights from massive data streams. The Data Stream Processing (DSP) paradigm provides formal approaches to analyze structured data streams considered as special, unbounded relations. The most used class of stateful operators in DSP are the ones running sliding-window aggregation, which continuously extracts insights from the most recent portion of the stream. This article presents Springald, an efficient sliding-window operator leveraging GPU devices. Springald, incorporated in the WindFlow parallel library, processes out-of-order data streams with watermarks propagation. These two features-GPU processing and out-of-orderliness-make Springald a novel contribution to this research area. This article describes the methodology behind Springald, its design and implementation. We also provide an extensive experimental evaluation to understand the behavior of Springald deeply, and we showcase its superior performance against state-of-the-art competitors.

Springald: GPU-Accelerated Window-Based Aggregates over Out-of-Order Data Streams

Coppola M.
2024

Abstract

An increasing number of application domains require high-throughput processing to extract insights from massive data streams. The Data Stream Processing (DSP) paradigm provides formal approaches to analyze structured data streams considered as special, unbounded relations. The most used class of stateful operators in DSP are the ones running sliding-window aggregation, which continuously extracts insights from the most recent portion of the stream. This article presents Springald, an efficient sliding-window operator leveraging GPU devices. Springald, incorporated in the WindFlow parallel library, processes out-of-order data streams with watermarks propagation. These two features-GPU processing and out-of-orderliness-make Springald a novel contribution to this research area. This article describes the methodology behind Springald, its design and implementation. We also provide an extensive experimental evaluation to understand the behavior of Springald deeply, and we showcase its superior performance against state-of-the-art competitors.
2024
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Streams, Watermarking, Graphics processing units, Out of order, Aggregates, Distributed databases, Single instruction multiple data, Data stream processing, Window-based aggregates, Out-of-order data streams
File in questo prodotto:
File Dimensione Formato  
2024_IEEETRANS_Springald_GPU-Accelerated_Window-Based_Aggregates_Over_Out-of-Order_Data_Streams.pdf

accesso aperto

Descrizione: Springald: GPU-Accelerated Window-Based Aggregates Over Out-of-Order Data Streams
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 3.18 MB
Formato Adobe PDF
3.18 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/535417
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact