An increasing number of application domains require high-throughput processing to extract insights from massive data streams. The Data Stream Processing (DSP) paradigm provides formal approaches to analyze structured data streams considered as special, unbounded relations. The most used class of stateful operators in DSP are the ones running sliding-window aggregation, which continuously extracts insights from the most recent portion of the stream. This article presents Springald, an efficient sliding-window operator leveraging GPU devices. Springald, incorporated in the WindFlow parallel library, processes out-of-order data streams with watermarks propagation. These two features-GPU processing and out-of-orderliness-make Springald a novel contribution to this research area. This article describes the methodology behind Springald, its design and implementation. We also provide an extensive experimental evaluation to understand the behavior of Springald deeply, and we showcase its superior performance against state-of-the-art competitors.
Springald: GPU-Accelerated Window-Based Aggregates over Out-of-Order Data Streams
Coppola M.
2024
Abstract
An increasing number of application domains require high-throughput processing to extract insights from massive data streams. The Data Stream Processing (DSP) paradigm provides formal approaches to analyze structured data streams considered as special, unbounded relations. The most used class of stateful operators in DSP are the ones running sliding-window aggregation, which continuously extracts insights from the most recent portion of the stream. This article presents Springald, an efficient sliding-window operator leveraging GPU devices. Springald, incorporated in the WindFlow parallel library, processes out-of-order data streams with watermarks propagation. These two features-GPU processing and out-of-orderliness-make Springald a novel contribution to this research area. This article describes the methodology behind Springald, its design and implementation. We also provide an extensive experimental evaluation to understand the behavior of Springald deeply, and we showcase its superior performance against state-of-the-art competitors.File | Dimensione | Formato | |
---|---|---|---|
2024_IEEETRANS_Springald_GPU-Accelerated_Window-Based_Aggregates_Over_Out-of-Order_Data_Streams.pdf
accesso aperto
Descrizione: Springald: GPU-Accelerated Window-Based Aggregates Over Out-of-Order Data Streams
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
3.18 MB
Formato
Adobe PDF
|
3.18 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.