Publicações

191 entradas « ‹ 1 de 4 › »

2025

Leonarczyk, Ricardo; Mencagli, Gabriele; Griebler, Dalvan

Self-Adaptive Micro-Batching for Low-Latency GPU-Accelerated Stream Processing Journal Article doi

International Journal of Parallel Programming, 53 (2), pp. 14, 2025, ISSN: 0885-7458.

@article{LEONARCZYK:IJPP:25b,
title = {Self-Adaptive Micro-Batching for Low-Latency GPU-Accelerated Stream Processing},
author = {Ricardo Leonarczyk and Gabriele Mencagli and Dalvan Griebler},
doi = {10.1007/s10766-025-00793-4},
issn = {0885-7458},
year = {2025},
date = {2025-01-01},
journal = {International Journal of Parallel Programming},
volume = {53},
number = {2},
pages = {14},
abstract = {Stream processing is a computing paradigm enabling the continuous processing of unbounded data streams. Some classes of stream processing applications can greatly benefit from the parallel processing power and affordability offered by GPUs. However, efficient GPU utilization with stream processing applications often requires micro-batching techniques, i.e., the continuous processing of data batches to expose data parallelism opportunities and amortize host-device data transfer overheads. Micro-batching further introduces the challenge of finding suitable micro-batch sizes to maintain low-latency processing under highly dynamic workloads. The research field of self-adaptive software provides different techniques to address such a challenge. Our goal is to assess the performance of six self-adaptive algorithms in meeting latency requirements through micro-batch size adaptation. The algorithms are applied to a GPU-accelerated stream processing benchmark with a highly dynamic workload. Four of the six algorithms have already been evaluated using a smaller workload with the same application. We propose two new algorithms to address the shortcomings detected in the former four. The results demonstrate that a highly dynamic workload is challenging for the evaluated algorithms, as they could not meet the most strict latency requirements for more than 38.5% of the stream data items. Overall, all algorithms performed similarly in meeting the latency requirements. However, one of our proposed algorithms met the requirements for 4% more data items than the best of the previously studied algorithms, demonstrating more effectiveness in highly variable workloads. This effectiveness is particularly evident in segments of the workload with abrupt transitions between low- and high-latency regions, where our proposed algorithms met the requirements for 79% of the data items in those segments, compared to 33% for the best of the earlier algorithms.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}

Fechar

Mencagli, Gabriele; Rymarchuk, Yuriy; Griebler, Dalvan

PPOIJ: Shared-Nothing Parallel Patterns for Efficient Online Interval Joins over Data Streams Inproceedings doi

Proceedings of the 19th ACM International Conference on Distributed and Event-Based Systems, pp. 51-61, Association for Computing Machinery, New York, NY, USA, 2025.

Publicações

2025

2024

2023

2022

2021

2020