KORD-I: A Framework for Real-Time Performance and Cost Optimization of Apache Spark Streaming

Athanasios Kordelas, Thanasis Spyrou, Spyros Voulgaris, Vasileios Megalooikonomou, Nikos Deligiannis

Onderzoeksoutput: Conference paper

Samenvatting

Apache Spark is one of the most commonly used
frameworks for Big Data processing. Research on the provided
streaming dynamic resource allocation feature, has been shown
that large data load fluctuations, for instance, in website traffic,
have a negative impact on the automatic scaling. Research has
also indicated that the lack of data load prediction, which
aims at the identification of the expected data load increase on
peak hours/days, is the root cause of the aforementioned issue.
Hence, this paper proposes an enhanced solution, namely, KORDI
(Knowledge-based Orchestrated Resource DIstribution), aiming
at optimising the allocation of Spark resources on Streaming
applications in real time with the use of SARIMAX model.
The experimental evaluation proves that the proposed solution
provides a cost reduction of 38% without affecting stability.
Originele taal-2English
Titel2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS-23)
Pagina's1-3
Aantal pagina's3
StatusAccepted/In press - 2023
Evenement2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) - Raleigh, North Carolina, Raleigh, United States
Duur: 23 apr 202325 apr 2023
https://ispass.org/ispass2023/

Conference

Conference2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
Land/RegioUnited States
StadRaleigh
Periode23/04/2325/04/23
Internet adres

Vingerafdruk

Duik in de onderzoeksthema's van 'KORD-I: A Framework for Real-Time Performance and Cost Optimization of Apache Spark Streaming'. Samen vormen ze een unieke vingerafdruk.

Citeer dit