Asynchronous snapshots of actor systems for latency-sensitive applications

Dominik Aumayr, Stefan Marr, Elisa Gonzalez Boix, Hanspeter Mössenböck

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › Research

1 Citation (Scopus)

Abstract

The actor model is popular for many types of server applications. Efficient snapshotting of applications is crucial in the deployment of pre-initialized applications or moving running applications to different machines, e.g., for debugging purposes. A key issue is that snapshotting blocks all other operations. In modern latency-sensitive applications, stopping the application to persist its state needs to be avoided, because users may not tolerate the increased request latency. In order to minimize the impact of snapshotting on request latency, our approach persists the application's state asynchronously by capturing partial heaps, completing snapshots step by step. Additionally, our solution is transparent and supports arbitrary object graphs. We prototyped our snapshotting approach on top of the Truffle/Graal platform and evaluated it with the Savina benchmarks and the Acme Air microservice application. When performing a snapshot every thousand Acme Air requests, the number of slow requests (0.007% of all requests) with latency above 100 ms increases by 5.43%. Our Savina microbenchmark results detail how different utilization patterns impact snapshotting cost. To the best of our knowledge, this is the first system that enables asynchronous snapshotting of actor applications, i.e., without stop-the-world synchronization, and thereby minimizes the impact on latency. We thus believe it enables new deployment and debugging options for actor systems. © 2019 Copyright held by the owner/author(s). Publication rights licensed to ACM.

Original language: English
Title of host publication: Proceedings of the 16th ACM SIGPLAN International Conference on Managed Programming Languages and Runtimes
Editors: Antony L. Hosking, Irene Finocchi
Publisher: ACM New York
Pages: 157-171
Edition: MPLR 2019
ISBN (Electronic): 978-1-4503-6977-0
DOIs
Publication status: Published - 23 Oct 2019
Event: ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity - Athens, Greece
Duration: 20 Oct 2019 to 25 Oct 2019
https://2019.splashcon.org/

Conference

Conference: ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity
Abbreviated title: SPLASH 2019
Country: Greece
City: Athens
Period: 20/10/19 to 25/10/19
Internet address

