License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following:
DOI: 10.4230/LIPIcs.OPODIS.2018.19
URN: urn:nbn:de:0030-drops-100792
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2018/10079/
Kozhaya, David ;
Maric, Ognjen ;
Pignolet, Yvonne-Anne
You Only Live Multiple Times: A Blackbox Solution for Reusing Crash-Stop Algorithms In Realistic Crash-Recovery Settings
Abstract
Distributed agreement-based algorithms are often specified in a crash-stop asynchronous model augmented by Chandra and Toueg's unreliable failure detectors. In such models, correct nodes stay up forever, incorrect nodes eventually crash and remain down forever, and failure detectors eventually behave correctly forever. In reality, however, both nodes and communication links crash and recover, with no deterministic guarantee of remaining in any state forever.
In this paper, we capture this realistic temporary and probabilistic behaviour in a simple new system model. Moreover, we identify a large algorithm class for which we devise a property-preserving transformation. Using this transformation, many algorithms written for the asynchronous crash-stop model run correctly and unchanged in real systems.
BibTeX - Entry
@InProceedings{kozhaya_et_al:LIPIcs:2018:10079,
author = {David Kozhaya and Ognjen Maric and Yvonne-Anne Pignolet},
title = {{You Only Live Multiple Times: A Blackbox Solution for Reusing Crash-Stop Algorithms In Realistic Crash-Recovery Settings}},
booktitle = {22nd International Conference on Principles of Distributed Systems (OPODIS 2018)},
pages = {19:1--19:17},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-098-9},
ISSN = {1868-8969},
year = {2018},
volume = {125},
editor = {Jiannong Cao and Faith Ellen and Luis Rodrigues and Bernardo Ferreira},
publisher = {Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
address = {Dagstuhl, Germany},
URL = {http://drops.dagstuhl.de/opus/volltexte/2018/10079},
URN = {urn:nbn:de:0030-drops-100792},
doi = {10.4230/LIPIcs.OPODIS.2018.19},
annote = {Keywords: Crash recovery, consensus, asynchrony}
}
Keywords: Crash recovery, consensus, asynchrony
Collection: 22nd International Conference on Principles of Distributed Systems (OPODIS 2018)
Issue Date: 2018
Date of publication: 15.01.2019