Coping with Dependent Process Failures

Flavio Junqueira, Keith Marzullo and M. Voelker Geoffrey
CS2002-0723
October 7, 2002

When developing fault-tolerant protocols, systems are usually modeled assuming that process failures are independent and identically distributed. In this paper, we present a system model that can represent correlated failures. We show that such a model is useful in that protocols can be made more efficient. Central to our approach is the idea of a core, which is a reliable minimal subset of processes. We present two probabilistic failure models for dependent failures and discuss them in terms of computing cores. For both, finding a smallest minimal core is NP-hard, but one of the two models can be strengthened in a natural way to make computing a minimal core in P.


How to view this document


The authors of these documents have submitted their reports to this technical report series for the purpose of non-commercial dissemination of scientific work. The reports are copyrighted by the authors, and their existence in electronic format does not imply that the authors have relinquished any rights. You may copy a report for scholarly, non-commercial purposes, such as research or instruction, provided that you agree to respect the author's copyright. For information concerning the use of this document for other than research or instructional purposes, contact the authors. Other information concerning this technical report series can be obtained from the Computer Science and Engineering Department at the University of California at San Diego, techreports@cs.ucsd.edu.


[ Search ]


NCSTRL
This server operates at UCSD Computer Science and Engineering.
Send email to webmaster@cs.ucsd.edu