Browse by author
Lookup NU author(s): Dr Paul EzhilchelvanORCiD, Emeritus Professor Santosh Shrivastava
Faults are causes of system failures. The function of a fault tolerant algorithm of a system is to detect failures of the system's components and to attempt to tolerate these failures so as to provide specified system services. The design of such an algorithm for any given system function requires making assumptions about the behaviour of faulty components. We present a fault and failure classification for components using 'timeliness' and 'expected value' as the two properties of a component's response. This classification helps the design of family of fault tolerant algorithms capable of tolerating faults - from the simple ones to the most complex ones. Examples from distributed systems are used to illustrate these ideas.
Author(s): Ezhilchelvan PD, Shrivastava SK
Publication type: Report
Publication status: Published
Series Title: Computing Laboratory Technical Report Series
Year: 1985
Pages: 21
Print publication date: 01/09/1985
Source Publication Date: September 1985
Report Number: 206
Institution: Computing Laboratory, University of Newcastle upon Tyne
Place Published: Newcastle upon Tyne
URL: http://www.cs.ncl.ac.uk/publications/trs/papers/206.pdf