NTRS - NASA Technical Reports Server

Fault recovery characteristics of the fault tolerant multi-processor

The fault handling performance of the fault tolerant multiprocessor (FTMP) was investigated, and the fault handling errors detected during fault injection experiments were characterized. In these experiments, the FTMP disabled a working unit instead of the faulted unit once every 500 faults, on average. System design weaknesses allow active faults to exercise a part of the fault management software that handles Byzantine, or lying, faults. These weak areas in the FTMP's design increase the probability that, for any hardware fault, a good line replaceable unit (LRU) is mistakenly disabled by the fault management software. It is concluded that fault injection can help detect and analyze the behavior of a system in the ultra-reliable regime. Although fault injection testing cannot be exhaustive, it has been demonstrated to provide a unique capability to unmask problems and to characterize the behavior of a fault-tolerant system.
Document ID
19910069988
Document Type
Conference Paper
Authors
Padilla, Peter A. (NASA Langley Research Center Hampton, VA, United States)
Date Acquired
August 14, 2013
Publication Date
January 1, 1990
Subject Category
COMPUTER SYSTEMS
Meeting Information
IEEE/AIAA/NASA Digital Avionics Systems Conference (Virginia Beach, VA)
Distribution Limits
Public
Copyright
Other