NTRS - NASA Technical Reports Server

Algorithm-Based Fault Tolerance Integrated with Replication

In a proposed approach to programming and utilization of commercial off-the-shelf computing equipment, a combination of algorithm-based fault tolerance (ABFT) and replication would be utilized to obtain high degrees of fault tolerance without incurring excessive costs. The basic idea of the proposed approach is to integrate ABFT with replication such that the algorithmic portions of computations would be protected by ABFT, and the logical portions by replication. ABFT is an extremely efficient, inexpensive, high-coverage technique for detecting and mitigating faults in computer systems used for algorithmic computations, but does not protect against errors in logical operations surrounding algorithms.
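
The division of labor described above can be illustrated with a minimal Python sketch. It is not taken from the tech brief, which does not specify an ABFT scheme or a replication mechanism. The sketch assumes the classic checksum-based matrix multiplication as the ABFT-protected "algorithmic portion" and a simple majority vote over three runs as the replication of the "logical portion". All names (`abft_matmul`, `replicated`, `choose_mode`, `guarded_step`) and the `fault` injection parameter are hypothetical. In a real system the replicas would presumably run on separate processors; here they run sequentially only to show the control flow.

```python
from collections import Counter


def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def abft_matmul(a, b, fault=None):
    """Checksum-protected product (assumed ABFT scheme, integer data).

    A row of column sums is appended to `a` and a column of row sums to
    `b`, so the product carries its own row and column checksums.
    `fault` is a hypothetical (row, col, delta) used to inject an error.
    Returns (product, ok). Floating-point data would need a tolerance
    instead of exact equality.
    """
    a_chk = a + [[sum(col) for col in zip(*a)]]
    b_chk = [row + [sum(row)] for row in b]
    full = matmul(a_chk, b_chk)
    if fault is not None:
        i, j, delta = fault
        full[i][j] += delta  # simulate a corrupted result element
    n, m = len(a), len(b[0])
    rows_ok = all(sum(full[i][:m]) == full[i][m] for i in range(n))
    cols_ok = all(sum(full[i][j] for i in range(n)) == full[n][j]
                  for j in range(m))
    return [row[:m] for row in full[:n]], rows_ok and cols_ok


def replicated(fn, *args, copies=3):
    """Run a logical operation several times and majority-vote the result."""
    results = [fn(*args) for _ in range(copies)]
    value, votes = Counter(results).most_common(1)[0]
    if votes <= copies // 2:
        raise RuntimeError("replicas disagree: no majority")
    return value


def choose_mode(sensor_ok, battery_level):
    """Hypothetical control logic surrounding the algorithm."""
    return "compute" if sensor_ok and battery_level > 20 else "safe"


def guarded_step(a, b, sensor_ok, battery_level):
    """Replication guards the decision; ABFT guards the computation."""
    if replicated(choose_mode, sensor_ok, battery_level) != "compute":
        return None
    product, ok = abft_matmul(a, b)
    if not ok:
        raise RuntimeError("ABFT checksum mismatch")
    return product


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    print(guarded_step(a, b, True, 50))
    print(abft_matmul(a, b, fault=(0, 1, 1))[1])
```

The checksum test adds one extra row and one extra column to the multiplication, whereas replication repeats the whole operation. That cost difference is the reason for confining replication to the comparatively small logical portions in this sketch.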
Document ID
20080047201
Acquisition Source
Jet Propulsion Laboratory
Document Type
Other - NASA Tech Brief
Authors
Some, Raphael
(California Inst. of Tech. Pasadena, CA, United States)
Rennels, David
(California Inst. of Tech. Pasadena, CA, United States)
Date Acquired
August 24, 2013
Publication Date
November 1, 2008
Publication Information
Publication: NASA Tech Briefs, November 2008
Subject Category
Computer Programming And Software
Report/Patent Number
NPO-43842
Distribution Limits
Public
Copyright
Public Use Permitted.