NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Due to the lapse in federal government funding, NASA is not updating this website. We sincerely regret this inconvenience.

Back to Results
Certification of computational resultsA conceptually novel and powerful technique to achieve fault detection and fault tolerance in hardware and software systems is described. When used for software fault detection, this new technique uses time and software redundancy and can be outlined as follows. In the initial phase, a program is run to solve a problem and store the result. In addition, this program leaves behind a trail of data called a certification trail. In the second phase, another program is run which solves the original problem again. This program, however, has access to the certification trail left by the first program. Because of the availability of the certification trail, the second phase can be performed by a less complex program and can execute more quickly. In the final phase, the two results are compared and if they agree the results are accepted as correct; otherwise an error is indicated. An essential aspect of this approach is that the second program must always generate either an error indication or a correct output even when the certification trail it receives from the first program is incorrect. The certification trail approach to fault tolerance is formalized and realizations of it are illustrated by considering algorithms for the following problems: convex hull, sorting, and shortest path. Cases in which the second phase can be run concurrently with the first and act as a monitor are discussed. The certification trail approach are compared to other approaches to fault tolerance.
Document ID
19940031557
Acquisition Source
Legacy CDMS
Document Type
Other
Authors
Sullivan, Gregory F.
(Johns Hopkins Univ. Baltimore, MD, United States)
Wilson, Dwight S.
(Johns Hopkins Univ. Baltimore, MD, United States)
Masson, Gerald M.
(Johns Hopkins Univ. Baltimore, MD, United States)
Date Acquired
September 6, 2013
Publication Date
December 21, 1993
Publication Information
Publication: Transient Faults in Computer Systems
Subject Category
Computer Programming And Software
Accession Number
94N36064
Funding Number(s)
CONTRACT_GRANT: NSF CCR-89-08092
CONTRACT_GRANT: NSF CDA-90-15667
CONTRACT_GRANT: NSF CCR-89-10569
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.
No Preview Available