NTRS - NASA Technical Reports Server
A Byzantine resilient processor with an encoded fault-tolerant shared memory
The memory requirements for ultra-reliable computers are expected to increase due to future increases in mission functionality and operating-system requirements. This increase will have a negative effect on the reliability and cost of the system. Increased memory size will also reduce the ability to reintegrate a channel after a transient fault, since the time required to reintegrate a channel in a conventional fault-tolerant processor is dominated by memory realignment time. A Byzantine Resilient Fault-Tolerant Processor with Fault-Tolerant Shared Memory (FTP/FTSM) is presented as a solution to these problems. The FTSM uses an encoded memory system, which reduces the memory requirement by one-half compared to a conventional quad-FTP design. This increases the reliability and decreases the cost of the system. The realignment problem is also addressed by the FTSM. Because any single error is corrected upon a read from the FTSM, a faulty channel's corrupted memory does not need realignment before reintegration of the faulty channel. A combination of correct-on-access and background scrubbing is proposed to prevent the accumulation of transient errors in the memory. With a hardware-implemented scrubber, the scrubbing cycle time, and therefore the memory fault latency, can be upper-bounded at a small value. This technique increases the reliability of the memory system and facilitates validation of its reliability model.
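The interplay of correct-on-access and background scrubbing described in the abstract can be illustrated with a minimal sketch. This record does not specify the FTSM's actual encoding, so the sketch substitutes a textbook Hamming(7,4) single-error-correcting code as a stand-in. The class `EncodedMemory` and the helpers `encode`, `syndrome`, `decode`, and `inject_fault` are hypothetical names introduced only for this illustration, and the scrubber is modelled in software rather than hardware.

```python
# Illustrative sketch only. Hamming(7,4) is an ASSUMED stand-in for the
# FTSM encoding, which this record does not describe. All names are hypothetical.

def encode(nibble):
    """Encode 4 data bits as a 7-bit Hamming codeword (bit positions 1..7)."""
    d = [(nibble >> i) & 1 for i in range(4)]  # d[0] is the least significant bit
    p1 = d[0] ^ d[1] ^ d[3]  # parity over positions 1, 3, 5, 7
    p2 = d[0] ^ d[2] ^ d[3]  # parity over positions 2, 3, 6, 7
    p4 = d[1] ^ d[2] ^ d[3]  # parity over positions 4, 5, 6, 7
    return [p1, p2, d[0], p4, d[1], d[2], d[3]]

def syndrome(word):
    """Return the 1-based position of a single flipped bit, or 0 if none."""
    s1 = word[0] ^ word[2] ^ word[4] ^ word[6]
    s2 = word[1] ^ word[2] ^ word[5] ^ word[6]
    s4 = word[3] ^ word[4] ^ word[5] ^ word[6]
    return s1 + 2 * s2 + 4 * s4

def decode(word):
    """Extract the 4 data bits from an (already corrected) codeword."""
    return word[2] | (word[4] << 1) | (word[5] << 2) | (word[6] << 3)

class EncodedMemory:
    def __init__(self, size):
        self.cells = [encode(0) for _ in range(size)]

    def write(self, addr, nibble):
        self.cells[addr] = encode(nibble)

    def read(self, addr):
        """Correct-on-access: repair a single-bit error in place, then return data."""
        word = self.cells[addr]
        pos = syndrome(word)
        if pos:
            word[pos - 1] ^= 1
        return decode(word)

    def scrub(self):
        """One background scrubbing pass over all words; returns words repaired."""
        repaired = 0
        for word in self.cells:
            pos = syndrome(word)
            if pos:
                word[pos - 1] ^= 1
                repaired += 1
        return repaired

    def inject_fault(self, addr, bit):
        """Flip one stored bit (0-based index) to model a transient error."""
        self.cells[addr][bit] ^= 1
```

In this sketch a read repairs only the word it touches, while `scrub` sweeps every word. Running the sweep on a fixed period bounds how long a single error can remain in a rarely read word before a second error lands in the same word and exceeds the code's correction capability.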
Document ID
19910003382
Acquisition Source
Legacy CDMS
Document Type
Conference Paper
Authors
Butler, Bryan
(Draper (Charles Stark) Lab., Inc. Cambridge, MA, United States)
Harper, Richard
(Draper (Charles Stark) Lab., Inc. Cambridge, MA, United States)
Date Acquired
August 14, 2013
Publication Date
April 1, 1990
Publication Information
Publication: AGARD, Fault Tolerant Design Concepts for Highly Integrated Flight Critical Guidance and Control Systems
Subject Category
Computer Operations And Hardware
Accession Number
91N12695
Funding Number(s)
CONTRACT_GRANT: NAS1-18565
Distribution Limits
Public
Copyright
Other