NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Progressive retry for software error recovery in distributed systemsIn this paper, we describe a method of execution retry for bypassing software errors based on checkpointing, rollback, message reordering and replaying. We demonstrate how rollback techniques, previously developed for transient hardware failure recovery, can also be used to recover from software faults by exploiting message reordering to bypass software errors. Our approach intentionally increases the degree of nondeterminism and the scope of rollback when a previous retry fails. Examples from our experience with telecommunications software systems illustrate the benefits of the scheme.
Document ID
19940034732
Acquisition Source
Legacy CDMS
Document Type
Conference Paper
Authors
Wang, Yi-Min
(Illinois Univ. Urbana, United States)
Huang, Yennun
(AT&T Bell Labs. Murray Hill, NJ, United States)
Fuchs, W. K.
(Illinois Univ. Urbana, United States)
Date Acquired
August 16, 2013
Publication Date
June 1, 1993
Subject Category
Computer Programming And Software
Report/Patent Number
AD-A274289
Meeting Information
Meeting: IEEE, International Symposium on Fault-Tolerant Computing
Location: Toulouse
Country: France
Start Date: June 22, 1993
End Date: June 24, 1993
Sponsors: IEEE
Accession Number
94A11387
Funding Number(s)
CONTRACT_GRANT: NAG1-613
CONTRACT_GRANT: N00014-91-J-1283
Distribution Limits
Public
Copyright
Other

Available Downloads

There are no available downloads for this record.
No Preview Available