Reducing Interprocessor Dependence in Recoverable Distributed Shared MemoryCheckpointing techniques in parallel systems use dependency tracking and/or message logging to ensure that a system rolls back to a consistent state. Traditional dependency tracking in distributed shared memory (DSM) systems is expensive because of high communication frequency. In this paper we show that, if designed correctly, a DSM system only needs to consider dependencies due to the transfer of blocks of data, resulting in reduced dependency tracking overhead and reduced potential for rollback propagation. We develop an ownership timestamp scheme to tolerate the loss of block state information and develop a passive server model of execution where interactions between processors are considered atomic. With our scheme, dependencies are significantly reduced compared to the traditional message-passing model.
Document ID
19970011970
Acquisition Source
Legacy CDMS
Document Type
Conference Paper
Authors
Janssens, Bob (Illinois Univ. Urbana-Champaign, IL United States)
Fuchs, W. Kent (Illinois Univ. Urbana-Champaign, IL United States)
Date Acquired
August 17, 2013
Publication Date
October 1, 1994
Publication Information
Publication: Proceedings of the 13th Symposium on Reliable Distributed Systems