首页    期刊浏览 2024年11月24日 星期日
登录注册

文章基本信息

  • 标题:A New Co-Ordinated Checkpointing and Rollback Recovery Scheme for Distributed Shared Memory Clusters
  • 本地全文:下载
  • 作者:Minakshi Tripathy ; C.R. Tripathy
  • 期刊名称:International Journal of Distributed and Parallel Systems
  • 印刷版ISSN:2229-3957
  • 电子版ISSN:0976-9757
  • 出版年度:2011
  • 卷号:2
  • 期号:1
  • DOI:10.5121/ijdps.2011.2104
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:In this paper, an unified lightweight error recovery scheme based on coordinated checkpointing and rollback for distributed shared memory clusters is proposed. The new scheme maintains multiple globally consistent checkpoints of the state of a distributed shared memory cluster and recovers to a pre-fault checkpoint of the system. It also describes and evaluates the coordinated checkpointing. The coordinated checkpoint neither needs to exchange coordination messages nor adds information to the process messages. It only accesses stable storage when checkpoints are saved. Each of the processes saves its state independently from the other processes. The checkpoint timers are set at different processes. Based on the results of performance evaluation the proposed scheme is shown to outperform the previously proposed checkpoint and recovery schemes for distributed shared memory clusters
  • 关键词:Fault tolerance; global states; checkpoint timers; clock drift rate; consistency; and recoverability.;1.
国家哲学社会科学文献中心版权所有