文章基本信息

标题：Redundancy Schemes for High Availability Computer Clusters
本地全文：下载
作者：Bassek, Christian K. ; Pierre, Samuel ; Quintero, Alejandro 等
期刊名称：Journal of Computer Science
印刷版ISSN：1549-3636
出版年度：2006
卷号：2
期号：1
页码：33-47
DOI：10.3844/jcssp.2006.33.47
出版社：Science Publications
摘要：The primary goal of computer clusters is to improve computing performances by taking advantage of the parallelism they intrinsically provide. Moreover, their use of redundant hardware components enables them to offer high availability services. In this paper, we present an analytical model for analyzing redundancy schemes and their impact on the cluster’s overall performance. Furthermore, several cluster redundancy techniques are analyzed with an emphasis on hardware and data redundancy, from which we derive an applicable redundancy scheme design. Also, our solution provides a disaster recovery mechanism that improves the cluster’s availability. In the case of data redundancy, we present improvements to the replication and parity data replication techniques for which we investigate the availability of the cluster under several scenarios that take into account, among other things, the number of replicated nodes, the number of CPUs that hold parity data and the relation between primary and replicated data. For this purpose, we developed a simulator that analyzes the impact of a redundancy scheme on the processing rate of the cluster. We also studied the performance of two well-known schemes according to the usage rate of the CPUs. We found that two important aspects influencing the performance of a transaction-oriented cluster were the cluster’s failover and data redundancy schemes. We simulated several data redundancy schemes and found that data replication offered higher cluster availability than the parity model.
关键词：Computer cluster; high availability; redundancy scheme; performance evaluation; fault tolerance