期刊名称:International Journal of Grid and Distributed Computing
印刷版ISSN:2005-4262
出版年度:2016
卷号:9
期号:3
页码:135-144
DOI:10.14257/ijgdc.2016.9.3.16
出版社:SERSC
摘要:Grid use huge number of dynamic, distributed & heterogeneous resources under different organizations domain to executing user applications. Working with these resources, applications face problems due to occurrence of faults. There are number of different faults that occur due to many problems. Therefore grid computing needs an efficient fault tolerance mechanism. This fault tolerance makes the grid system capable to work correctly even in the presence of faults. In this paper we have study various kinds of faults and reasons for occurrence. With this, also study various kinds of existing fault tolerance techniques. We devise a strategy, based on proactive fault tolerance and take appropriate step, if any possibility for fault occurrence. Scheduling of task on available resources, which is provided by GIS consider previous history of these resources. This will decrease the probability of faults, execution time, and increase the execution rate. Proposed strategy also uses technique of check pointing to reduce execution cost. To implement proposed strategy GridSim toolkit is used for simulation