期刊名称:International Journal of Information and Communication Technology Research
电子版ISSN:2223-4985
出版年度:2012
卷号:2
期号:4
出版社:IRPN Publishers
摘要:In this paper, we have study the performance of k-Mean data-mining algorithm (k-Mean),which is implemented on the heterogeneous compute cluster with the multi core programming. The multicore program is implemented with MPI and C for the parallel computing and utilizing the maximum compute power of the heterogeneous cluster. The heterogeneous cluster is established with the help of MPICH2. We have also analyzed the efficiency and performance of k-Mean data mining algorithm for the large dataset. The dataset, which we have used, is chess.txt [1]. The dataset is divided into the number of cores and core compute the dataset independently and makes a data cluster of similar dataset on each processor core. Through this implementation, we have justified that the communication time among the processor cannot be negligible for large dataset. So, the compute time for same dataset on different processors core with same speed and memory is different and the different processor with different speeds and memory access also take different time.