期刊名称:International Journal on Computer Science and Engineering
印刷版ISSN:2229-5631
电子版ISSN:0975-3397
出版年度:2010
卷号:2
期号:6
页码:1999-2002
出版社:Engg Journals Publications
摘要:In data mining and knowledge discovery, for finding the significant correlation among events Pattern discovery (PD) is used. PD typically produces an overwhelming number of patterns. Since there are too many patterns, it is difficult to use them to further explore or analyze the data. To address the problems in Pattern Discovery, a new method that simultaneously clusters the discovered patterns and their associated data. It is referred to as �Simultaneous pattern and data clustering using Modified K-means Algorithm�. One important property of the proposed method is that each pattern cluster is explicitly associated with a corresponding data cluster. Modified Kmeans algorithm is used to cluster patterns and their associated data. After clusters are found, each of them can be further explored and analyzed individually. The proposed method reduces the number of iterations to cluster the given data. The experimental results using the proposed algorithm with a group of randomly constructed data sets are very promising.
关键词:Pattern Discovery; Contingency table; and Chi-Square test