期刊名称:International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
印刷版ISSN:2278-1323
出版年度:2013
卷号:2
期号:12
页码:3224-3226
出版社:Shri Pannalal Research Institute of Technolgy
摘要:Discovering patterns from sequence data has significant impact in many aspects of science and society. In the real world, usually a large set of patterns could be discovered yet many of them are redundant, thus degrading the output quality. To improve the output quality by removing two types of redundant patterns. First, the notion of delta tolerance closed itemset is employed to remove redundant patterns that are not delta closed. Second, the concept of statistically induced patterns is proposed to capture redundant patterns which seem to be statistically significant yet their significance is induced by their strong significant subpatterns. This approach produce a relatively small set of patterns which reveal interesting information in the sequences.