首页    期刊浏览 2025年07月11日 星期五
登录注册

文章基本信息

  • 标题:A Parallel Incremental Mining Algorithm using Modified Inverted Matrix
  • 本地全文:下载
  • 作者:Siddharth Shah ; Aniruddha K ; Prerak Thakkar
  • 期刊名称:International Journal of Computer Science and Information Technologies
  • 电子版ISSN:0975-9646
  • 出版年度:2017
  • 卷号:8
  • 期号:2
  • 页码:232-236
  • 出版社:TechScience Publications
  • 摘要:The association rule mining has been very useful in manybusiness applications, such as market analysis, web data analysis,decision making, knowing customer purchase behavior etc. In almostall transactional databases, new transactions are added with time. Soan efficient algorithm is required to be developed in order to avoid rescanningof old datasets. Incremental mining [18] deals withgenerating association rules based on available knowledge (obtainedfrom mining of previously stored databases) and incrementeddatabases, without scanning the previously mined databases again. Inthis paper a novel approach of using horizontal and vertical databaselayout, a new representation of transactional database, has beenproposed (modified version of Inverted Matrix [17]) to mine largedatabase incrementally. One of the major advantage of this approachis that it generates the data structure in a single scan of the databaseand whenever new database is added incrementally, the generatedstructure is updated to consider the effect of incremented database. Inthis paper the Modified Inverted Matrix is distributed amongstparallel nodes. Frequent item from the modified inverted matrix isassigned to parallel nodes in alternate splitting fashion. In parallelimplementation, a Co-Occurrence Frequent Item (COFI) tree [17] forassigning frequent item is generated by the parallel nodes. Miningprocess is accomplished by all nodes which generate all frequent itemsin which the assigned items are participated. Here, lesscommunication is required amongst the master node and all parallelnode to generate all frequent item sets. Moreover, one of theadditional advantage is that the algorithm still responds correctlywithout the need for changing the data structure, whenever desiredsupport value changes. In this paper we provide the theoretical proofof concept for our proposed approach.
  • 关键词:Parallel mining; Incremental mining; Inverted matrix; COFI;tree.
国家哲学社会科学文献中心版权所有