首页    期刊浏览 2025年02月17日 星期一
登录注册

文章基本信息

  • 标题:Boosting association rule mining in large datasets via Gibbs sampling
  • 本地全文:下载
  • 作者:Guoqi Qian ; Calyampudi Radhakrishna Rao ; Xiaoying Sun
  • 期刊名称:Proceedings of the National Academy of Sciences
  • 印刷版ISSN:0027-8424
  • 电子版ISSN:1091-6490
  • 出版年度:2016
  • 卷号:113
  • 期号:18
  • 页码:4958-4963
  • DOI:10.1073/pnas.1604553113
  • 语种:English
  • 出版社:The National Academy of Sciences of the United States of America
  • 摘要:Current algorithms for association rule mining from transaction data are mostly deterministic and enumerative. They can be computationally intractable even for mining a dataset containing just a few hundred transaction items, if no action is taken to constrain the search space. In this paper, we develop a Gibbs-sampling–induced stochastic search procedure to randomly sample association rules from the itemset space, and perform rule mining from the reduced transaction dataset generated by the sample. Also a general rule importance measure is proposed to direct the stochastic search so that, as a result of the randomly generated association rules constituting an ergodic Markov chain, the overall most important rules in the itemset space can be uncovered from the reduced dataset with probability 1 in the limit. In the simulation study and a real genomic data example, we show how to boost association rule mining by an integrated use of the stochastic search and the Apriori algorithm.
  • 关键词:association rule ; Gibbs sampling ; transaction data ; genomic data
国家哲学社会科学文献中心版权所有