文章基本信息

标题：A New Data Mining Algorithm based on MapReduce and Hadoop
本地全文：下载
作者：Xianfeng Yang ; Liming Lian
期刊名称：International Journal of Signal Processing, Image Processing and Pattern Recognition
印刷版ISSN：2005-4254
出版年度：2014
卷号：7
期号：2
页码：131-142
DOI：10.14257/ijsip.2014.7.2.13
出版社：SERSC
摘要：The goal of data mining is to discover hidden useful information in large databases. Mining frequent patterns from transaction databases is an important problem in data mining. As the database size increases, the computation time and required memory also increase. Base on this, we use the MapReduce programming mode which has parallel processing ability to analysis the large-scale network. All the experiments were taken under hadoop, deployed on a cluster which consists of commodity servers. Through empirical evaluations in various simulation conditions, the proposed algorithms are shown to deliver excellent performance with respect to scalability and execution time
关键词：Parallel algorithm; MapReduce; Hadoop; data mining