期刊名称:International Journal of Innovative Research in Computer and Communication Engineering
印刷版ISSN:2320-9798
电子版ISSN:2320-9801
出版年度:2014
卷号:2
期号:9
出版社:S&S Publications
摘要:Data mining is process of extracting a large amount of data that need to be analysed patterns have to beextracted from that to share knowledge. In this new era with rumble of data together structured, semi-structured andunstructured, in the field of machine learning, educational data mining, web mining and text mining, environmentalresearch areas, it has become difficult to process, manage and analyzed patterns using traditional databases andrelational databases So, a appropriate architecture should be understood to gain and share knowledge about the BigData. This paper presents an analysis of various algorithms from for handling such big data set. These algorithmsdefine various structures and methods implemented to handle Big Data, also in the paper are listed various data miningtools that were developed for analyzing them.
关键词:Data Mining; Big Data; Clustering. Frequent Pattern; Association Rule;Hadoop and MapReduce;framework.