首页    期刊浏览 2024年07月05日 星期五
登录注册

文章基本信息

  • 标题:CLASSIFICATION ALGORITHMS FOR BIG DATA ANALYSIS, A MAP REDUCE APPROACH
  • 本地全文:下载
  • 作者:V. A. Ayma ; R. S. Ferreira ; P. Happ
  • 期刊名称:ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
  • 印刷版ISSN:2194-9042
  • 电子版ISSN:2194-9050
  • 出版年度:2015
  • 卷号:XL-3/W2
  • 页码:17-21
  • DOI:10.5194/isprsarchives-XL-3-W2-17-2015
  • 出版社:Copernicus Publications
  • 摘要:Since many years ago, the scientific community is concerned about how to increase the accuracy of different classification methods, and major achievements have been made so far. Besides this issue, the increasing amount of data that is being generated every day by remote sensors raises more challenges to be overcome. In this work, a tool within the scope of InterIMAGE Cloud Platform (ICP) , which is an open-source, distributed framework for automatic image interpretation, is presented. The tool, named ICP: Data Mining Package , is able to perform supervised classification procedures on huge amounts of data, usually referred as big data , on a distributed infrastructure using Hadoop MapReduce. The tool has four classification algorithms implemented, taken from WEKA’s machine learning library, namely: Decision Trees, Naïve Bayes, Random Forest and Support Vector Machines (SVM). The results of an experimental analysis using a SVM classifier on data sets of different sizes for different cluster configurations demonstrates the potential of the tool, as well as aspects that affect its performance
  • 关键词:Big Data; MapReduce Framework; Hadoop; Classification Algorithms; Cloud Computing
国家哲学社会科学文献中心版权所有