首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:A Feature Subset Selection Technique for High Dimensional Data Using Symmetric Uncertainty
  • 本地全文:下载
  • 作者:Bharat Singh , Nidhi Kushwaha , Om Prakash Vyas
  • 期刊名称:Journal of Data Analysis and Information Processing
  • 印刷版ISSN:2327-7211
  • 电子版ISSN:2327-7203
  • 出版年度:2014
  • 卷号:02
  • 期号:04
  • 页码:95-105
  • DOI:10.4236/jdaip.2014.24012
  • 语种:English
  • 出版社:Scientific Research Publishing
  • 摘要:With the abundance of exceptionally High Dimensional data, feature selection has become an essential element in the Data Mining process. In this paper, we investigate the problem of efficient feature selection for classification on High Dimensional datasets. We present a novel filter based approach for feature selection that sorts out the features based on a score and then we measure the performance of four different Data Mining classification algorithms on the resulting data. In the proposed approach, we partition the sorted feature and search the important feature in forward manner as well as in reversed manner, while starting from first and last feature simultaneously in the sorted list. The proposed approach is highly scalable and effective as it parallelizes over both attribute and tuples simultaneously allowing us to evaluate many of potential features for High Dimensional datasets. The newly proposed framework for feature selection is experimentally shown to be very valuable with real and synthetic High Dimensional datasets which improve the precision of selected features. We have also tested it to measure classification accuracy against various feature selection process.
  • 关键词:High Dimensional Datasets; Feature Selection; Classification; Predominant Feature
国家哲学社会科学文献中心版权所有