Article Information

  • Title: A DATA MINING METHODOLOGY WITH PREPROCESSING STEPS
  • Authors: Vita Špečkauskienė ; Arūnas Lukoševičius
  • Journal: Public Policy And Administration
  • Print ISSN: 2029-2872
  • Publication year: 2015
  • Volume: 38
  • Issue: 4
  • DOI: 10.5755/j01.itc.38.4.12081
  • Language: English
  • Publisher: Kaunas University of Technology
  • Abstract: This paper analyzes various problems that arise when performing data mining, including issues of data quality. The main focus is on feature selection and its influence on classification results. Feature selection, or the discovery of an optimal data set, is the process of removing features that are not useful in decision making from the data set and keeping the most useful ones. The influence of feature selection is analyzed for different classification algorithms, which are applied to two data sets of different composition to solve three problems in the medical domain. The results show that there is no universal algorithm that can solve every problem, and that each data set has its own optimal (sub)set of features for a given classification algorithm. Methodological recommendations for reaching a possibly optimal solution are given to support clinical decision making.