首页    期刊浏览 2025年03月11日 星期二
登录注册

文章基本信息

  • 标题:Partially Bayesian variable selection in classification trees
  • 本地全文:下载
  • 作者:Xuming He ; Douglas A. Noe
  • 期刊名称:Statistics and Its Interface
  • 印刷版ISSN:1938-7989
  • 电子版ISSN:1938-7997
  • 出版年度:2008
  • 卷号:1
  • 期号:1
  • 页码:155-167
  • DOI:10.4310/SII.2008.v1.n1.a13
  • 出版社:International Press
  • 摘要:Tree-structured models for classification may be split into two broad categories: those that are completely datadriven and those that allow some direct user interaction during model construction. Classifiers such as CART [3] and QUEST [11] are members of the first category. In those datadriven algorithms, all predictor variables compete equally for a particular classification task. However, in many cases a subject-area expert is likely to have some qualitative notion about their relative importance. Interactive algorithms such as RTREE [17] address this issue by allowing users to select variables at various stages of tree construction. In this paper, we introduce a more formal partially Bayesian procedure for dynamically incorporating qualitative expert opinions in the construction of classification trees. An algorithm that dynamically incorporates expert opinion in this way has two potential advantages, each improving with the quality of the expert. First, by de-emphasizing certain subsets of variables during the estimation process, machine-based computational activity can be reduced. Second, by giving an expert’s preferred variables priority, we reduce the chance that a spurious variable will appear in the model. Hence, our resulting models are potentially more interpretable and less unstable than those generated by purely data-driven algorithms.
  • 关键词:feature selection; expert opinion; supervised learning
国家哲学社会科学文献中心版权所有