首页    期刊浏览 2024年07月05日 星期五
登录注册

文章基本信息

  • 标题:A Copula-Based Supervised Learning Classification for Continuous and Discrete Data
  • 本地全文:下载
  • 作者:Yuhui Chen
  • 期刊名称:Journal of Data Science
  • 印刷版ISSN:1680-743X
  • 电子版ISSN:1683-8602
  • 出版年度:2016
  • 卷号:14
  • 期号:4
  • 页码:769-790
  • 出版社:Tingmao Publish Company
  • 摘要:Despite the unreasonable feature independence assumption, the naiveBayes classifier provides a simple way but competes well with more sophisticatedclassifiers under zero-one loss function for assigning an observation to a class giventhe features observed. However, it has been proved that the naive Bayes workspoorly in estimation and in classification for some cases when the features arecorrelated. To extend, researchers had developed many approaches to free of thisprimary but rarely satisfied assumption in the real world for the naive Bayes. In thispaper, we propose a new classifier which is also free of the independenceassumption by evaluating the dependence of features through pair copulasconstructed via a graphical model called D-Vine tree. This tree structure helps todecompose the multivariate dependence into many bivariate dependencies and thusmakes it possible to easily and efficiently evaluate the dependence of features evenfor data with high dimension and large sample size. We further extend the proposedmethod for features with discrete-valued entries. Experimental studies show thatthe proposed method performs well for both continuous and discrete cases.
  • 关键词:Classification; Copulas; D-Vine Tree; Multivariate Dependence; Naive;Bayes; Supervised Learning outlier.
国家哲学社会科学文献中心版权所有