首页    期刊浏览 2025年06月17日 星期二
登录注册

文章基本信息

  • 标题:How many clusters?,
  • 本地全文:下载
  • 作者:Peter McCullagh ; Jie Yang
  • 期刊名称:Bayesian Analysis
  • 印刷版ISSN:1931-6690
  • 电子版ISSN:1936-0975
  • 出版年度:2008
  • 卷号:03
  • 期号:01
  • 页码:101-120
  • 出版社:International Society for Bayesian Analysis
  • 摘要:The title poses a deceptively simple question that must be addressed by any statistical model or computational algorithm for the clustering of points. Two distinct interpretations are possible, one connected with the number of clusters in the sample and one with the number in the population. Under suitable conditions, these questions may have essentially the same answer, but it is logically possible for one answer to be nite and the other in nite. This paper reformulates the standard Dirichlet allocation model as a cluster process in such a way that these and related questions can be addressed directly. Our conclusion is that the data are sometimes informative for clustering points in the sample, but they seldom contain much information about parameters such as the number of clusters in the population.
  • 关键词:Cluster process; Dirichlet partition; Gauss-Ewens process; Random sub-clusters; Species-counting model
国家哲学社会科学文献中心版权所有