
Article information

  • Title: Hybrid Wasserstein distance and fast distribution clustering
  • Authors: Isabella Verdinelli; Larry Wasserman
  • Journal: Electronic Journal of Statistics
  • Print ISSN: 1935-7524
  • Publication year: 2019
  • Volume: 13
  • Issue: 2
  • Pages: 5088-5119
  • DOI: 10.1214/19-EJS1639
  • Language: English
  • Publisher: Institute of Mathematical Statistics
  • Abstract: We define a modified Wasserstein distance for distribution clustering which inherits many of the properties of the Wasserstein distance but which can be estimated easily and computed quickly. The modified distance is the sum of two terms. The first term, which has a closed form, measures the location-scale differences between the distributions. The second term is an approximation that measures the remaining distance after accounting for location-scale differences. We consider several forms of approximation, with our main emphasis being a tangent space approximation that can be estimated using nonparametric regression and leads to fast and easy computation of barycenters which otherwise would be very difficult to compute. We evaluate the strengths and weaknesses of this approach on simulated and real examples.
  • Keywords: Clustering; Wasserstein