Article information

  • Title: Clustering of measures via mean measure quantization
  • Authors: Frédéric Chazal; Clément Levrard; Martin Royer
  • Journal: Electronic Journal of Statistics
  • Print ISSN: 1935-7524
  • Year of publication: 2021
  • Volume: 15
  • Issue: 1
  • Pages: 2060-2104
  • DOI: 10.1214/21-EJS1834
  • Language: English
  • Publisher: Institute of Mathematical Statistics
  • Abstract: This paper addresses the case where data come as point sets, or more generally as measures. Our goal is to build from data an embedding of these measures into a finite-dimensional Euclidean space that allows for provably efficient clustering of the source measures. The vectorization technique we propose relies on finding a compactly supported approximation of the mean measure generating process, which coincides with the intensity measure in the point process framework. To this aim we provide two algorithms that we prove almost minimax optimal. We assess the practical validity of our approach, first by showing that our results apply in the framework of persistence-based shape classification via the ATOL procedure described in [34]. Finally, numerical experiments are carried out on simulated and real datasets, encompassing text classification and large-scale graph classification.
  • Keywords: 62H30; clustering; quantization; topological data analysis