首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:A Fast Clustering Algorithm for Data with a Few Labeled Instances
  • 本地全文:下载
  • 作者:Jinfeng Yang ; Yong Xiao ; Jiabing Wang
  • 期刊名称:Computational Intelligence and Neuroscience
  • 印刷版ISSN:1687-5265
  • 电子版ISSN:1687-5273
  • 出版年度:2015
  • 卷号:2015
  • DOI:10.1155/2015/196098
  • 出版社:Hindawi Publishing Corporation
  • 摘要:The diameter of a cluster is the maximum intracluster distance between pairs of instances within the same cluster, and the split of a cluster is the minimum distance between instances within the cluster and instances outside the cluster. Given a few labeled instances, this paper includes two aspects. First, we present a simple and fast clustering algorithm with the following property: if the ratio of the minimum split to the maximum diameter (RSD) of the optimal solution is greater than one, the algorithm returns optimal solutions for three clustering criteria. Second, we study the metric learning problem: learn a distance metric to make the RSD as large as possible. Compared with existing metric learning algorithms, one of our metric learning algorithms is computationally efficient: it is a linear programming model rather than a semidefinite programming model used by most of existing algorithms. We demonstrate empirically that the supervision and the learned metric can improve the clustering quality.
国家哲学社会科学文献中心版权所有