首页    期刊浏览 2025年02月20日 星期四
登录注册

文章基本信息

  • 标题:Detecting Clusters of Outliers with Information Theoretic Clustering
  • 本地全文:下载
  • 作者:Shin Ando ; Einoshin Suzuki
  • 期刊名称:人工知能学会論文誌
  • 印刷版ISSN:1346-0714
  • 电子版ISSN:1346-8030
  • 出版年度:2008
  • 卷号:23
  • 期号:5
  • 页码:344-354
  • DOI:10.1527/tjsai.23.344
  • 出版社:The Japanese Society for Artificial Intelligence
  • 摘要:Identifying atypical objects is one of the classic tasks in machine learning. Recent works, e.g., One-class Clustering and Minority Detection, have explored the task further to identify clusters of atypical objects which strongly contrast from the rest of the dataset. In such problems, avoiding false positive detection is an important yet significantly difficult issue. In this paper, we propose an information theoretic clustering which aims to compactly represent the global and local structures of the dataset and identify atypical clusters in terms of information geometric distance. The former objective contributes to reducing the number of false positive detections. Its formalization further yields a unifying view of the classic outlier detection and the novel tasks. We present a scalable algorithm for detecting multiple clusters of atypical objects without a pre-defined number of clusters. The algorithm is evaluated as an unsupervised two-class classification using simulated datasets and a text classification benchmark.
  • 关键词:Information Bottleneck ; information theoretic clustering ; outlier detection ; minority detection
国家哲学社会科学文献中心版权所有