文章基本信息

标题：Discovering Concepts from Word Co-occurrences with a Relational Model
本地全文：下载
作者：Kenichi Kurihara ; Yoshitaka Kameya ; Taisuke Sato 等
期刊名称：人工知能学会論文誌
印刷版ISSN：1346-0714
电子版ISSN：1346-8030
出版年度：2007
卷号：22
期号：2
页码：218-226
DOI：10.1527/tjsai.22.218
出版社：The Japanese Society for Artificial Intelligence
摘要：Clustering word co-occurrences has been studied to discover clusters as latent concepts. Previous work has applied the semantic aggregate model (SAM), and reports that discovered clusters seem semantically significant. The SAM assumes a co-occurrence arises from one latent concept. This assumption seems moderately natural. However, to analyze latent concepts more deeply, the assumption may be too restrictive. We propose to make clusters for each part of speech from co-occurrence data. For example, we make adjective clusters and noun clusters from adjective--noun co-occurrences while the SAM builds clusters of ``co-occurrences.'' The proposed approach allows us to analyze adjectives and nouns independently.
关键词：clustering ; Dirichlet process ; variational inference ; relational learning