
Basic article information

  • Title: Placement of Nouns in a Multi-Dimensional Space Based on Words' Cooccurrency
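  • Authors: Yoichi Tomiura ; Shosaku Tanaka ; Toru Hitaka
  • Journal: 人工知能学会論文誌 (Transactions of the Japanese Society for Artificial Intelligence)
  • Print ISSN: 1346-0714
  • Online ISSN: 1346-8030
  • Publication year: 2004
  • Volume: 19
  • Issue: 1
  • Pages: 1-9
  • DOI: 10.1527/tjsai.19.1
  • Publisher: The Japanese Society for Artificial Intelligence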
  • Abstract: The semantic similarity (or distance) between words is a basic piece of knowledge in Natural Language Processing. Several previous studies have measured this similarity (or distance) using word vectors in a multi-dimensional space. In those studies, high-dimensional feature vectors of words are built from word cooccurrence in a corpus or from reference relations in a dictionary, and the word vectors are then computed from the feature vectors by a method such as principal component analysis. This paper proposes a new method for placing nouns in a multi-dimensional space based on word cooccurrence in a corpus. The proposed method does not use high-dimensional feature vectors of words; instead, it is based on the idea that "vectors corresponding to nouns which cooccur with a word w in a relation f constitute a group in the multi-dimensional space". Although the word vectors obtained by the proposed method do not reflect the whole meaning of nouns, the semantic similarity (or distance) between nouns defined with these word vectors is suitable for an example-based disambiguation method.
  • Keywords: word vector ; multivariate analysis ; semantic similarity
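
As background to the abstract, the earlier approach it contrasts against (high-dimensional cooccurrence feature vectors reduced by a method such as principal component analysis) might be sketched roughly as below. This illustrates that prior technique only, not the placement method proposed in the paper. The toy nouns, the (relation, word) contexts, the counts, and the function names are all hypothetical and chosen purely for illustration.

```python
import numpy as np

# Hypothetical toy data: rows are nouns, columns are (relation, word) contexts.
nouns = ["beer", "wine", "car", "truck"]
contexts = ["obj-of:drink", "obj-of:pour", "obj-of:drive", "obj-of:park"]
counts = np.array([
    [8, 5, 0, 0],   # beer
    [7, 6, 0, 1],   # wine
    [0, 0, 9, 6],   # car
    [0, 1, 7, 8],   # truck
], dtype=float)


def pca_word_vectors(feature_vectors, dim):
    """Project feature vectors onto their top `dim` principal components."""
    # Center each context column, then take the SVD of the centered matrix.
    centered = feature_vectors - feature_vectors.mean(axis=0)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    # Rows of u * s are the coordinates of each noun in the principal axes.
    return u[:, :dim] * s[:dim]


def euclidean(a, b):
    """Distance between two word vectors (smaller means more similar)."""
    return float(np.linalg.norm(a - b))


vectors = pca_word_vectors(counts, dim=2)
index = {noun: i for i, noun in enumerate(nouns)}

d_beer_wine = euclidean(vectors[index["beer"]], vectors[index["wine"]])
d_beer_car = euclidean(vectors[index["beer"]], vectors[index["car"]])
print("beer-wine:", d_beer_wine)
print("beer-car: ", d_beer_car)
```

In this toy setting, nouns that share contexts (beer and wine) end up closer together than nouns that do not (beer and car). According to the abstract, the paper's own method avoids building such high-dimensional feature vectors altogether.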