Article Information

  • Title: An Evaluation of a Knowledge Base of Words and Thesauruses on Measuring the Semantic Similarity between Words
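  • Authors: Takahiro Kawashima; Tsutomu Ishikawa
  • Journal: 人工知能学会論文誌 (Transactions of the Japanese Society for Artificial Intelligence)
  • Print ISSN: 1346-0714
  • Electronic ISSN: 1346-8030
  • Publication year: 2005
  • Volume: 20
  • Issue: 5
  • Pages: 326-336
  • DOI: 10.1527/tjsai.20.326
  • Publisher: The Japanese Society for Artificial Intelligence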
  • Abstract: We have developed a knowledge base of words as a tool for measuring the semantic similarity between words. In this paper, we evaluate this knowledge base by comparing it with thesauruses, which are commonly used for measuring similarity. Two thesauruses, NIHONGO-GOI-TAIKEI (NGT) and Japan Electronic Dictionary (EDR), are selected for the evaluation. For similarity calculation using thesauruses, we adopt, in addition to traditional methods based on the path length between categories or the depth of the subsumer, a newly proposed method in which each word is represented as a vector built from the structural features of the thesaurus and the degree of similarity between words is calculated as the inner product of their vectors. The evaluation is carried out with two methods: a traditional method based on human ratings, and a method we proposed previously that allows automatic evaluation without human judgment. The results show that the knowledge base of words is superior to both thesauruses as a measurement tool (with NGT outperforming EDR), and that the proposed calculation method outperforms the traditional ones. By examining the correlation between the two evaluation methods, the results also show that our evaluation method is practical.
  • Keywords: semantic similarity; knowledge base of words; thesaurus; similarity calculation method
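
The abstract names three families of thesaurus-based similarity measures: path length between categories, depth of the subsumer, and the inner product of word vectors. The sketch below illustrates the general shape of each on a toy hierarchy. It is not the authors' implementation: the `PARENT` hierarchy, the function names, the specific formulas (inverse path length, a Wu-Palmer-style depth ratio), and the vector construction (one unit-normalized component per ancestor category) are all assumptions made for illustration, since the abstract does not specify them.

```python
import math

# Hypothetical toy thesaurus: category -> parent category (the root maps to None).
PARENT = {
    "entity": None,
    "animal": "entity",
    "plant": "entity",
    "dog": "animal",
    "cat": "animal",
    "tree": "plant",
}


def ancestors(cat):
    """Return the path from cat up to the root, both inclusive."""
    path = []
    while cat is not None:
        path.append(cat)
        cat = PARENT[cat]
    return path


def depth(cat):
    """Depth of a category, counting the root as depth 1."""
    return len(ancestors(cat))


def subsumer(a, b):
    """Deepest category that subsumes (is an ancestor of) both a and b."""
    anc_b = set(ancestors(b))
    for c in ancestors(a):  # walks upward, so the first hit is the deepest
        if c in anc_b:
            return c
    raise ValueError("categories do not share a root")


def path_length_sim(a, b):
    """Assumed path-based measure: 1 / (1 + number of edges between a and b)."""
    s = subsumer(a, b)
    edges = (depth(a) - depth(s)) + (depth(b) - depth(s))
    return 1.0 / (1.0 + edges)


def subsumer_depth_sim(a, b):
    """Assumed depth-based measure: 2 * depth(subsumer) / (depth(a) + depth(b))."""
    return 2.0 * depth(subsumer(a, b)) / (depth(a) + depth(b))


def category_vector(cat):
    """Assumed vector form: equal weight on every ancestor, scaled to unit length."""
    path = ancestors(cat)
    norm = math.sqrt(len(path))
    return {c: 1.0 / norm for c in path}


def inner_product_sim(a, b):
    """Similarity as the inner product of the two category vectors."""
    va, vb = category_vector(a), category_vector(b)
    return sum(w * vb.get(c, 0.0) for c, w in va.items())


if __name__ == "__main__":
    for pair in [("dog", "cat"), ("dog", "tree")]:
        print(pair,
              round(path_length_sim(*pair), 3),
              round(subsumer_depth_sim(*pair), 3),
              round(inner_product_sim(*pair), 3))
```

In this toy hierarchy all three measures rank "dog" and "cat" as more similar than "dog" and "tree". A real thesaurus such as NGT or EDR can assign a word to several categories, so a practical version would need to aggregate over category pairs, which this sketch leaves out.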