Article Information

  • Title: Comparison of Collocation Extraction Measures for Document Indexing
  • Authors: Dalbelo Basic, Bojana ; Kolar, Mladen ; Snajder, Jan
  • Journal: Journal of Computing and Information Technology
  • Print ISSN: 1330-1136
  • Electronic ISSN: 1846-3908
  • Publication year: 2006
  • Volume: 14
  • Issue: 4
  • Pages: 321-327
  • Publisher: SRCE - Sveučilišni računski centar
  • Abstract: Automatic extraction of collocations from a corpus is a well-known problem in the field of natural language processing. It is typically carried out by employing some kind of statistical measure that indicates whether or not two words occur together more often than by chance. As there is an abundance of these measures proposed by various authors, we have compared some of them on the task of extracting collocations from a corpus of Croatian legal documents for the purpose of document indexing. We propose and evaluate extensions of these measures for collocations consisting of three words.
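
As an illustration of the kind of statistical measure the abstract refers to, the following is a minimal sketch of pointwise mutual information (PMI), a commonly used association measure for two-word collocations. The abstract does not name the measures the paper compares, so the choice of PMI, the function name `pmi_scores`, the `min_count` threshold, and the toy sentence are all assumptions made here for illustration only.

```python
import math
from collections import Counter


def pmi_scores(tokens, min_count=2):
    """Score adjacent word pairs by pointwise mutual information (PMI).

    PMI(w1, w2) = log2( P(w1, w2) / (P(w1) * P(w2)) ).
    A positive score means the pair occurs together more often than
    would be expected if the two words were independent.
    Illustrative sketch only; not the method of the cited paper.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)

    scores = {}
    for (w1, w2), count in bigrams.items():
        # PMI is unreliable for rare pairs, so skip low-frequency bigrams.
        if count < min_count:
            continue
        p_pair = count / n_bi
        p_w1 = unigrams[w1] / n_uni
        p_w2 = unigrams[w2] / n_uni
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))
    return scores


# Toy input (hypothetical, not taken from the paper's corpus).
tokens = "the supreme court ruled that the supreme court may review the law".split()
scores = pmi_scores(tokens)
for pair, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 2))
```

In this toy input, "supreme court" scores higher than "the supreme" because "the" also appears in other contexts, which lowers the association. A three-word extension, as the abstract proposes, would need a definition of independence for trigrams; this sketch does not attempt that.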