Article information

  • Title: Using part-of-speech tags as deep-syntax indicators in determining short-text semantic similarity
  • Authors: Batanović Vuk; Bojić Dragan
  • Journal: Computer Science and Information Systems
  • Print ISSN: 1820-0214
  • Electronic ISSN: 2406-1018
  • Publication year: 2015
  • Volume: 12
  • Issue: 1
  • Pages: 1-31
  • DOI: 10.2298/CSIS131127082B
  • Publisher: ComSIS Consortium
  • Abstract:

    This paper presents POST STSS, a method of determining short-text semantic similarity in which part-of-speech tags are used as indicators of the deeper syntactic information usually extracted by more advanced tools like parsers and semantic role labelers. Our model employs a part-of-speech weighting scheme and is based on a statistical bag-of-words approach. It requires neither hand-crafted knowledge bases nor advanced syntactic tools, which makes it easily applicable to languages with limited natural language processing resources. Using a paraphrase recognition test, we demonstrate that our system achieves a higher accuracy than all existing statistical similarity algorithms and solutions of a more structural kind. [Project of the Ministry of Science of the Republic of Serbia, no. TR 32047]

  • Keywords: short-text semantic similarity; statistical similarity; corpus-based measures; part-of-speech tags; POS weighting; syntactic information; bag-of-words model; natural language processing
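
The abstract describes a bag-of-words similarity in which each word's contribution is scaled by a part-of-speech weight. The following is a minimal sketch of that general idea only, not the POST STSS algorithm itself: the weight values, the tag names, the exact-match word comparison, and the function names are all assumptions made for illustration, since the abstract gives none of these details (the keywords suggest the actual method relies on corpus-based word similarity rather than exact matching).

```python
import math
from collections import Counter

# Hypothetical POS weights, chosen only for illustration; the paper's actual
# weighting scheme is not given in the abstract.
POS_WEIGHTS = {"NOUN": 1.0, "VERB": 0.8, "ADJ": 0.6, "ADV": 0.4}
DEFAULT_WEIGHT = 0.1  # assumed weight for function words and any other tag


def weighted_bag(tagged_tokens):
    """Build a bag-of-words vector; each occurrence adds its POS weight."""
    bag = Counter()
    for word, tag in tagged_tokens:
        bag[word.lower()] += POS_WEIGHTS.get(tag, DEFAULT_WEIGHT)
    return bag


def cosine_similarity(bag_a, bag_b):
    """Cosine similarity between two sparse weighted bags."""
    dot = sum(w * bag_b.get(word, 0.0) for word, w in bag_a.items())
    norm_a = math.sqrt(sum(w * w for w in bag_a.values()))
    norm_b = math.sqrt(sum(w * w for w in bag_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty text is treated as dissimilar to everything
    return dot / (norm_a * norm_b)


def pos_weighted_similarity(tagged_a, tagged_b):
    """Similarity of two short texts given as (word, POS tag) pairs."""
    return cosine_similarity(weighted_bag(tagged_a), weighted_bag(tagged_b))


# Pre-tagged input; in practice the tags would come from a POS tagger.
s1 = [("The", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")]
s2 = [("A", "DET"), ("cat", "NOUN"), ("sleeps", "VERB"), ("soundly", "ADV")]
print(round(pos_weighted_similarity(s1, s2), 3))
```

In this sketch, a mismatch between determiners barely lowers the score, while a mismatch between nouns or verbs lowers it substantially, which is the intuition behind using POS tags as a cheap stand-in for deeper syntactic information.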