
Article Information

  • Title: Persian SemCor: A Bag of Word Sense Annotated Corpus for the Persian Language
  • Authors: Hossein Rouhizadeh ; Mehrnoush Shamsfard ; Mahdi Dehghan
  • Venue: Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • Publication year: 2021
  • Volume: 2021
  • Pages: 147-156
  • Language: English
  • Publisher: ACL Anthology
  • Abstract: Supervised approaches usually achieve the best performance on the Word Sense Disambiguation (WSD) problem. However, the unavailability of large sense-annotated corpora for many low-resource languages makes these approaches inapplicable to them in practice. In this paper, we mitigate this issue for the Persian language by proposing a fully automatic approach for obtaining Persian SemCor (PerSemCor), a Persian Bag-of-Word (BoW) sense-annotated corpus. We evaluated PerSemCor both intrinsically and extrinsically and showed that it can be effectively used as a training set for Persian supervised WSD systems. To encourage future research on Persian Word Sense Disambiguation, we release PerSemCor at http://nlp.sbu.ac.ir.