首页    期刊浏览 2024年11月29日 星期五
登录注册

文章基本信息

  • 标题:Arabic Named Entity Recognition: Using Features Extracted from Noisy Data
  • 本地全文:下载
  • 作者:Yassine Benajiba ; Imed Zitouni ; Mona Diab
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2010
  • 卷号:2010
  • 出版社:ACL Anthology
  • 摘要:Building an accurate Named Entity Recognition (NER) system for languages with complex morphology is a challenging task. In this paper, we present research that explores the feature space using both gold and bootstrapped noisy features to build an improved highly accurate Arabic NER system. We bootstrap noisy features by projection from an Arabic-English parallel corpus that is automatically tagged with a baseline NER system. The feature space covers lexical, morphological, and syntactic features. The proposed approach yields an improvement of up to 1.64 F-measure (absolute).
国家哲学社会科学文献中心版权所有