首页    期刊浏览 2025年02月20日 星期四
登录注册

文章基本信息

  • 标题:Improved Named Entity Recognition using Machine Translation-based Cross-lingual Information
  • 本地全文:下载
  • 作者:Sandipan Dandapat ; Andy Way
  • 期刊名称:Computación y Sistemas
  • 印刷版ISSN:1405-5546
  • 出版年度:2016
  • 卷号:20
  • 期号:3
  • 页码:495-504
  • 语种:English
  • 出版社:Instituto Politécnico Nacional
  • 其他摘要:In this paper, we describe a technique to improve named entity recognition in a resource-poor language (Hindi) by using cross-lingual information. We use an on-line machine translation system and a separate word alignment phase to find the projection of each Hindi word into the translated English sentence. We estimate the cross-lingual features using an English named entity recognizer and the alignment information. We use these cross-lingual features in a support vector machine-based classifier. The use of cross-lingual features improves F1 score by 2.1 points absolute (2.9% relative) over a good-performing baseline model.
  • 其他关键词:Named entity recognition; machine translation; cross-lingual information.
国家哲学社会科学文献中心版权所有