首页    期刊浏览 2025年02月18日 星期二
登录注册

文章基本信息

  • 标题:Transliteration as Alignment vs. Transliteration as Generation for Crosslingual Information Retrieval
  • 本地全文:下载
  • 作者:Anil Kumar Singh ; Sethuramalingam Subramaniam ; Taraka Rama
  • 期刊名称:Traitement Automatique des Langues
  • 印刷版ISSN:1248-9433
  • 电子版ISSN:1965-0906
  • 出版年度:2010
  • 卷号:51
  • 期号:2
  • 出版社:ATALA - Assoc Traitement Automatique Langues
  • 摘要:Crosslingual Information Retrieval (CLIR) usually requires query translation and, due to named entities in the case of IR, query translation requires a good transliteration system when writing systems differ. Transliteration can be seen as a problem of generation or alignment. For IR, since we can extract a word list from the corpus being searched, it should be seen as an alignment problem. The shift from generation to alignment can lead to higher transliteration accuracies and significant improvements in the CLIR results. We were able to achieve an increase (over generation) in the CLIR Mean Average Precision by 22.66% and 29.08% for English to Hindi and English to Marathi, respectively.
国家哲学社会科学文献中心版权所有