首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:How to Match Bilingual Tweets?
  • 本地全文:下载
  • 作者:Karima Abidi ; Kamel Smaili
  • 期刊名称:Computer Science & Information Technology
  • 电子版ISSN:2231-5403
  • 出版年度:2017
  • 卷号:7
  • 期号:3
  • 页码:95-106
  • DOI:10.5121/csit.2017.70309
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:In this paper, we propose a method that aligns comparable bilingual tweets which, not onlytakes into account the specificity of a Tweet, but treats also proper names, dates and numbers intwo different languages. This permits to retrieve more relevant target tweets. The process ofmatching proper names between Arabic and English is a difficult task, because these twolanguages use different scripts. For that, we used an approach which projects the sounds of anEnglish proper name into Arabic and aligns it with the most appropriate proper name. Weevaluated the method with a classical measure and compared it to the one we developed. Theexperiments have been achieved on two parallel corpora and shows that our measureoutperforms the baseline by 5.6% at R@1 recall.
  • 关键词:Comparability measure ; Arabic stemming; Proper names; Soundex; Twitter
国家哲学社会科学文献中心版权所有