Article Information

  • Title: Cross-Lingual Sentiment Classification from English to Arabic using Machine Translation
  • Authors: Adel Al-Shabi; Aisah Adel; Nazlia Omar
  • Journal: International Journal of Advanced Computer Science and Applications (IJACSA)
  • Print ISSN: 2158-107X
  • Electronic ISSN: 2156-5570
  • Publication year: 2017
  • Volume: 8
  • Issue: 12
  • DOI: 10.14569/IJACSA.2017.081257
  • Publisher: Science and Information Society (SAI)
  • Abstract: Cross-lingual sentiment learning is becoming increasingly important due to the multilingual nature of user-generated content on social media and the scarce resources for languages other than English. However, cross-lingual sentiment learning is a challenging task due to the different distribution between translated data and original data and due to the language gap, i.e., each language has its own ways to express sentiments. This work explores the adaptation of English resources for sentiment analysis to a new language, Arabic. The aim is to design a light model for cross-lingual sentiment classification from English to Arabic that requires no manual annotation effort and, at the same time, is easy to build and does not require deep linguistic analysis. The ultimate goal is to find an optimal baseline model and to determine the relation between the noise in the translated data and the accuracy of sentiment classification. Different configurations of several factors are investigated, including feature representation, feature reduction methods, and the learning algorithms, to find the optimal baseline model. Experiments show that a good classification model can be obtained from translated data regardless of the artificial noise added by machine translation. The results also show a significant cost to automation, and thus the best path to future enhancement is through the inclusion of language-specific knowledge and resources.
  • Keywords: Cross-lingual sentiment classification; English to Arabic; machine translation