首页    期刊浏览 2025年05月31日 星期六
登录注册

文章基本信息

  • 标题:BERT-based Multi-Task Model for Country and Province LevelMSAand DialectalArabic Identification
  • 本地全文:下载
  • 作者:Abdellah El Mekki ; Abdelkader El Mahdaouy ; Kabil Essefar
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2021
  • 卷号:2021
  • 页码:271-275
  • 语种:English
  • 出版社:ACL Anthology
  • 摘要:Dialect and standard language identification are crucial tasks for many Arabic natural language processing applications. In this paper, we present our deep learning-based system, submitted to the second NADI shared task for country-level and province-level identification of Modern Standard Arabic (MSA) and Dialectal Arabic (DA). The system is based on an end-to-end deep Multi-Task Learning (MTL) model to tackle both country-level and province-level MSA/DA identification. The latter MTL model consists of a shared Bidirectional Encoder Representation Transformers (BERT) encoder, two task-specific attention layers, and two classifiers. Our key idea is to leverage both the task-discriminative and the inter-task shared features for country and province MSA/DA identification. The obtained results show that our MTL model outperforms single-task models on most subtasks.
国家哲学社会科学文献中心版权所有