Article Information

Title: Spartans@LT-EDI-EACL2021: Inclusive Speech Detection using Pretrained Language Models
Authors: Megha Sharma; Gaurav Arora
Journal: Conference on European Chapter of the Association for Computational Linguistics (EACL)
Publication year: 2021
Volume: 2021
Pages: 188-192
Language: English
Publisher: ACL Anthology
Abstract: We describe our system that ranked first in the Hope Speech Detection (HSD) shared task and fourth in the Offensive Language Identification (OLI) shared task, both in the Tamil language. The goal of HSD and OLI is to identify whether a code-mixed comment or post contains hope speech or offensive content, respectively. We pre-train a transformer-based model, RoBERTa, using synthetically generated code-mixed data and use it in an ensemble along with the pre-trained ULMFiT model available from iNLTK.