首页    期刊浏览 2025年07月01日 星期二
登录注册

文章基本信息

  • 标题:Using Twitter to Detect Hate Crimes and Their Motivations: The HateMotiv Corpus
  • 本地全文:下载
  • 作者:Noha Alnazzawi
  • 期刊名称:Data
  • 印刷版ISSN:2306-5729
  • 出版年度:2022
  • 卷号:7
  • 期号:6
  • 页码:1-10
  • DOI:10.3390/data7060069
  • 语种:English
  • 出版社:MDPI Publishing
  • 摘要:With the rapidly increasing use of social media platforms, much of our lives is spent online. Despite the great advantages of using social media, unfortunately, the spread of hate, cyberbullying, harassment, and trolling can be very common online. Many extremists use social media platforms to communicate their messages of hatred and spread violence, which may result in serious psychological consequences and even contribute to real-world violence. Thus, the aim of this research was to build the HateMotiv corpus, a freely available dataset that is annotated for types of hate crimes and the motivation behind committing them. The dataset was developed using Twitter as an example of social media platforms and could provide the research community with a very unique, novel, and reliable dataset. The dataset is unique as a consequence of its topic-specific nature and its detailed annotation. The corpus was annotated by two annotators who are experts in annotation based on unified guidelines, so they were able to produce an annotation of a high standard with F-scores for the agreement rate as high as 0.66 and 0.71 for type and motivation labels of hate crimes, respectively.
  • 关键词:text mining;corpus construction;annotation guidelines;hate crime motivation
国家哲学社会科学文献中心版权所有