首页    期刊浏览 2025年06月07日 星期六
登录注册

文章基本信息

  • 标题:Topic Modeling for Maternal Health UsingReddit
  • 本地全文:下载
  • 作者:Shuang Gao ; Shivani Pandya ; Smisha Agarwal
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2021
  • 卷号:2021
  • 页码:69-76
  • 语种:English
  • 出版社:ACL Anthology
  • 摘要:This paper applies topic modeling to understand maternal health topics, concerns, and questions expressed in online communities on social networking sites. We examine Latent Dirichlet Analysis (LDA) and two state-of-the-art methods: neural topic model with knowledge distillation (KD) and Embedded Topic Model (ETM) on maternal health texts collected from Reddit. The models are evaluated on topic quality and topic inference, using both auto-evaluation metrics and human assessment. We analyze a disconnect between automatic metrics and human evaluations. While LDA performs the best overall with the auto-evaluation metrics NPMI and Coherence, Neural Topic Model with Knowledge Distillation is favorable by expert evaluation. We also create a new partially expert annotated gold-standard maternal health topic.
国家哲学社会科学文献中心版权所有