Basic article information

  • Title: Spatially Supervised Text Mining for Social Media Cleaning and Preprocessing (Abstract)
  • Author: Martin Werner
  • Journal: GI_FORUM - Journal for Geographic Information Science
  • Electronic ISSN: 2308-1708
  • Publication year: 2021
  • Volume: 9
  • Issue: 1
  • Pages: 68-75
  • DOI: 10.1553/giscience2021_01_s68
  • Language: English
  • Publisher: ÖAW Verlag, Wien
  • Abstract: In this paper, we show a framework for partial bot rejection based on spatially supervised text mining from social media messages. We show qualitative results towards the reduction of known bots and give hints on how this cleaning technique can help us in filling gaps of current signals related to human life on Earth based on social media. The bot rejection framework is based on using a spatial signal for supervising a machine learning model with extreme label noise still being able to reject some of the unwanted components of the social media stream. Furthermore, we comment that such models show significant biases and can, therefore, not be used responsibly without bias analysis and mitigation per application.
  • Keywords: social media analysis; text mining; data cleaning