期刊名称:GI_FORUM - Journal for Geographic Information Science
电子版ISSN:2308-1708
出版年度:2021
卷号:9
期号:1
页码:68-75
DOI:10.1553/giscience2021_01_s68
语种:English
出版社:ÖAW Verlag, Wien
摘要:In this paper, we show a framework for partial bot rejection based on spatially supervised text mining from social media messages. We show qualitative results towards the reduction of known bots and give hints on how this cleaning technique can help us in filling gaps of current signals related to human life on Earth based on social media. The bot rejection framework is based on using a spatial signal for supervising a machine learning model with extreme label noise still being able to reject some of the unwanted components of the social media stream. Furthermore, we comment that such models show significant biases and can, therefore, not be used responsibly without bias analysis and mitigation per application.
关键词:social media analysis;text mining;data cleaning