
Article Information

  • Title: Evaluating the Representativeness of Socio-Demographic Variables over Time for Geo-Social Media Data
  • Authors: Andreas Petutschnig; Bernd Resch; Stefan Lang
  • Journal: ISPRS International Journal of Geo-Information
  • Electronic ISSN: 2220-9964
  • Year of publication: 2021
  • Volume: 10
  • Issue: 5
  • Page: 323
  • DOI: 10.3390/ijgi10050323
  • Language: English
  • Publisher: MDPI AG
  • Abstract: Geo-social media data are widely used as a data source to model populations and processes in a variety of contexts. However, if the data do not adequately represent the population they are drawn from, analysis results will be biased. Unaddressed, these biases may lead to false interpretations and conclusions. In this paper, we propose a generic methodology for investigating the representativeness of geo-social media data for population groups of similar statistical predictive power based on reference data. The groups are designed to be spatially coherent regions with similar prediction errors. Based on these units, we investigate the influence of different socio-demographic covariates on the representativeness. We perform experiments based on over 1.6 billion tweets and 90 socio-demographic covariates. We demonstrate that Twitter data representativeness varies strongly over time and space. Our results show that densely populated areas tend to be underrepresented consistently in non-spatial models. Over time, some covariates like the number of people aged 20 years exhibit highly different effects on the prediction models, whereas others are much more stable. The spatial effects can most frequently be explained using spatial error models, indicating spatially related errors that indicate the necessity of additional covariates. Finally, we provide hints for interpreting the results of our approach for researchers using the concepts presented in this paper.