Basic article information

  • Title: 'Digital Narratives of COVID-19': A Twitter Dataset for Text Analysis in Spanish
  • Authors: Susanna Allés-Torrent; Gimena del Rio Riande; Jerry Bonnell
  • Journal: Journal of Open Humanities Data
  • Electronic ISSN: 2059-481X
  • Publication year: 2021
  • Volume: 7
  • DOI: 10.5334/johd.28
  • Language: English
  • Publisher: Ubiquity Press
  • Abstract: 'Digital Narratives of COVID-19' (DHCovid) offers a curated Twitter corpus of digital conversations about the Coronavirus pandemic. The dataset is collected through a script via Twitter’s Application Programming Interface (API) starting on April 24th, 2020, and stored on GitHub as an open access repository of tweet identifiers that can be consulted, downloaded, and reused by scholars interested in Natural Language Processing (NLP), topic modelling, and other quantitative methods. A stable version of the dataset has also been released through Zenodo. Twitter datasets are structured in three main collections: tweets in Spanish worldwide; geolocated tweets in six Spanish-speaking areas spanning North and Central America (Mexico, Colombia, Ecuador), South America (Argentina, Peru), and Europe (Spain); and geolocated tweets in English and Spanish from the greater Miami area in South Florida.