首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:The Game Walkthrough Corpus (GWTC) – A Resource for the Analysis of Textual Game Descriptions
  • 本地全文:下载
  • 作者:Manuel Burghardt ; Jochen Tiepmar
  • 期刊名称:Journal of Open Humanities Data
  • 电子版ISSN:2059-481X
  • 出版年度:2021
  • 卷号:7
  • DOI:10.5334/johd.34
  • 语种:English
  • 出版社:Ubiquity Press
  • 摘要:We present the Game Walkthrough Corpus (GWTC), which contains 12,295 unique walkthrough documents covering 6,117 games. For each game walkthrough, we provide frequencies of unigrams and bigrams, treating the walkthrough document as a Bag of Words. In addition, we provide word frequencies at the sentence level. Furthermore, the GWTC contains a number of game-related metadata, including title, publisher, developer, year, and genre. All the language statistics and metadata are stored in separate plain text files and can be referenced through uniform resource names (URN). These URNs can also be used to derive any combination of statistics and metadata. Researchers, for instance, can investigate the most frequent unigrams for games in the “Adventure” genre. This way, the GWTC can be reused for different kinds of research questions on gaming language.
国家哲学社会科学文献中心版权所有