Article information

  • Title: Lexibank, a public repository of standardized wordlists with computed phonological and lexical features
  • Authors: Johann-Mattis List; Robert Forkel; Simon J. Greenhill
  • Journal: Scientific Data
  • Electronic ISSN: 2052-4463
  • Publication year: 2022
  • Volume: 9
  • Issue: 1
  • Pages: 1-16
  • DOI: 10.1038/s41597-022-01432-0
  • Language: English
  • Publisher: Nature Publishing Group
  • Abstract: The past decades have seen substantial growth in digital data on the world’s languages. At the same time, the demand for cross-linguistic datasets has been increasing, as witnessed by numerous studies devoted to diverse questions on human prehistory, cultural evolution, and human cognition. Unfortunately, most published datasets lack standardization, which makes their comparison difficult. Here, we present a new approach to increase the comparability of cross-linguistic lexical data. We have designed workflows for the computer-assisted lifting of datasets to Cross-Linguistic Data Formats, a collection of standards that make these datasets more Findable, Accessible, Interoperable, and Reusable (FAIR). We test the Lexibank workflow on 100 lexical datasets from which we derive an aggregated database of wordlists in unified phonetic transcriptions covering more than 2000 language varieties. We illustrate the benefits of our approach by showing how phonological and lexical features can be automatically inferred, complementing and expanding existing cross-linguistic datasets.