首页    期刊浏览 2025年02月19日 星期三
登录注册

文章基本信息

  • 标题:Reconceptualizing the classification of PNAS articles
  • 本地全文:下载
  • 作者:Edoardo M. Airoldi ; Elena A. Erosheva ; Stephen E. Fienberg
  • 期刊名称:Proceedings of the National Academy of Sciences
  • 印刷版ISSN:0027-8424
  • 电子版ISSN:1091-6490
  • 出版年度:2010
  • 卷号:107
  • 期号:49
  • 页码:20899-20904
  • DOI:10.1073/pnas.1013452107
  • 语种:English
  • 出版社:The National Academy of Sciences of the United States of America
  • 摘要:PNAS article classification is rooted in long-standing disciplinary divisions that do not necessarily reflect the structure of modern scientific research. We reevaluate that structure using latent pattern models from statistical machine learning, also known as mixed-membership models, that identify semantic structure in co-occurrence of words in the abstracts and references. Our findings suggest that the latent dimensionality of patterns underlying PNAS research articles in the Biological Sciences is only slightly larger than the number of categories currently in use, but it differs substantially in the content of the categories. Further, the number of articles that are listed under multiple categories is only a small fraction of what it should be. These findings together with the sensitivity analyses suggest ways to reconceptualize the organization of papers published in PNAS.
  • 关键词:text analysis ; hierarchical modeling ; Monte Carlo Markov chain ; variational inference ; Dirichlet process
国家哲学社会科学文献中心版权所有