Article Information

  • Title: A Field Sensor: computing the composition and intent of PubMed queries
  • Authors: Yeganova, Lana ; Kim, Won ; Comeau, Donald C
  • Journal: Database
  • Print ISSN: 1758-0463
  • Electronic ISSN: 1758-0463
  • Publication year: 2018
  • Volume: 2018
  • Issue: 1
  • DOI: 10.1093/database/bay052
  • Publisher: Oxford University Press
  • Abstract: PubMed® is a search engine providing access to a collection of over 27 million biomedical bibliographic records as of 2017. PubMed processes millions of queries a day, and understanding these queries is one of the main building blocks for successful information retrieval. In this work, we present Field Sensor, a domain-specific tool for understanding the composition and predicting the user intent of PubMed queries. Given a query, the Field Sensor infers a field for each token or sequence of tokens in the query in a multi-step process that includes syntactic chunking, rule-based tagging and probabilistic field prediction. The fields of interest are those associated with the (meta-)data elements of each PubMed record, such as article title, abstract, author name(s), journal title, volume, issue, page and date. We evaluate the accuracy of our algorithm on a human-annotated corpus of 10 000 PubMed queries, as well as a new machine-annotated set of 103 000 PubMed queries. The Field Sensor achieves accuracies of 93% and 91% on these two corpora, respectively, and finds that nearly half of all searches are navigational (e.g. author searches, article title searches, etc.) and half are informational (e.g. topical searches). The Field Sensor has been integrated into PubMed since June 2017 to detect informational queries for which results sorted by relevance can be suggested as an alternative to those sorted by the default date sort. In addition, the composition of PubMed queries as computed by the Field Sensor proves to be essential for understanding how users query PubMed.
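
The three steps named in the abstract (chunking, rule-based tagging, probabilistic field prediction) can be illustrated with a toy sketch. This is not the authors' implementation: the regular expressions, the probability table, the `text` fallback field and every function name below are assumptions made up for illustration, and whitespace splitting merely stands in for real syntactic chunking.

```python
import re

# Hypothetical rules: the abstract does not list the actual tagging rules.
YEAR = re.compile(r"^(19|20)\d{2}$")   # four-digit year such as 2018
PAGES = re.compile(r"^\d+-\d+$")       # page range such as 1-10

# Made-up per-token field probabilities standing in for a learned model.
TOKEN_FIELD_PROBS = {
    "yeganova": {"author": 0.95, "text": 0.05},
    "kim": {"author": 0.90, "text": 0.10},
    "database": {"journal": 0.60, "text": 0.40},
}


def chunk(query):
    """Stand-in for syntactic chunking: lowercase and split on whitespace."""
    return query.lower().split()


def tag_by_rule(token):
    """Return a field if a hand-written rule fires, otherwise None."""
    if YEAR.match(token):
        return "date"
    if PAGES.match(token):
        return "page"
    return None


def predict_field(token):
    """Pick the most probable field; unknown tokens default to 'text'."""
    probs = TOKEN_FIELD_PROBS.get(token)
    if probs is None:
        return "text"
    return max(probs, key=probs.get)


def sense_fields(query):
    """Tag each token: rules first, then the probabilistic fallback."""
    return [(tok, tag_by_rule(tok) or predict_field(tok)) for tok in chunk(query)]


if __name__ == "__main__":
    for token, field in sense_fields("Yeganova Database 2018 1-10 genomics"):
        print(f"{token}\t{field}")
```

Running the rules before the probabilistic step is a design choice of this sketch: unambiguous patterns such as years and page ranges are settled cheaply, and only the remaining tokens need a statistical guess.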