首页    期刊浏览 2024年11月27日 星期三
登录注册

文章基本信息

  • 标题:Generic medical concept embedding and time decay for diverse patient outcome prediction tasks
  • 本地全文:下载
  • 作者:Yupeng Li ; Wei Dong ; Boshu Ru
  • 期刊名称:iScience
  • 印刷版ISSN:2589-0042
  • 出版年度:2022
  • 卷号:25
  • 期号:9
  • 页码:1-15
  • DOI:10.1016/j.isci.2022.104880
  • 语种:English
  • 出版社:Elsevier
  • 摘要:SummaryMany fields, including Natural Language Processing (NLP), have recently witnessed the benefit of pre-training with large generic datasets to improve the accuracy of prediction tasks. However, there exist key differences between the longitudinal healthcare data (e.g., claims) and NLP tasks, which make the direct application of NLP pre-training methods to healthcare data inappropriate. In this article, we developed a pre-training scheme for longitudinal healthcare data that leverages the pairing of medical history and a future event. We then conducted systematic evaluations of various methods on ten patient-level prediction tasks encompassing adverse events, misdiagnosis, disease risks, and readmission. In addition to substantially reducing model size, our results show that a universal medical concept embedding pretrained with generic big data as well as carefully designed time decay modeling improves the accuracy of different downstream prediction tasks.Graphical abstractDisplay OmittedHighlights•This work develops a pre-training scheme for longitudinal healthcare data•The method leverages the pairing of medical history and a future event•We created a universal medical concept embedding pretrained with generic data•We designed a time-decay method for medical concept dataHealth sciences; Bioinformatics; Medical informatics; Automation in bioinformatics
国家哲学社会科学文献中心版权所有