Article information

  • Title: Biomarker discovery for predicting spontaneous preterm birth from gene expression data by regularized logistic regression
  • Authors: Lingyu Li ; Zhi-Ping Liu
  • Journal: Computational and Structural Biotechnology Journal
  • Print ISSN: 2001-0370
  • Publication year: 2020
  • Volume: 18
  • Pages: 3434-3446
  • DOI: 10.1016/j.csbj.2020.10.028
  • Publisher: Computational and Structural Biotechnology Journal
  • Abstract: In this work, we provide a computational method based on regularized logistic regression for discovering biomarkers of spontaneous preterm birth (SPTB) from gene expression data. The successful identification of SPTB biomarkers will greatly benefit the interference of infant gestational age, reducing the risks to pregnant women and preemies. In recent years, various feature selection approaches have been proposed for identifying the subset of meaningful genes that can accurately classify disease samples against controls. Here, we comprehensively summarize regularized logistic regression with seven effective penalties developed for selecting genes strongly indicative of SPTB from microarray data. We compare their properties and assess their classification performance on multiple datasets. The results show that the elastic net, lasso, L1/2 and SCAD penalties perform better than the others and can be successfully used to identify biomarkers of SPTB. In particular, we perform a functional enrichment analysis of these biomarkers and construct a logistic regression classifier based on them. The classifier generates a preterm risk score (PRS) as an indicator for predicting SPTB. Based on the trained predictor, we verify the identified biomarkers on an independent dataset. The biomarkers achieve an AUC of 0.933 in SPTB classification. The results demonstrate the effectiveness and efficiency of the proposed strategy of biomarker discovery with regularized logistic regression. The proposed method of discovering biomarkers for SPTB can be readily extended to other complex diseases.
  • Keywords: Biomarker discovery ; Spontaneous preterm birth ; Gene expression data ; Regularized logistic regression ; Feature selection ; Preterm risk score
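
The abstract describes selecting indicative genes with penalized logistic regression and then scoring samples with the fitted classifier. As a rough illustration of that general idea only, the sketch below fits a lasso-penalized logistic regression by proximal gradient descent on synthetic data and uses the predicted probability as a risk score. It is not the authors' implementation: the data, the function names (`fit_lasso_logistic`, `risk_score`, `soft_threshold`), and the values of `lam`, `step`, and `iters` are all assumptions made for this example, and only one of the seven penalties mentioned in the abstract is shown.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def soft_threshold(v, t):
    """Proximal operator of t * |v|; this is what sets weights exactly to zero."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def risk_score(x, w, b):
    """Predicted probability of the positive class for one sample (hypothetical score)."""
    return sigmoid(b + sum(wj * xj for wj, xj in zip(w, x)))


def fit_lasso_logistic(X, y, lam=0.1, step=0.5, iters=300):
    """Proximal gradient descent on mean logistic loss + lam * ||w||_1."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(iters):
        grad_w, grad_b = [0.0] * p, 0.0
        for xi, yi in zip(X, y):
            err = risk_score(xi, w, b) - yi
            grad_b += err
            for j in range(p):
                grad_w[j] += err * xi[j]
        b -= step * grad_b / n  # the intercept is not penalized
        w = [soft_threshold(w[j] - step * grad_w[j] / n, step * lam) for j in range(p)]
    return w, b


# Synthetic stand-in for expression data (assumption): 10 "genes", only 0 and 1 matter.
rng = random.Random(0)
X = [[rng.gauss(0.0, 1.0) for _ in range(10)] for _ in range(400)]
y = [1 if 2.0 * x[0] - 2.0 * x[1] + rng.gauss(0.0, 0.5) > 0 else 0 for x in X]

w, b = fit_lasso_logistic(X, y)
selected = [j for j, wj in enumerate(w) if wj != 0.0]
accuracy = sum((risk_score(x, w, b) > 0.5) == (yi == 1) for x, yi in zip(X, y)) / len(X)
print("selected features:", selected)
print("training accuracy:", round(accuracy, 3))
```

The soft-thresholding step is what makes the lasso act as a feature selector: weights whose gradient signal stays below `lam` are held at exactly zero, so the nonzero entries of `w` form the candidate feature subset. A real analysis would tune `lam` by cross-validation and evaluate on an independent dataset, as the abstract describes.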