
Basic article information

  • Title: Sequence-Based Discovery of Antibacterial Peptides Using Ensemble Gradient Boosting
  • Authors: Ehdieh Khaledian; Shira L. Broschat
  • Journal: Proceedings
  • Electronic ISSN: 2504-3900
  • Publication year: 2020
  • Volume: 54
  • Issue: 61
  • Pages: 6
  • DOI: 10.3390/proceedings2020066006
  • Language: English
  • Publisher: MDPI AG
  • Abstract: Antimicrobial resistance is driving pharmaceutical companies to investigate different therapeutic approaches. One approach that has garnered growing consideration in drug development is the use of antimicrobial peptides (AMPs). Antibacterial peptides (ABPs), which occur naturally as part of the immune response, can serve as powerful, broad-spectrum antibiotics. However, conventional laboratory procedures for screening and discovering ABPs are expensive and time-consuming. Identification of ABPs can be significantly improved using computational methods. In this paper, we introduce a machine learning method for the fast and accurate prediction of ABPs. We gathered more than 6000 peptides from publicly available datasets and extracted 1209 features (peptide characteristics) from these sequences. We selected the set of optimal features by applying correlation-based and random forest feature selection techniques. Finally, we designed an ensemble gradient boosting model (GBM) to predict putative ABPs. We evaluated our model using receiver operating characteristic (ROC) curves, calculating the area under the curve (AUC) for several different models for comparison, including a recurrent neural network, a support vector machine, and iAMPpred. The AUC for the GBM was ~0.98, more than 3% better than any of the other models.
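
Two of the steps named in the abstract, correlation-based feature filtering and AUC evaluation, can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation: the function names, the 0.9 correlation threshold, and the toy data are assumptions made for illustration, and the random forest selection step and the gradient boosting model itself are not reproduced here.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0  # constant column: treat as uncorrelated
    return cov / sqrt(vx * vy)


def drop_correlated(columns, threshold=0.9):
    """Greedy filter: keep a feature only if its |r| with every
    already-kept feature is at most `threshold` (assumed value).

    `columns` maps a feature name to its list of values.
    """
    kept = []
    for name, values in columns.items():
        if all(abs(pearson(values, columns[k])) <= threshold for k in kept):
            kept.append(name)
    return kept


def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs in which the
    positive example receives the higher score; ties count as 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data, invented for illustration (not taken from the paper).
features = {
    "length": [10, 12, 14, 16, 18],
    "length_x2": [20, 24, 28, 32, 36],  # perfectly correlated with "length"
    "net_charge": [3, -1, 2, 0, 1],
}
selected = drop_correlated(features)  # the redundant "length_x2" is dropped
auc = roc_auc([1, 1, 0, 1, 0], [0.9, 0.8, 0.7, 0.6, 0.2])
```

The pairwise definition of AUC used here is equivalent to the area under the ROC curve and makes the metric easy to check by hand on small examples; for datasets of the size described in the abstract, a library routine would be the practical choice.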