Article information

  • Title: Essential gene prediction in Drosophila melanogaster using machine learning approaches based on sequence and functional features
  • Authors: Olufemi Aromolaran ; Thomas Beder ; Marcus Oswald
  • Journal: Computational and Structural Biotechnology Journal
  • Print ISSN: 2001-0370
  • Publication year: 2020
  • Volume: 18
  • Pages: 612-621
  • DOI: 10.1016/j.csbj.2020.02.022
  • Publisher: Computational and Structural Biotechnology Journal
  • Abstract: Genes are termed essential if their loss of function compromises viability or results in a profound loss of fitness. On the genome scale, these genes can be determined experimentally employing RNAi or knockout screens, but this is very resource intensive. Computational methods for essential gene prediction can overcome this drawback, particularly when intrinsic features (e.g. from the protein sequence) as well as extrinsic features (e.g. from transcription profiles) are considered. In this work, we employed machine learning to predict essential genes in Drosophila melanogaster. A total of 27,340 features were generated based on a large variety of different aspects comprising nucleotide and protein sequences, gene networks, protein-protein interactions, evolutionary conservation and functional annotations. Employing cross-validation, we obtained an excellent prediction performance. The best model for D. melanogaster achieved a ROC-AUC of 0.90, a PR-AUC of 0.30 and an F1 score of 0.34. Our approach considerably outperformed a benchmark method in which only features derived from the protein sequences were used (P < 0.001). Investigating which features contributed to this success, we found that all categories of features contributed, most prominently network topological, functional and sequence-based features. To evaluate our approach, we performed the same workflow for essential gene prediction in human and achieved a ROC-AUC of 0.97, a PR-AUC of 0.73 and an F1 score of 0.64.
  • Keywords: Machine learning ; Essential genes ; Lethal ; Drosophila ; Essentiality prediction ; Homo sapiens