首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:Ensemble Modeling Approach Targeting Heterogeneous RNA-Seq data: Application to Melanoma Pseudogenes
  • 本地全文:下载
  • 作者:Enrico Capobianco ; Camilo Valdes ; Samanta Sarti
  • 期刊名称:Scientific Reports
  • 电子版ISSN:2045-2322
  • 出版年度:2017
  • 卷号:7
  • 期号:1
  • 页码:17344
  • DOI:10.1038/s41598-017-17337-7
  • 语种:English
  • 出版社:Springer Nature
  • 摘要:We studied the transcriptome landscape of skin cutaneous melanoma (SKCM) using 103 primary tumor samples from TCGA, and measured the expression levels of both protein coding genes and non-coding RNAs (ncRNAs). In particular, we emphasized pseudogenes potentially relevant to this cancer. While cataloguing the profiles based on the known biotypes, all the employed RNA-Seq methods generated just a small consensus of significant biotypes. We thus designed an approach to reconcile the profiles from all methods following a simple strategy: we selected genes that were confirmed as differentially expressed by the ensemble predictions obtained in a regression model. The main advantages of this approach are: 1) Selection of a high-confidence gene set identifying relevant pathways; 2) Use of a regression model whose covariates embed all method-driven outcomes to predict an averaged profile; 3) Method-specific assessment of prediction power and significance. Furthermore, the approach can be generalized to any biological system for which noisy RNA-Seq profiles are computed. As our analyses concerned bio-annotations of both high-quality protein coding genes and ncRNAs, we considered the associations between pseudogenes and parental genes (targets). Among the candidate targets that were validated, we identified PINK1, which is studied in patients with Parkinson and cancer (especially melanoma).
国家哲学社会科学文献中心版权所有