首页    期刊浏览 2024年10月06日 星期日
登录注册

文章基本信息

  • 标题:M3C: Monte Carlo reference-based consensus clustering
  • 本地全文:下载
  • 作者:Christopher R. John ; David Watson ; Dominic Russ
  • 期刊名称:Scientific Reports
  • 电子版ISSN:2045-2322
  • 出版年度:2020
  • 卷号:10
  • 期号:1
  • 页码:1-14
  • DOI:10.1038/s41598-020-58766-1
  • 出版社:Springer Nature
  • 摘要:Genome-wide data is used to stratify patients into classes for precision medicine using clustering algorithms. A common problem in this area is selection of the number of clusters (K). The Monti consensus clustering algorithm is a widely used method which uses stability selection to estimate K. However, the method has bias towards higher values of K and yields high numbers of false positives. As a solution, we developed Monte Carlo reference-based consensus clustering (M3C), which is based on this algorithm. M3C simulates null distributions of stability scores for a range of K values thus enabling a comparison with real data to remove bias and statistically test for the presence of structure. M3C corrects the inherent bias of consensus clustering as demonstrated on simulated and real expression data from The Cancer Genome Atlas (TCGA). For testing M3C, we developed clusterlab, a new method for simulating multivariate Gaussian clusters.
国家哲学社会科学文献中心版权所有