首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:UniBic: Sequential row-based biclustering algorithm for analysis of gene expression data
  • 本地全文:下载
  • 作者:Zhenjia Wang ; Guojun Li ; Robert W. Robinson
  • 期刊名称:Scientific Reports
  • 电子版ISSN:2045-2322
  • 出版年度:2016
  • 卷号:6
  • 期号:1
  • DOI:10.1038/srep23466
  • 语种:English
  • 出版社:Springer Nature
  • 摘要:Biclustering algorithms, which aim to provide an effective and efficient way to analyze gene expression data by finding a group of genes with trend-preserving expression patterns under certain conditions, have been widely developed since Morgan et al. pioneered a work about partitioning a data matrix into submatrices with approximately constant values. However, the identification of general trend-preserving biclusters which are the most meaningful substructures hidden in gene expression data remains a highly challenging problem. We found an elementary method by which biologically meaningful trend-preserving biclusters can be readily identified from noisy and complex large data. The basic idea is to apply the longest common subsequence (LCS) framework to selected pairs of rows in an index matrix derived from an input data matrix to locate a seed for each bicluster to be identified. We tested it on synthetic and real datasets and compared its performance with currently competitive biclustering tools. We found that the new algorithm, named UniBic, outperformed all previous biclustering algorithms in terms of commonly used evaluation scenarios except for BicSPAM on narrow biclusters. The latter was somewhat better at finding narrow biclusters, the task for which it was specifically designed.
国家哲学社会科学文献中心版权所有