首页    期刊浏览 2024年12月02日 星期一
登录注册

文章基本信息

  • 标题:Efficient Test and Visualization of Multi-Set Intersections
  • 本地全文:下载
  • 作者:Minghui Wang ; Yongzhong Zhao ; Bin Zhang
  • 期刊名称:Scientific Reports
  • 电子版ISSN:2045-2322
  • 出版年度:2015
  • 卷号:5
  • 期号:1
  • DOI:10.1038/srep16923
  • 语种:English
  • 出版社:Springer Nature
  • 摘要:Identification of sets of objects with shared features is a common operation in all disciplines. Analysis of intersections among multiple sets is fundamental for in-depth understanding of their complex relationships. However, so far no method has been developed to assess statistical significance of intersections among three or more sets. Moreover, the state-of-the-art approaches for visualization of multi-set intersections are not scalable. Here, we first developed a theoretical framework for computing the statistical distributions of multi-set intersections based upon combinatorial theory, and then accordingly designed a procedure to efficiently calculate the exact probabilities of multi-set intersections. We further developed multiple efficient and scalable techniques to visualize multi-set intersections and the corresponding intersection statistics. We implemented both the theoretical framework and the visualization techniques in a unified R software package, SuperExactTest . We demonstrated the utility of SuperExactTest through an intensive simulation study and a comprehensive analysis of seven independently curated cancer gene sets as well as six disease or trait associated gene sets identified by genome-wide association studies. We expect SuperExactTest developed by this study will have a broad range of applications in scientific data analysis in many disciplines.
国家哲学社会科学文献中心版权所有