首页    期刊浏览 2024年11月28日 星期四
登录注册

文章基本信息

  • 标题:Minimum Segmentation for Pan-genomic Founder Reconstruction in Linear Time
  • 本地全文:下载
  • 作者:Tuukka Norri ; Bastien Cazaux ; Dmitry Kosolobov
  • 期刊名称:LIPIcs : Leibniz International Proceedings in Informatics
  • 电子版ISSN:1868-8969
  • 出版年度:2018
  • 卷号:113
  • 页码:1-15
  • DOI:10.4230/LIPIcs.WABI.2018.15
  • 出版社:Schloss Dagstuhl -- Leibniz-Zentrum fuer Informatik
  • 摘要:Given a threshold L and a set R = {R_1, ..., R_m} of m strings (haplotype sequences), each having length n, the minimum segmentation problem for founder reconstruction is to partition [1,n] into set P of disjoint segments such that each segment [a,b] in P has length at least L and the number d(a,b)=|{R_i[a,b] : 1 <= i <= m}| of distinct substrings at segment [a,b] is minimized over [a,b] in P. The distinct substrings in the segments represent founder blocks that can be concatenated to form max{d(a,b) : [a,b] in P} founder sequences representing the original R such that crossovers happen only at segment boundaries. We give an optimal O(mn) time algorithm to solve the problem, improving over earlier O(mn^2). This improvement enables to exploit the algorithm on a pan-genomic setting of input strings being aligned haplotype sequences of complete human chromosomes, with a goal of finding a representative set of references that can be indexed for read alignment and variant calling. We implemented the new algorithm and give some experimental evidence on the practicality of the approach on this pan-genomic setting.
  • 关键词:Pan-genome indexing; founder reconstruction; dynamic programming; positional Burrows-Wheeler transform; range minimum query
国家哲学社会科学文献中心版权所有