首页    期刊浏览 2024年11月24日 星期日
登录注册

文章基本信息

  • 标题:Hidden Markov Dirichlet Process: Modeling Genetic Inference in Open Ancestral Space
  • 本地全文:下载
  • 作者:Eric P. Xing ; Kyung-Ah Sohn
  • 期刊名称:Bayesian Analysis
  • 印刷版ISSN:1931-6690
  • 电子版ISSN:1936-0975
  • 出版年度:2007
  • 卷号:2
  • 期号:3
  • 页码:501--528,
  • 出版社:International Society for Bayesian Analysis
  • 摘要:The problem of inferring the population structure, linkage disequi- librium pattern, and chromosomal recombination hotspots from genetic polymor- phism data is essential for understanding the origin and characteristics of genome variations, with important applications to the genetic analysis of disease propen- sities and other complex traits. Statistical genetic methodologies developed so far mostly address these problems separately using specialized models ranging from coalescence and admixture models for population structures, to hidden Markov models and renewal processes for recombination; but most of these approaches ignore the inherent uncertainty in the genetic complexity (e.g., the number of ge- netic founders of a population) of the data and the close statistical and biological relationships among objects studied in these problems. We present a new statis- tical framework called hidden Markov Dirichlet process (HMDP) to jointly model the genetic recombinations among a possibly in nite number of founders and the coalescence-with-mutation events in the resulting genealogies. The HMDP posits that a haplotype of genetic markers is generated by a sequence of recombination events that select an ancestor for each locus from an unbounded set of founders according to a 1st-order Markov transition process. Conjoining this process with a mutation model, our method accommodates both between-lineage recombina- tion and within-lineage sequence variations, and leads to a compact and natural interpretation of the population structure and inheritance process underlying hap- lotype data. We have developed an ecient sampling algorithm for HMDP based on a two-level nested Polya urn scheme, and we present experimental results on joint inference of population structure, linkage disequilibrium, and recombination hotspots based on HMDP. On both simulated and real SNP haplotype data, our method performs competitively or signi cantly better than extant methods in un- covering the recombination hotspots along chromosomal loci; and in addition it also infers the ancestral genetic patterns and o ers a highly accurate map of an- cestral compositions of modern populations.
  • 关键词:Dirichlet Process, Hierarchical DP, hidden Markov model, MCMC, statistical genetics, recombination, population structure, SNP.
国家哲学社会科学文献中心版权所有