首页    期刊浏览 2024年07月05日 星期五
登录注册

文章基本信息

  • 标题:Whole genome sequencing data of multiple individuals of Pakistani descent
  • 本地全文:下载
  • 作者:Shahid Y. Khan ; Muhammad Ali ; Mei-Chong W. Lee
  • 期刊名称:Scientific Data
  • 电子版ISSN:2052-4463
  • 出版年度:2020
  • 卷号:7
  • 期号:1
  • 页码:1-9
  • DOI:10.1038/s41597-020-00664-2
  • 语种:English
  • 出版社:Nature Publishing Group
  • 摘要:Here we report whole genome sequencing of four individuals (H3, H4, H5, and H6) from a family of Pakistani descent. Whole genome sequencing yielded 1084.92, 894.73, 1068.62, and 1005.77 million mapped reads corresponding to 162.73, 134.21, 160.29, and 150.86鈥塆b sequence data and 52.49x, 43.29x, 51.70x, and 48.66x average coverage for H3, H4, H5, and H6, respectively. We identified 3,529,659, 3,478,495, 3,407,895, and 3,426,862 variants in the genomes of聽H3, H4, H5, and H6, respectively, including 1,668,024 variants common in the four genomes. Further, we identified 42,422, 39,824, 28,599, and 35,206 novel variants in the genomes of聽H3, H4, H5, and H6, respectively. A major fraction of the variants identified in the four genomes聽reside within the intergenic regions of the genome. Single nucleotide polymorphism (SNP) genotype based comparative analysis with ethnic populations of 1000 Genomes database linked the ancestry of all four genomes with the South Asian populations, which was further supported by mitochondria based haplogroup analysis. In conclusion, we report whole genome sequencing of four individuals of Pakistani descent.
国家哲学社会科学文献中心版权所有