首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:Molecular sequence accuracy and the analysis of protein coding regions.
  • 本地全文:下载
  • 作者:D J States ; D Botstein
  • 期刊名称:Proceedings of the National Academy of Sciences
  • 印刷版ISSN:0027-8424
  • 电子版ISSN:1091-6490
  • 出版年度:1991
  • 卷号:88
  • 期号:13
  • 页码:5518-5522
  • DOI:10.1073/pnas.88.13.5518
  • 语种:English
  • 出版社:The National Academy of Sciences of the United States of America
  • 摘要:Molecular sequences, like all experimental data, have finite error rates. The impact of errors on the information content of molecular sequence data is dependent on the analytic paradigm used to interpret the data. We studied the impact of nucleic acid sequence errors on the ability to align predicted amino acid sequences with the sequences of related proteins. We found that with a simultaneous translation and alignment algorithm, identification of sequence homologies is resilient to the introduction of random errors. Proteins with greater than 30% sequence identity can be reliably recognized even in the presence of 1% frameshifting (insertion or deletion) error rates and 5% base substitution rates. Incorporation of prior knowledge about the location and characteristics of errors improves tolerance to error of amino acid sequence alignments. Similarly, inclusion of prior knowledge of biased codon utilization by yeast (Saccharomyces cerevisiae) allows reliable detection of correct reading frames in yeast sequences even in the presence of 5% substitution and 1% frameshift errors.
国家哲学社会科学文献中心版权所有