首页    期刊浏览 2024年11月13日 星期三
登录注册

文章基本信息

  • 标题:Automatic classification of protein structure by using Gauss integrals
  • 本地全文:下载
  • 作者:Peter Røgen ; Boris Fain
  • 期刊名称:Proceedings of the National Academy of Sciences
  • 印刷版ISSN:0027-8424
  • 电子版ISSN:1091-6490
  • 出版年度:2003
  • 卷号:100
  • 期号:1
  • 页码:119-124
  • DOI:10.1073/pnas.2636460100
  • 语种:English
  • 出版社:The National Academy of Sciences of the United States of America
  • 摘要:We introduce a method of looking at, analyzing, and comparing protein structures. The topology of a protein is captured by 30 numbers inspired by Vassiliev knot invariants. To illustrate the simplicity and power of this topological approach, we construct a measure (scaled Gauss metric, SGM) of similarity of protein shapes. Under this metric, protein chains naturally separate into fold clusters. We use SGM to construct an automatic classification procedure for the CATH2.4 database. The method is very fast because it requires neither alignment of the chains nor any chain-chain comparison. It also has only one adjustable parameter. We assign 95.51% of the chains into the proper C (class), A (architecture), T (topology), and H (homologous superfamily) fold, find all new folds, and detect no false geometric positives. Using the SGM, we display a "map" of the space of folds projected onto two dimensions, show the relative locations of the major structural classes, and "zoom into" the space of proteins to show architecture, topology, and fold clusters. The existence of a simple measure of a protein fold computed from the chain path will have a major impact on automatic fold classification.
  • 关键词:CATH protein database|scaled Gauss metric|structural genomics| knot theory
国家哲学社会科学文献中心版权所有