首页    期刊浏览 2025年02月18日 星期二
登录注册

文章基本信息

  • 标题:Identifying Hierarchical Structure in Sequences: A linear-time algorithm
  • 本地全文:下载
  • 作者:C. G. Nevill-Manning ; I. H. Witten
  • 期刊名称:Journal of Artificial Intelligence Research
  • 印刷版ISSN:1076-9757
  • 出版年度:1997
  • 卷号:7
  • 页码:67-82
  • 出版社:American Association of Artificial
  • 摘要:SEQUITUR is an algorithm that infers a hierarchical structure from a sequence of discrete symbols by replacing repeated phrases with a grammatical rule that generates the phrase, and continuing this process recursively. The result is a hierarchical representation of the original sequence, which offers insights into its lexical structure. The algorithm is driven by two constraints that reduce the size of the grammar, and produce structure as a by-product. SEQUITUR breaks new ground by operating incrementally. Moreover, the method's simple structure permits a proof that it operates in space and time that is linear in the size of the input. Our implementation can process 50,000 symbols per second and has been applied to an extensive range of real world sequences.
国家哲学社会科学文献中心版权所有