首页    期刊浏览 2024年10月07日 星期一
登录注册

文章基本信息

  • 标题:Suffix Arrays with a Twist
  • 本地全文:下载
  • 作者:Tomasz M. Kowalski ; Szymon Grabowski ; Kimmo Fredriksson
  • 期刊名称:COMPUTING AND INFORMATICS
  • 印刷版ISSN:1335-9150
  • 出版年度:2019
  • 卷号:38
  • 期号:3
  • 页码:555-574
  • DOI:10.31577/cai 2019 3 555
  • 出版社:COMPUTING AND INFORMATICS
  • 摘要:The suffix array is a classic full-text index, combining effectiveness with simplicity. We discuss three approaches aiming to improve its efficiency even more: changes to the navigation, data layout and adding extra data. In short, we show that i) the way how we search for the right interval boundary impacts significantly the overall search speed, ii) a B-tree data layout easily wins over the standard one, iii) the well-known idea of a lookup table for the prefixes of the suffixes can be refined with using compression, iv) caching prefixes of the suffixes in a helper array can pose another practical space-time tradeoff.
  • 其他摘要:The suffix array is a classic full-text index, combining effectiveness with simplicity. We discuss three approaches aiming to improve its efficiency even more: changes to the navigation, data layout and adding extra data. In short, we show that i) the way how we search for the right interval boundary impacts significantly the overall search speed, ii) a B-tree data layout easily wins over the standard one, iii) the well-known idea of a lookup table for the prefixes of the suffixes can be refined with using compression, iv) caching prefixes of the suffixes in a helper array can pose another practical space-time tradeoff.
  • 关键词:Suffix array; data structures; text indexes; hashing
  • 其他关键词:Suffix array; data structures; text indexes; hashing
国家哲学社会科学文献中心版权所有