首页    期刊浏览 2024年08月31日 星期六
登录注册

文章基本信息

  • 标题:Time-Space Trade-Offs for Lempel-Ziv Compressed Indexing
  • 本地全文:下载
  • 作者:Philip Bille ; Mikko Berggren Ettienne ; Inge Li G\ortz
  • 期刊名称:LIPIcs : Leibniz International Proceedings in Informatics
  • 电子版ISSN:1868-8969
  • 出版年度:2017
  • 卷号:78
  • 页码:16:1-16:17
  • DOI:10.4230/LIPIcs.CPM.2017.16
  • 出版社:Schloss Dagstuhl -- Leibniz-Zentrum fuer Informatik
  • 摘要:Given a string S, the compressed indexing problem is to preprocess S into a compressed representation that supports fast substring queries. The goal is to use little space relative to the compressed size of S while supporting fast queries. We present a compressed index based on the Lempel-Ziv 1977 compression scheme. Let n, and z denote the size of the input string, and the compressed LZ77 string, respectively. We obtain the following time-space trade-offs. Given a pattern string P of length m, we can solve the problem in (i) O(m + occ lglg n) time using O(z lg(n/z) lglg z) space, or (ii) O(m(1 + lg^e z / lg(n/z)) + occ(lglg n + lg^e z)) time using O(z lg(n/z)) space, for any 0 0, this becomes O(m). Our index also supports extraction of any substring of length l in O(l + lg(n/z)) time. Technically, our results are obtained by novel extensions and combinations of existing data structures of independent interest, including a new batched variant of weak prefix search.
  • 关键词:compressed indexing; pattern matching; LZ77; prefix search
国家哲学社会科学文献中心版权所有