首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:Fast and Linear-Time String Matching Algorithms Based on the Distances of q-Gram Occurrences
  • 本地全文:下载
  • 作者:Satoshi Kobayashi ; Diptarama Hendrian ; Ryo Yoshinaka
  • 期刊名称:LIPIcs : Leibniz International Proceedings in Informatics
  • 电子版ISSN:1868-8969
  • 出版年度:2020
  • 卷号:160
  • 页码:13:1-13:13
  • DOI:10.4230/LIPIcs.SEA.2020.13
  • 出版社:Schloss Dagstuhl -- Leibniz-Zentrum fuer Informatik
  • 摘要:Given a text T of length n and a pattern P of length m, the string matching problem is a task to find all occurrences of P in T. In this study, we propose an algorithm that solves this problem in O((n + m)q) time considering the distance between two adjacent occurrences of the same q-gram contained in P. We also propose a theoretical improvement of it which runs in O(n + m) time, though it is not necessarily faster in practice. We compare the execution times of our and existing algorithms on various kinds of real and artificial datasets such as an English text, a genome sequence and a Fibonacci string. The experimental results show that our algorithm is as fast as the state-of-the-art algorithms in many cases, particularly when a pattern frequently appears in a text.
  • 关键词:String matching algorithm; text processing
国家哲学社会科学文献中心版权所有