首页    期刊浏览 2024年09月15日 星期日
登录注册

文章基本信息

  • 标题:Longest Common Substring with Approximately k Mismatches
  • 本地全文:下载
  • 作者:Tatiana Starikovskaya
  • 期刊名称:LIPIcs : Leibniz International Proceedings in Informatics
  • 电子版ISSN:1868-8969
  • 出版年度:2016
  • 卷号:54
  • 页码:21:1-21:11
  • DOI:10.4230/LIPIcs.CPM.2016.21
  • 出版社:Schloss Dagstuhl -- Leibniz-Zentrum fuer Informatik
  • 摘要:In the longest common substring problem we are given two strings of length n and must find a substring of maximal length that occurs in both strings. It is well-known that the problem can be solved in linear time, but the solution is not robust and can vary greatly when the input strings are changed even by one letter. To circumvent this, Leimester and Morgenstern introduced the problem of the longest common substring with k mismatches. Lately, this problem has received a lot of attention in the literature, and several algorithms have been suggested. The running time of these algorithms is n^{2-o(1)}, and unfortunately, conditional lower bounds have been shown which imply that there is little hope to improve this bound. In this paper we study a different but closely related problem of the longest common substring with approximately k mismatches and use computational geometry techniques to show that it admits a randomised solution with strongly subquadratic running time.
  • 关键词:Randomised algorithms; string similarity measures; longest common substring; sketching; locality-sensitive hashing
国家哲学社会科学文献中心版权所有