Emergent Mind

Longest common substrings with k mismatches

(1409.1694)
Published Sep 5, 2014 in cs.DS

Abstract

The longest common substring with $k$-mismatches problem is to find, given two strings $S1$ and $S2$, a longest substring $A1$ of $S1$ and $A2$ of $S2$ such that the Hamming distance between $A1$ and $A2$ is $\le k$. We introduce a practical $O(nm)$ time and $O(1)$ space solution for this problem, where $n$ and $m$ are the lengths of $S1$ and $S2$, respectively. This algorithm can also be used to compute the matching statistics with $k$-mismatches of $S1$ and $S2$ in $O(nm)$ time and $O(m)$ space. Moreover, we also present a theoretical solution for the $k = 1$ case which runs in $O(n \log m)$ time, assuming $m\le n$, and uses $O(m)$ space, improving over the existing $O(nm)$ time and $O(m)$ space bound of Babenko and Starikovskaya.

We're not able to analyze this paper right now due to high demand.

Please check back later (sorry!).

Generate a summary of this paper on our Pro plan:

We ran into a problem analyzing this paper.

Newsletter

Get summaries of trending comp sci papers delivered straight to your inbox:

Unsubscribe anytime.