Emergent Mind

Abstract

The recently introduced longest common substring with $k$-mismatches ($k$-LCF) problem is to find, given two sequences $S1$ and $S2$ of length $n$ each, a longest substring $A1$ of $S1$ and $A2$ of $S2$ such that the Hamming distance between $A1$ and $A2$ is at most $k$. So far, the only subquadratic time result for this problem was known for $k = 1$~\cite{FGKU2014}. We first present two output-dependent algorithms solving the $k$-LCF problem and show that for $k = O(\log{1-\varepsilon} n)$, where $\varepsilon > 0$, at least one of them works in subquadratic time, using $O(n)$ words of space. The choice of one of these two algorithms to be applied for a given input can be done after linear time and space preprocessing. Finally we present a tabulation-based algorithm working, in its range of applicability, in $O(n2\log\min(k+\ell_0, \sigma)/\log n)$ time, where $\ell_0$ is the length of the standard longest common substring.

We're not able to analyze this paper right now due to high demand.

Please check back later (sorry!).

Generate a summary of this paper on our Pro plan:

We ran into a problem analyzing this paper.

Newsletter

Get summaries of trending comp sci papers delivered straight to your inbox:

Unsubscribe anytime.