Streaming Algorithms For Computing Edit Distance Without Exploiting Suffix Trees

(1607.03718)
Published Jul 13, 2016 in cs.DS and cs.CC

Abstract

The edit distance quantifies how similar two strings are by counting the minimum number of character insertions, deletions, and substitutions required to transform one string into the other. In this paper we study the problem of computing the edit distance between a pair of strings whose distance is bounded by a parameter $k \ll n$. We present two streaming algorithms for computing edit distance: one runs in time $O(n+k^2)$ and the other in time $n+O(k^3)$. By writing $n+O(k^3)$ we want to emphasize that the number of operations per input symbol is a small constant. In particular, the running time does not depend on the alphabet size, and the algorithm should be easy to implement. Previously, a streaming algorithm with running time $O(n+k^4)$ was given in a paper by the current authors (STOC'16). The best off-line algorithm runs in time $O(n+k^2)$ (Landau et al., 1998), which is known to be optimal under the Strong Exponential Time Hypothesis.
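To make the bounded setting concrete, here is a minimal Python sketch of the textbook banded dynamic program, which decides whether the edit distance is at most $k$ using $O(nk)$ cell updates. It is not one of the paper's streaming algorithms, nor the $O(n+k^2)$ off-line algorithm of Landau et al.; it only illustrates why a small bound $k$ helps. The function name `bounded_edit_distance` and its signature are assumptions made for this illustration.

```python
from typing import Optional


def bounded_edit_distance(a: str, b: str, k: int) -> Optional[int]:
    """Return the edit distance of a and b if it is at most k, else None.

    Illustrative banded DP (hypothetical helper, not the paper's method):
    only cells (i, j) with |i - j| <= k are filled in.
    """
    n, m = len(a), len(b)
    if abs(n - m) > k:
        return None  # the length difference alone already exceeds k

    inf = k + 1  # any value above k is treated as "too far"
    prev = [min(j, inf) for j in range(m + 1)]  # row 0: j insertions
    cur = [inf] * (m + 1)

    for i in range(1, n + 1):
        lo = max(0, i - k)
        hi = min(m, i + k)
        if lo == 0:
            cur[0] = min(i, inf)  # column 0: i deletions
            start = 1
        else:
            cur[lo - 1] = inf  # cell just left of the band is unreachable
            start = lo
        for j in range(start, hi + 1):
            substitute = prev[j - 1] + (a[i - 1] != b[j - 1])
            delete = prev[j] + 1
            insert = cur[j - 1] + 1
            cur[j] = min(substitute, delete, insert, inf)
        prev, cur = cur, prev  # reuse the two row buffers

    return prev[m] if prev[m] <= k else None
```

For example, `bounded_edit_distance("kitten", "sitting", 3)` returns 3, while the same call with `k=2` returns `None` because the true distance exceeds the bound. Restricting the table to a band of width $2k+1$ is what brings the work down from $O(n^2)$ to $O(nk)$; the algorithms discussed in the abstract improve on this further.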
