Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Computing matching statistics on Wheeler DFAs (2301.05338v1)

Published 13 Jan 2023 in cs.DS

Abstract: Matching statistics were introduced to solve the approximate string matching problem, which is a recurrent subroutine in bioinformatics applications. In 2010, Ohlebusch et al. [SPIRE 2010] proposed a time and space efficient algorithm for computing matching statistics which relies on some components of a compressed suffix tree - notably, the longest common prefix (LCP) array. In this paper, we show how their algorithm can be generalized from strings to Wheeler deterministic finite automata. Most importantly, we introduce a notion of LCP array for Wheeler automata, thus establishing a first clear step towards extending (compressed) suffix tree functionalities to labeled graphs.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Alessio Conte (16 papers)
  2. Nicola Cotumaccio (15 papers)
  3. Travis Gagie (123 papers)
  4. Giovanni Manzini (38 papers)
  5. Nicola Prezza (59 papers)
  6. Marinella Sciortino (23 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.