License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.CPM.2017.18
URN: urn:nbn:de:0030-drops-73234
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2017/7323/
I, Tomohiro
Longest Common Extensions with Recompression
Abstract
Given two positions i and j in a string T of length N, a longest common extension (LCE) query asks for the length of the longest common prefix between suffixes beginning at i and j. A compressed LCE data structure stores T in a compressed form while supporting fast LCE queries. In this article we show that the recompression technique is a powerful tool for compressed LCE data structures. We present a new compressed LCE data structure of size O(z lg (N/z)) that supports LCE queries in O(lg N) time, where z is the size of Lempel-Ziv 77 factorization without self-reference of T. Given T as an uncompressed form, we show how to build our data structure in O(N) time and space. Given T as a grammar compressed form, i.e., a straight-line program of size n generating T, we show how to build our data structure in O(n lg (N/n)) time and O(n + z lg (N/z)) space. Our algorithms are deterministic and always return correct answers.
BibTeX - Entry
@InProceedings{i:LIPIcs:2017:7323,
author = {Tomohiro I},
title = {{Longest Common Extensions with Recompression}},
booktitle = {28th Annual Symposium on Combinatorial Pattern Matching (CPM 2017)},
pages = {18:1--18:15},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-039-2},
ISSN = {1868-8969},
year = {2017},
volume = {78},
editor = {Juha K{\"a}rkk{\"a}inen and Jakub Radoszewski and Wojciech Rytter},
publisher = {Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
address = {Dagstuhl, Germany},
URL = {http://drops.dagstuhl.de/opus/volltexte/2017/7323},
URN = {urn:nbn:de:0030-drops-73234},
doi = {10.4230/LIPIcs.CPM.2017.18},
annote = {Keywords: Longest Common Extension (LCE) queries, compressed data structure, grammar compressed strings, Straight-Line Program (SLP)}
}
Keywords: |
|
Longest Common Extension (LCE) queries, compressed data structure, grammar compressed strings, Straight-Line Program (SLP) |
Collection: |
|
28th Annual Symposium on Combinatorial Pattern Matching (CPM 2017) |
Issue Date: |
|
2017 |
Date of publication: |
|
30.06.2017 |