License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.ICALP.2021.71
URN: urn:nbn:de:0030-drops-141400
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2021/14140/
Ganguly, Arnab ;
Patel, Dhrumil ;
Shah, Rahul ;
Thankachan, Sharma V.
LF Successor: Compact Space Indexing for Order-Isomorphic Pattern Matching
Abstract
Two strings are order isomorphic iff the relative ordering of their characters is the same at all positions. For a given text T[1,n] over an ordered alphabet of size σ, we can maintain an order-isomorphic suffix tree/array in O(nlog n) bits and support (order-isomorphic) pattern/substring matching queries efficiently. It is interesting to know if we can encode these structures in space close to the text’s size of nlogσ bits. We answer this question positively by presenting an O(nlog σ)-bit index that allows access to any entry in order-isomorphic suffix array (and its inverse array) in t_{SA} = {O}(log²n/logσ) time. For any pattern P given as a query, this index can count the number of substrings of T that are order-isomorphic to P (denoted by occ) in {O}((|P|logσ+t_{SA})log n) time using standard techniques. Also, it can report the locations of those substrings in additional O(occ ⋅ t_{SA}) time.
BibTeX - Entry
@InProceedings{ganguly_et_al:LIPIcs.ICALP.2021.71,
author = {Ganguly, Arnab and Patel, Dhrumil and Shah, Rahul and Thankachan, Sharma V.},
title = {{LF Successor: Compact Space Indexing for Order-Isomorphic Pattern Matching}},
booktitle = {48th International Colloquium on Automata, Languages, and Programming (ICALP 2021)},
pages = {71:1--71:19},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-195-5},
ISSN = {1868-8969},
year = {2021},
volume = {198},
editor = {Bansal, Nikhil and Merelli, Emanuela and Worrell, James},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/opus/volltexte/2021/14140},
URN = {urn:nbn:de:0030-drops-141400},
doi = {10.4230/LIPIcs.ICALP.2021.71},
annote = {Keywords: Succinct data structures, Pattern Matching}
}
Keywords: |
|
Succinct data structures, Pattern Matching |
Collection: |
|
48th International Colloquium on Automata, Languages, and Programming (ICALP 2021) |
Issue Date: |
|
2021 |
Date of publication: |
|
02.07.2021 |