License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.CPM.2020.24
URN: urn:nbn:de:0030-drops-121497
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2020/12149/
Munro, J. Ian ;
Navarro, Gonzalo ;
Nekrich, Yakov
Text Indexing and Searching in Sublinear Time
Abstract
We introduce the first index that can be built in o(n) time for a text of length n, and can also be queried in o(q) time for a pattern of length q. On an alphabet of size σ, our index uses O(n log σ) bits, is built in O(n log σ / √{log n}) deterministic time, and computes the number of occurrences of the pattern in time O(q/log_σ n + log n log_σ n). Each such occurrence can then be found in O(log n) time. Other trade-offs between the space usage and the cost of reporting occurrences are also possible.
BibTeX - Entry
@InProceedings{munro_et_al:LIPIcs:2020:12149,
author = {J. Ian Munro and Gonzalo Navarro and Yakov Nekrich},
title = {{Text Indexing and Searching in Sublinear Time}},
booktitle = {31st Annual Symposium on Combinatorial Pattern Matching (CPM 2020)},
pages = {24:1--24:15},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-149-8},
ISSN = {1868-8969},
year = {2020},
volume = {161},
editor = {Inge Li G{\o}rtz and Oren Weimann},
publisher = {Schloss Dagstuhl--Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/opus/volltexte/2020/12149},
URN = {urn:nbn:de:0030-drops-121497},
doi = {10.4230/LIPIcs.CPM.2020.24},
annote = {Keywords: data structures, string indexes}
}
Keywords: |
|
data structures, string indexes |
Collection: |
|
31st Annual Symposium on Combinatorial Pattern Matching (CPM 2020) |
Issue Date: |
|
2020 |
Date of publication: |
|
09.06.2020 |