License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.CPM.2016.22
URN: urn:nbn:de:0030-drops-60719
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2016/6071/
Takagi, Takuya ;
Inenaga, Shunsuke ;
Arimura, Hiroki
Fully-online Construction of Suffix Trees for Multiple Texts
Abstract
We consider fully-online construction of indexing data structures for multiple texts. Let T = {T_1, ..., T_K} be a collection of texts. By fully-online, we mean that a new character can be appended to any text in T at any time. This is a natural generalization of semi-online construction of indexing data structures for multiple texts in which, after a new character is appended to the kth text T_k, then its previous texts T_1, ..., T_k-1 will remain static. Our fully-online scenario arises when we maintain dynamic indexes for multi-sensor data. Let N and sigma denote the total length of texts in T and the alphabet size, respectively. We first show that the algorithm by Blumer et al. [Theoretical Computer Science, 40:31-55, 1985] to construct the directed acyclic word graph (DAWG) for T can readily be extended to our fully-online setting, retaining O(N log sigma)-time and O(N)-space complexities. Then, we give a sophisticated fully-online algorithm which constructs the suffix tree for T in O(N log sigma) time and O(N) space. A key idea of this algorithm is synchronized maintenance of the DAWG and the suffix tree.
BibTeX - Entry
@InProceedings{takagi_et_al:LIPIcs:2016:6071,
author = {Takuya Takagi and Shunsuke Inenaga and Hiroki Arimura},
title = {{Fully-online Construction of Suffix Trees for Multiple Texts}},
booktitle = {27th Annual Symposium on Combinatorial Pattern Matching (CPM 2016)},
pages = {22:1--22:13},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-012-5},
ISSN = {1868-8969},
year = {2016},
volume = {54},
editor = {Roberto Grossi and Moshe Lewenstein},
publisher = {Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
address = {Dagstuhl, Germany},
URL = {http://drops.dagstuhl.de/opus/volltexte/2016/6071},
URN = {urn:nbn:de:0030-drops-60719},
doi = {10.4230/LIPIcs.CPM.2016.22},
annote = {Keywords: suffix trees, DAWGs, multiple texts, online algorithms}
}
Keywords: |
|
suffix trees, DAWGs, multiple texts, online algorithms |
Collection: |
|
27th Annual Symposium on Combinatorial Pattern Matching (CPM 2016) |
Issue Date: |
|
2016 |
Date of publication: |
|
27.06.2016 |