License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.TIME.2021.2
URN: urn:nbn:de:0030-drops-147785
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2021/14778/
Pedersen, Torben Bach
Extreme-Scale Model-Based Time Series Management with ModelarDB (Invited Talk)
Abstract
To monitor critical industrial devices such as wind turbines, high quality sensors sampled at a high frequency are increasingly used. Current technology does not handle these extreme-scale time series well [Søren Kejser Jensen et al., 2017], so only simple aggregates are traditionally stored, removing outliers and fluctuations that could indicate problems. As a remedy, we present a model-based approach for managing extreme-scale time series that approximates the time series values using mathematical functions (models) and stores only model coefficients rather than data values. Compression is done both for individual time series and for correlated groups of time series. The keynote will present concepts, techniques, and algorithms from model-based time series management and our implementation of these in the open source Time Series Management System (TSMS) ModelarDB[Søren Kejser Jensen et al., 2018; Søren Kejser Jensen et al., 2019; Søren Kejser Jensen et al., 2021] . Furthermore, it will present our experimental evaluation of ModelarDB on extreme-scale real-world time series, which shows that that compared to widely used Big Data formats, ModelarDB provides up to 14× faster ingestion due to high compression, 113× better compression due to its adaptability, 573× faster aggregatation by using models, and close to linear scale-out scalability. ModelarDB is being commercialized by the spin-out company ModelarData.
BibTeX - Entry
@InProceedings{pedersen:LIPIcs.TIME.2021.2,
author = {Pedersen, Torben Bach},
title = {{Extreme-Scale Model-Based Time Series Management with ModelarDB}},
booktitle = {28th International Symposium on Temporal Representation and Reasoning (TIME 2021)},
pages = {2:1--2:2},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-206-8},
ISSN = {1868-8969},
year = {2021},
volume = {206},
editor = {Combi, Carlo and Eder, Johann and Reynolds, Mark},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/opus/volltexte/2021/14778},
URN = {urn:nbn:de:0030-drops-147785},
doi = {10.4230/LIPIcs.TIME.2021.2},
annote = {Keywords: Model-based storage, approximate query processing, time series management, extreme-scale data}
}
Keywords: |
|
Model-based storage, approximate query processing, time series management, extreme-scale data |
Collection: |
|
28th International Symposium on Temporal Representation and Reasoning (TIME 2021) |
Issue Date: |
|
2021 |
Date of publication: |
|
16.09.2021 |