License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.SEA.2021.14
URN: urn:nbn:de:0030-drops-137861
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2021/13786/
Chen, Samantha ;
Wang, Yusu
Approximation Algorithms for 1-Wasserstein Distance Between Persistence Diagrams
Abstract
Recent years have witnessed a tremendous growth using topological summaries, especially the persistence diagrams (encoding the so-called persistent homology) for analyzing complex shapes. Intuitively, persistent homology maps a potentially complex input object (be it a graph, an image, or a point set and so on) to a unified type of feature summary, called the persistence diagrams. One can then carry out downstream data analysis tasks using such persistence diagram representations. A key problem is to compute the distance between two persistence diagrams efficiently. In particular, a persistence diagram is essentially a multiset of points in the plane, and one popular distance is the so-called 1-Wasserstein distance between persistence diagrams. In this paper, we present two algorithms to approximate the 1-Wasserstein distance for persistence diagrams in near-linear time. These algorithms primarily follow the same ideas as two existing algorithms to approximate optimal transport between two finite point-sets in Euclidean spaces via randomly shifted quadtrees. We show how these algorithms can be effectively adapted for the case of persistence diagrams. Our algorithms are much more efficient than previous exact and approximate algorithms, both in theory and in practice, and we demonstrate its efficiency via extensive experiments. They are conceptually simple and easy to implement, and the code is publicly available in github.
BibTeX - Entry
@InProceedings{chen_et_al:LIPIcs.SEA.2021.14,
author = {Chen, Samantha and Wang, Yusu},
title = {{Approximation Algorithms for 1-Wasserstein Distance Between Persistence Diagrams}},
booktitle = {19th International Symposium on Experimental Algorithms (SEA 2021)},
pages = {14:1--14:19},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-185-6},
ISSN = {1868-8969},
year = {2021},
volume = {190},
editor = {Coudert, David and Natale, Emanuele},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/opus/volltexte/2021/13786},
URN = {urn:nbn:de:0030-drops-137861},
doi = {10.4230/LIPIcs.SEA.2021.14},
annote = {Keywords: persistence diagrams, approximation algorithms, Wasserstein distance, optimal transport}
}