License:

Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.WABI.2019.21
URN: urn:nbn:de:0030-drops-110510
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2019/11051/

Cao, Zhen ; Zhu, Jiafan ; Nakhleh, Luay

Empirical Performance of Tree-Based Inference of Phylogenetic Networks

pdf-format:

LIPIcs-WABI-2019-21.pdf (1 MB)

Abstract

Phylogenetic networks extend the phylogenetic tree structure and allow for modeling vertical and horizontal evolution in a single framework. Statistical inference of phylogenetic networks is computationally prohibitive and is currently limited to small networks. One approach that could significantly speed up exploration of the phylogenetic network space is to first infer an evolutionary tree of the species under consideration and then augment the tree into a network by adding a set of "horizontal" edges to better fit the data.
In this paper, we study the performance of such an approach on networks generated under a birth-hybridization model and explore its feasibility as an alternative to approaches that search the phylogenetic network space directly (without relying on a fixed underlying tree). We find that the concatenation method does poorly at obtaining a "backbone" tree that could be augmented into the correct network, whereas the popular species tree inference method ASTRAL does significantly better at this task. We then evaluate the tree-to-network augmentation phase under the minimizing deep coalescence and pseudo-likelihood criteria. We find that even though this approach is much faster than a direct search of the network space, its accuracy is much poorer, even when the backbone tree is a good starting tree.
Our results show that tree-based inference of phylogenetic networks could yield very poor results. As exploration of the network space directly in search of maximum likelihood estimates or a representative sample of the posterior is very expensive, significant improvements to the computational complexity of phylogenetic network inference are imperative if analyses of large data sets are to be performed. We show that a recently developed divide-and-conquer approach significantly outperforms tree-based inference in terms of accuracy, albeit still at a higher computational cost.
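
To make the tree-to-network augmentation step concrete, the following minimal Python sketch shows what adding one "horizontal" edge to a backbone tree could look like as a data-structure operation. It is an illustration only, not the authors' implementation: the child-to-parent dictionary encoding, the function names (ancestors, add_horizontal_edge, candidate_edges), and the node labels are all assumptions made for this example, and no scoring criterion (minimizing deep coalescence or pseudo-likelihood) is applied.

```python
def ancestors(parent, node):
    """Return the set of strict ancestors of `node` in the backbone tree."""
    seen = set()
    while parent[node] is not None:
        node = parent[node]
        seen.add(node)
    return seen


def add_horizontal_edge(parent, src, dst):
    """Return a network obtained by adding one reticulation edge.

    `parent` maps each tree node to its parent (None for the root).
    The new edge runs from the branch above `src` to the branch above `dst`.
    The result maps each node to a list of parents; the new reticulation
    node is the only node with two parents.
    """
    if parent[src] is None or parent[dst] is None:
        raise ValueError("the root has no incoming branch to attach to")
    if src == dst or dst in ancestors(parent, src):
        raise ValueError("this edge would create a directed cycle")

    net = {n: ([] if p is None else [p]) for n, p in parent.items()}
    tail = f"tail({src})"  # new node subdividing the branch above src
    head = f"head({dst})"  # new reticulation node on the branch above dst
    net[tail] = [parent[src]]
    net[src] = [tail]
    net[head] = [parent[dst], tail]  # tree parent plus horizontal parent
    net[dst] = [head]
    return net


def candidate_edges(parent):
    """List every (src, dst) branch pair that keeps the network acyclic."""
    nodes = [n for n, p in parent.items() if p is not None]
    return [(s, d) for s in nodes for d in nodes
            if s != d and d not in ancestors(parent, s)]


# Hypothetical backbone tree ((A,B)x,C)r, encoded as child -> parent.
backbone = {"r": None, "x": "r", "C": "r", "A": "x", "B": "x"}
network = add_horizontal_edge(backbone, "C", "B")
```

A tree-based search would score each network produced from candidate_edges under some criterion and keep the best one, repeating for further reticulations. Every candidate is constrained to contain the backbone tree, which is the limitation the abstract evaluates.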

BibTeX - Entry

@InProceedings{cao_et_al:LIPIcs:2019:11051,
  author =	{Zhen Cao and Jiafan Zhu and Luay Nakhleh},
  title =	{{Empirical Performance of Tree-Based Inference of Phylogenetic Networks}},
  booktitle =	{19th International Workshop on Algorithms in Bioinformatics (WABI 2019)},
  pages =	{21:1--21:13},
  series =	{Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN =	{978-3-95977-123-8},
  ISSN =	{1868-8969},
  year =	{2019},
  volume =	{143},
  editor =	{Katharina T. Huber and Dan Gusfield},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2019/11051},
  URN =		{urn:nbn:de:0030-drops-110510},
  doi =		{10.4230/LIPIcs.WABI.2019.21},
  annote =	{Keywords: Phylogenetic networks, species tree, tree-based networks, multi-locus phylogeny}
}

Keywords: Phylogenetic networks, species tree, tree-based networks, multi-locus phylogeny

Collection: 19th International Workshop on Algorithms in Bioinformatics (WABI 2019)

Issue Date: 2019

Date of publication: 03.09.2019
