License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.WABI.2018.19
URN: urn:nbn:de:0030-drops-93218
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2018/9321/
Elworth, Ryan A. Leo ;
Allen, Chabrielle ;
Benedict, Travis ;
Dulworth, Peter ;
Nakhleh, Luay
DGEN: A Test Statistic for Detection of General Introgression Scenarios
Abstract
When two species hybridize, one outcome is the integration of genetic material from one species into the genome of the other, a process known as introgression. Detecting introgression in genomic data is a very important question in evolutionary biology. However, given that hybridization occurs between closely related species, a complicating factor for introgression detection is the presence of incomplete lineage sorting, or ILS. The D-statistic, famously referred to as the "ABBA-BABA" test, was proposed for introgression detection in the presence of ILS in data sets that consist of four genomes. More recently, D_FOIL - a set of statistics - was introduced to extend the D-statistic to data sets of five genomes.
The major contribution of this paper is demonstrating that the invariants underlying both the D-statistic and D_FOIL can be derived automatically from the probability mass functions of gene tree topologies under the null species tree model and alternative phylogenetic network model. Computational requirements aside, this automatic derivation provides a way to generalize these statistics to data sets of any size and with any scenarios of introgression. We demonstrate the accuracy of the general statistic, which we call D_GEN, on simulated data sets with varying rates of introgression, and apply it to an empirical data set of mosquito genomes.
We have implemented D_GEN and made it available, both as a graphical user interface tool and as a command-line tool, as part of the freely available, open-source software package ALPHA (https://github.com/chilleo/ALPHA).
BibTeX - Entry
@InProceedings{elworth_et_al:LIPIcs:2018:9321,
author = {Ryan A. Leo Elworth and Chabrielle Allen and Travis Benedict and Peter Dulworth and Luay Nakhleh},
title = {{DGEN: A Test Statistic for Detection of General Introgression Scenarios}},
booktitle = {18th International Workshop on Algorithms in Bioinformatics (WABI 2018)},
pages = {19:1--19:13},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-082-8},
ISSN = {1868-8969},
year = {2018},
volume = {113},
editor = {Laxmi Parida and Esko Ukkonen},
publisher = {Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
address = {Dagstuhl, Germany},
URL = {http://drops.dagstuhl.de/opus/volltexte/2018/9321},
URN = {urn:nbn:de:0030-drops-93218},
doi = {10.4230/LIPIcs.WABI.2018.19},
annote = {Keywords: Introgression, genealogies, phylogenetic networks}
}
Keywords: |
|
Introgression, genealogies, phylogenetic networks |
Collection: |
|
18th International Workshop on Algorithms in Bioinformatics (WABI 2018) |
Issue Date: |
|
2018 |
Date of publication: |
|
02.08.2018 |