License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.WABI.2022.15
URN: urn:nbn:de:0030-drops-170494
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2022/17049/
Go to the corresponding LIPIcs Volume Portal


Wei, Wei ; Koslicki, David

WGSUniFrac: Applying UniFrac Metric to Whole Genome Shotgun Data

pdf-format:
LIPIcs-WABI-2022-15.pdf (2 MB)


Abstract

The UniFrac metric has proven useful in revealing diversity across metagenomic communities. Due to the phylogeny-based nature of this measurement, UniFrac has historically only been applied to 16S rRNA data. Simultaneously, Whole Genome Shotgun (WGS) metagenomics has been increasingly widely employed and proven to provide more information than 16S data, but a UniFrac-like diversity metric suitable for WGS data has not previously been developed. The main obstacle for UniFrac to be applied directly to WGS data is the absence of phylogenetic distances in the taxonomic relationship derived from WGS data. In this study, we demonstrate a method to overcome this intrinsic difference and compute the UniFrac metric on WGS data by assigning branch lengths to the taxonomic tree obtained from input taxonomic profiles. We conduct a series of experiments to demonstrate that this WGSUniFrac method is comparably robust to traditional 16S UniFrac and is not highly sensitive to branch lengths assignments, be they data-derived or model-prescribed.

BibTeX - Entry

@InProceedings{wei_et_al:LIPIcs.WABI.2022.15,
  author =	{Wei, Wei and Koslicki, David},
  title =	{{WGSUniFrac: Applying UniFrac Metric to Whole Genome Shotgun Data}},
  booktitle =	{22nd International Workshop on Algorithms in Bioinformatics (WABI 2022)},
  pages =	{15:1--15:22},
  series =	{Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN =	{978-3-95977-243-3},
  ISSN =	{1868-8969},
  year =	{2022},
  volume =	{242},
  editor =	{Boucher, Christina and Rahmann, Sven},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2022/17049},
  URN =		{urn:nbn:de:0030-drops-170494},
  doi =		{10.4230/LIPIcs.WABI.2022.15},
  annote =	{Keywords: UniFrac, beta-diversity, Whole Genome Shotgun, microbial community similarity}
}

Keywords: UniFrac, beta-diversity, Whole Genome Shotgun, microbial community similarity
Collection: 22nd International Workshop on Algorithms in Bioinformatics (WABI 2022)
Issue Date: 2022
Date of publication: 26.08.2022
Supplementary Material: Software (Prototype of WGSUniFrac): https://github.com/KoslickiLab/WGSUniFrac archived at: https://archive.softwareheritage.org/swh:1:dir:d4a54046a885b69bdfdd5ca37d336ff7e51eace2
Software (to reproduce results of this paper): https://github.com/KoslickiLab/WGSUniFrac-reproducibles archived at: https://archive.softwareheritage.org/swh:1:dir:16f79da3471f763bd5649d1d40467ea47f3b0a9f


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI