License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.SoCG.2016.65
URN: urn:nbn:de:0030-drops-59577
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2016/5957/
Go to the corresponding LIPIcs Volume Portal


Bendich, Paul ; Gasparovic, Ellen ; Harer, John ; Tralie, Christopher

Geometric Models for Musical Audio Data

pdf-format:
LIPIcs-SoCG-2016-65.pdf (0.9 MB)


Abstract

We study the geometry of sliding window embeddings of audio features that summarize perceptual information about audio, including its pitch and timbre. These embeddings can be viewed as point clouds in high dimensions, and we add structure to the point clouds using a cover tree with adaptive thresholds based on multi-scale local principal component analysis to automatically assign points to clusters. We connect neighboring clusters in a scaffolding graph, and we use knowledge of stratified space structure to refine our estimates of dimension in each cluster, demonstrating in our music applications that choruses and verses have higher dimensional structure, while transitions between them are lower dimensional. We showcase our technique with an interactive web-based application powered by Javascript and WebGL which plays music synchronized with a principal component analysis embedding of the point cloud down to 3D. We also render the clusters and the scaffolding on top of this projection to visualize the transitions between different sections of the music.

BibTeX - Entry

@InProceedings{bendich_et_al:LIPIcs:2016:5957,
  author =	{Paul Bendich and Ellen Gasparovic and John Harer and Christopher Tralie},
  title =	{{Geometric Models for Musical Audio Data}},
  booktitle =	{32nd International Symposium on Computational Geometry (SoCG 2016)},
  pages =	{65:1--65:5},
  series =	{Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN =	{978-3-95977-009-5},
  ISSN =	{1868-8969},
  year =	{2016},
  volume =	{51},
  editor =	{S{\'a}ndor Fekete and Anna Lubiw},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2016/5957},
  URN =		{urn:nbn:de:0030-drops-59577},
  doi =		{10.4230/LIPIcs.SoCG.2016.65},
  annote =	{Keywords: Geometric Models, Audio Analysis, High Dimensional Data Analysis, Stratified Space Models}
}

Keywords: Geometric Models, Audio Analysis, High Dimensional Data Analysis, Stratified Space Models
Collection: 32nd International Symposium on Computational Geometry (SoCG 2016)
Issue Date: 2016
Date of publication: 10.06.2016


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI