License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/DFU.Vol3.11041.195
URN: urn:nbn:de:0030-drops-34737
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2012/3473/
Go to the corresponding DFU Volume Portal


Weninger, Felix ; Schuller, Björn ; Liem, Cynthia C.S. ; Kurth, Frank ; Hanjalic, Alan

Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines

pdf-format:
12.pdf (1 MB)


Abstract

The emerging field of Music Information Retrieval (MIR) has been influenced by neighboring domains in signal processing and machine learning, including automatic speech recognition, image processing and text information retrieval. In this contribution, we start with concrete examples for methodology transfer between speech and music processing, oriented on the building blocks of pattern recognition: preprocessing, feature extraction, and classification/decoding. We then assume a higher level viewpoint when describing sources of mutual inspiration derived from text and image information retrieval. We conclude that dealing with the peculiarities of music in MIR research has contributed to advancing the state-of-the-art in other fields, and that many future challenges in MIR are strikingly similar to those that other research areas have been facing.

BibTeX - Entry

@InCollection{weninger_et_al:DFU:2012:3473,
  author =	{Felix Weninger and Bj{\"o}rn Schuller and Cynthia C.S. Liem and Frank Kurth and Alan Hanjalic},
  title =	{{Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines}},
  booktitle =	{Multimodal Music Processing},
  pages =	{195--216},
  series =	{Dagstuhl Follow-Ups},
  ISBN =	{978-3-939897-37-8},
  ISSN =	{1868-8977},
  year =	{2012},
  volume =	{3},
  editor =	{Meinard M{\"u}ller and Masataka Goto and Markus Schedl},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2012/3473},
  URN =		{urn:nbn:de:0030-drops-34737},
  doi =		{10.4230/DFU.Vol3.11041.195},
  annote =	{Keywords: Feature extraction, machine learning, multimodal fusion, evaluation, human factors, cross-domain methodology transfer}
}

Keywords: Feature extraction, machine learning, multimodal fusion, evaluation, human factors, cross-domain methodology transfer
Collection: Multimodal Music Processing
Issue Date: 2012
Date of publication: 27.04.2012


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI