License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/DagSemProc.08091.9
URN: urn:nbn:de:0030-drops-16113
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2008/1611/
Go to the corresponding Portal


Aycinena Lippow, Meg ; Kaelbling, Leslie Pack ; Lozano-Perez, Tomas

Learning Grammatical Models for Object Recognition

pdf-format:
08091.AycinenaMeg.Paper.1611.pdf (0.4 MB)


Abstract

Many object recognition systems are limited by their inability to share common parts or structure among related object classes. This capability is desirable because it allows information about parts and relationships in one object class to be generalized to other classes for which it is relevant. This ability has the potential to allow effective parameter learning from fewer examples and better generalization of the learned models to unseen instances, and it enables more efficient recognition. With this goal in mind, we have designed a representation and recognition framework that captures structural variability and shared part structure within and among object classes. The framework uses probabilistic geometric grammars (PGGs) to represent object classes recursively in terms of their parts, thereby exploiting the hierarchical and substitutive structure inherent to many types of objects. To incorporate geometric and appearance information, we extend traditional probabilistic context-free grammars to represent distributions over the relative geometric characteristics of object parts as well as the appearance of primitive parts. We describe an efficient dynamic programming algorithm for object categorization and localization in images given a PGG model. We also develop an EM algorithm to estimate the parameters of a grammar structure from training data, and a search-based structure learning approach that finds a compact grammar to explain the image data while sharing substructure among classes. Finally, we describe a set of experiments that demonstrate empirically that the system provides a performance benefit.


BibTeX - Entry

@InProceedings{aycinenalippow_et_al:DagSemProc.08091.9,
  author =	{Aycinena Lippow, Meg and Kaelbling, Leslie Pack and Lozano-Perez, Tomas},
  title =	{{Learning Grammatical Models for Object Recognition}},
  booktitle =	{Logic and Probability for Scene Interpretation},
  pages =	{1--15},
  series =	{Dagstuhl Seminar Proceedings (DagSemProc)},
  ISSN =	{1862-4405},
  year =	{2008},
  volume =	{8091},
  editor =	{Anthony G. Cohn and David C. Hogg and Ralf M\"{o}ller and Bernd Neumann},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2008/1611},
  URN =		{urn:nbn:de:0030-drops-16113},
  doi =		{10.4230/DagSemProc.08091.9},
  annote =	{Keywords: Object recognition, grammars, structure learning}
}

Keywords: Object recognition, grammars, structure learning
Collection: 08091 - Logic and Probability for Scene Interpretation
Issue Date: 2008
Date of publication: 23.10.2008


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI