License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/DagSemProc.08421.10
URN: urn:nbn:de:0030-drops-19344
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2009/1934/


Seidl, Thomas ; Müller, Emmanuel ; Assent, Ira ; Steinhausen, Uwe

Outlier detection and ranking based on subspace clustering



Abstract

Detecting outliers is an important task in many applications,
including fraud detection and consistency validation in real-world
data. Particularly in the presence of uncertain or imprecise data,
similar objects regularly deviate in their attribute values. The notion
of an outlier therefore has to be defined carefully. When outlier
detection is considered a task complementary to clustering, a binary
decision on whether or not an object is an outlier seems the obvious
choice. For high-dimensional data, however, objects may belong
to different clusters in different subspaces, so more fine-grained
concepts for defining outliers are needed. With our new OutRank approach,
we address outlier detection in heterogeneous high-dimensional data and
propose a novel scoring function that provides a consistent model for
ranking outliers in the presence of different attribute types. Preliminary
experiments demonstrate the potential for successful detection and
reasonable ranking of outliers in high-dimensional data sets.

BibTeX - Entry

@InProceedings{seidl_et_al:DagSemProc.08421.10,
  author =	{Seidl, Thomas and M\"{u}ller, Emmanuel and Assent, Ira and Steinhausen, Uwe},
  title =	{{Outlier detection and ranking based on subspace clustering}},
  booktitle =	{Uncertainty Management in Information Systems},
  pages =	{1--4},
  series =	{Dagstuhl Seminar Proceedings (DagSemProc)},
  ISSN =	{1862-4405},
  year =	{2009},
  volume =	{8421},
  editor =	{Christoph Koch and Birgitta K\"{o}nig-Ries and Volker Markl and Maurice van Keulen},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2009/1934},
  URN =		{urn:nbn:de:0030-drops-19344},
  doi =		{10.4230/DagSemProc.08421.10},
  annote =	{Keywords: Outlier detection, outlier ranking, subspace clustering, data mining}
}

Keywords: Outlier detection, outlier ranking, subspace clustering, data mining
Collection: 08421 - Uncertainty Management in Information Systems
Issue Date: 2009
Date of publication: 24.03.2009

