License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.SoCG.2016.34
URN: urn:nbn:de:0030-drops-59264
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2016/5926/
Ding, Hu ;
Gao, Jing ;
Xu, Jinhui
Finding Global Optimum for Truth Discovery: Entropy Based Geometric Variance
Abstract
Truth Discovery is an important problem arising in data analytics related fields such as data mining, database, and big data. It concerns about finding the most trustworthy information from a dataset acquired from a number of unreliable sources. Due to its importance, the problem has been extensively studied in recent years and a number techniques have already been proposed. However, all of them are of heuristic nature and do not have any quality guarantee. In this paper, we formulate the problem as a high dimensional geometric optimization problem, called Entropy based Geometric Variance. Relying on a number of novel geometric techniques (such as Log-Partition and Modified Simplex Lemma), we further discover new insights to this problem. We show, for the first time, that the truth discovery problem can be solved with guaranteed quality of solution. Particularly, we show that it is possible to achieve a (1+eps)-approximation within nearly linear time under some reasonable assumptions. We expect that our algorithm will be useful for other data related applications.
BibTeX - Entry
@InProceedings{ding_et_al:LIPIcs:2016:5926,
author = {Hu Ding and Jing Gao and Jinhui Xu},
title = {{Finding Global Optimum for Truth Discovery: Entropy Based Geometric Variance}},
booktitle = {32nd International Symposium on Computational Geometry (SoCG 2016)},
pages = {34:1--34:16},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-009-5},
ISSN = {1868-8969},
year = {2016},
volume = {51},
editor = {S{\'a}ndor Fekete and Anna Lubiw},
publisher = {Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
address = {Dagstuhl, Germany},
URL = {http://drops.dagstuhl.de/opus/volltexte/2016/5926},
URN = {urn:nbn:de:0030-drops-59264},
doi = {10.4230/LIPIcs.SoCG.2016.34},
annote = {Keywords: geometric optimization, data mining, high dimension, entropy}
}
Keywords: |
|
geometric optimization, data mining, high dimension, entropy |
Collection: |
|
32nd International Symposium on Computational Geometry (SoCG 2016) |
Issue Date: |
|
2016 |
Date of publication: |
|
10.06.2016 |