License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.ICLP.2017.1
URN: urn:nbn:de:0030-drops-84629
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2018/8462/
Go to the corresponding OASIcs Volume Portal


Adrian, Weronika T. ; Manna, Marco ; Leone, Nicola ; Amendola, Giovanni ; Adrian, Marek

Entity set expansion from the Web via ASP

pdf-format:
OASIcs-ICLP-2017-1.pdf (0.3 MB)


Abstract

Knowledge on the Web in a large part is stored in various semantic resources that formalize, represent and organize it differently.
Combining information from several sources can improve results of tasks such as recognizing similarities among objects.
In this paper, we propose a logic-based method for the problem of entity set expansion (ESE), i.e. extending a list of named entities given a set of seeds.
This problem has relevant applications in the Information Extraction domain, specifically in automatic lexicon generation for dictionary-based annotating tools.
Contrary to typical approaches in natural languages processing, based on co-occurrence statistics of words, we determine the common category of the seeds by analyzing the semantic relations of the objects the words represent.
To do it, we integrate information from selected Web resources.
We introduce a notion of an entity network that uniformly represents the combined knowledge and allow to reason over it.
We show how to use the network to disambiguate word senses by relying on a concept of optimal common ancestor
and how to discover similarities between two entities.
Finally, we show how to expand a set of entities,
by using answer set programming with external predicates.

BibTeX - Entry

@InProceedings{adrian_et_al:OASIcs:2018:8462,
  author =	{Weronika T. Adrian and Marco Manna and Nicola Leone and Giovanni Amendola and Marek Adrian},
  title =	{{Entity set expansion from the Web via ASP}},
  booktitle =	{Technical Communications of the 33rd International Conference on Logic Programming (ICLP 2017)},
  pages =	{1:1--1:5},
  series =	{OpenAccess Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-058-3},
  ISSN =	{2190-6807},
  year =	{2018},
  volume =	{58},
  editor =	{Ricardo Rocha and Tran Cao Son and Christopher Mears and Neda Saeedloei},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2018/8462},
  URN =		{urn:nbn:de:0030-drops-84629},
  doi =		{10.4230/OASIcs.ICLP.2017.1},
  annote =	{Keywords: answer set programming, entity set expansion, information extraction, natural language processing, word sense disambiguation}
}

Keywords: answer set programming, entity set expansion, information extraction, natural language processing, word sense disambiguation
Collection: Technical Communications of the 33rd International Conference on Logic Programming (ICLP 2017)
Issue Date: 2018
Date of publication: 14.02.2018


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI