License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.LDK.2019.15
URN: urn:nbn:de:0030-drops-103791
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2019/10379/
Go to the corresponding OASIcs Volume Portal


Yaman, Beyza ; Pasin, Michele ; Freudenberg, Markus

Interlinking SciGraph and DBpedia Datasets Using Link Discovery and Named Entity Recognition Techniques

pdf-format:
OASIcs-LDK-2019-15.pdf (0.8 MB)


Abstract

In recent years we have seen a proliferation of Linked Open Data (LOD) compliant datasets becoming available on the web, leading to an increased number of opportunities for data consumers to build smarter applications which integrate data coming from disparate sources. However, often the integration is not easily achievable since it requires discovering and expressing associations across heterogeneous data sets. The goal of this work is to increase the discoverability and reusability of the scholarly data by integrating them to highly interlinked datasets in the LOD cloud. In order to do so we applied techniques that a) improve the identity resolution across these two sources using Link Discovery for the structured data (i.e. by annotating Springer Nature (SN) SciGraph entities with links to DBpedia entities), and b) enriching SN SciGraph unstructured text content (document abstracts) with links to DBpedia entities using Named Entity Recognition (NER). We published the results of this work using standard vocabularies and provided an interactive exploration tool which presents the discovered links w.r.t. the breadth and depth of the DBpedia classes.

BibTeX - Entry

@InProceedings{yaman_et_al:OASIcs:2019:10379,
  author =	{Beyza Yaman and Michele Pasin and Markus Freudenberg},
  title =	{{Interlinking SciGraph and DBpedia Datasets Using Link Discovery and Named Entity Recognition Techniques}},
  booktitle =	{2nd Conference on Language, Data and Knowledge (LDK 2019)},
  pages =	{15:1--15:8},
  series =	{OpenAccess Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-105-4},
  ISSN =	{2190-6807},
  year =	{2019},
  volume =	{70},
  editor =	{Maria Eskevich and Gerard de Melo and Christian F{\"a}th and John P. McCrae and Paul Buitelaar and Christian Chiarcos and Bettina Klimek and Milan Dojchinovski},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2019/10379},
  URN =		{urn:nbn:de:0030-drops-103791},
  doi =		{10.4230/OASIcs.LDK.2019.15},
  annote =	{Keywords: Linked Data, Named Entity Recognition, Link Discovery, Interlinking}
}

Keywords: Linked Data, Named Entity Recognition, Link Discovery, Interlinking
Collection: 2nd Conference on Language, Data and Knowledge (LDK 2019)
Issue Date: 2019
Date of publication: 16.05.2019


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI