License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.GISCIENCE.2018.49
URN: urn:nbn:de:0030-drops-93778
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2018/9377/


Min, Kyunghyun ; Lee, Jungseok ; Yu, Kiyun ; Kim, Jiyoung

Geotagging Location Information Extracted from Unstructured Data (Short Paper)



Abstract

Location information is an essential element of location-based services and is used in various ways. Unstructured data contain different types of location information, but coordinate values are required to determine an exact location. On Twitter, a typical social network service (SNS) platform and a source of unstructured data, the number of geotagged tweets is low. If the location of a text can be estimated by geotagging a large volume of unstructured data, the location of an event can be estimated in real time. This is a preliminary study on extracting location information with the named entity recognizer provided by the Exobrain API and applying geotagging to unstructured data in Hangul (Korean). Instead of tweets, we used Chosun news articles, which are grammatically correct and well organized, and extracted entities in three location-related categories: "location," "organization," and "artifact." We ran the named entity recognizer on each sentence and geotagged it using combinations of the fields in each category. The results showed that 61% of the 800 test sentences contained no location-related information, which prevented geotagging. In 11.75% of the test sentences, geotagging was possible using only the location information extracted by the named entity recognizer. The remaining 27.25% of the sentences contained more than two locations from the same subcategories and hence required location estimation from candidate locations. In future research, we plan to use these results to develop a location estimation algorithm that makes use of location-related entities extracted from purely unstructured data such as SNS posts.
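The three outcome groups reported in the abstract can be illustrated with a minimal Python sketch. It does not call the Exobrain API: the `(category, subcategory, text)` tuples, the subcategory names, and the function `classify_sentence` are hypothetical stand-ins for recognizer output. The abstract does not state the exact rule for when a sentence needs location estimation, so the sketch assumes a sentence is ambiguous whenever one subcategory holds more than one entity. The second part simply converts the reported percentages into sentence counts.

```python
from collections import Counter

# The three location-related categories named in the abstract.
LOCATION_CATEGORIES = {"location", "organization", "artifact"}


def classify_sentence(entities):
    """Assign one sentence to an outcome group.

    `entities` is a list of (category, subcategory, text) tuples, a
    hypothetical stand-in for named entity recognizer output.
    """
    relevant = [e for e in entities if e[0] in LOCATION_CATEGORIES]
    if not relevant:
        # Nothing location-related was extracted, so no geotag is possible.
        return "no_location_info"
    # Count the extracted entities per (category, subcategory) pair.
    per_subcategory = Counter((cat, sub) for cat, sub, _ in relevant)
    # Assumption: several entities in one subcategory are competing candidates.
    if max(per_subcategory.values()) > 1:
        return "needs_estimation"
    return "geotaggable"


# Convert the percentages reported in the abstract into sentence counts.
TOTAL_SENTENCES = 800
REPORTED_SHARES = {
    "no_location_info": 0.61,
    "geotaggable": 0.1175,
    "needs_estimation": 0.2725,
}
sentence_counts = {
    group: round(TOTAL_SENTENCES * share)
    for group, share in REPORTED_SHARES.items()
}

if __name__ == "__main__":
    example = [("location", "city", "Seoul"), ("location", "city", "Busan")]
    print(classify_sentence(example))
    print(sentence_counts)
```

Under these assumptions, the reported shares correspond to 488, 94, and 218 of the 800 test sentences, which together account for the whole test set.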

BibTeX - Entry

@InProceedings{min_et_al:LIPIcs:2018:9377,
  author =	{Kyunghyun Min and Jungseok Lee and Kiyun Yu and Jiyoung Kim},
  title =	{{Geotagging Location Information Extracted from Unstructured Data (Short Paper)}},
  booktitle =	{10th International Conference on Geographic Information Science (GIScience 2018)},
  pages =	{49:1--49:6},
  series =	{Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN =	{978-3-95977-083-5},
  ISSN =	{1868-8969},
  year =	{2018},
  volume =	{114},
  editor =	{Stephan Winter and Amy Griffin and Monika Sester},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2018/9377},
  URN =		{urn:nbn:de:0030-drops-93778},
  doi =		{10.4230/LIPIcs.GISCIENCE.2018.49},
  annote =	{Keywords: Location Estimation, Information Extraction, Geo-Tagging, Location Information, Unstructured Data}
}

Keywords: Location Estimation, Information Extraction, Geo-Tagging, Location Information, Unstructured Data
Collection: 10th International Conference on Geographic Information Science (GIScience 2018)
Issue Date: 2018
Date of publication: 02.08.2018

