License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.SLATE.2020.3
URN: urn:nbn:de:0030-drops-130164
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2020/13016/


Pinto, Afonso ; Moniz, Helena ; Batista, Fernando

Detection of Emerging Words in Portuguese Tweets

pdf-format:
OASIcs-SLATE-2020-3.pdf (1 MB)


Abstract

This paper tackles the problem of detecting emerging words in a language based on social network content. It proposes an approach for detecting new words on Twitter and reports the results obtained for a collection of 8 million Portuguese tweets. The study uses geolocated tweets, collected between January 2018 and June 2019 and written in Portuguese territory. The first six months of data were used to define an initial vocabulary of known words, and the following 12 months were used to identify new words, thus testing the approach. The resulting set of words was manually analyzed, revealing a number of distinct events and suggesting that Twitter may be a valuable resource for researching neology and the dynamics of a language.
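
The two-phase split described in the abstract (a baseline period that defines the known vocabulary, followed by a period scanned for unseen words) can be illustrated with a minimal sketch. This is not the authors' implementation: the tokenizer, the removal of URLs and @mentions, the `min_count` frequency threshold, and the function names `tokenize` and `emerging_words` are all assumptions made for illustration, and the abstract does not specify any of them.

```python
import re
from collections import Counter
from datetime import date

# Assumption: URLs and @mentions are noise and are dropped before tokenizing.
NOISE_RE = re.compile(r"https?://\S+|@\w+")
# Assumption: a "word" is a run of Unicode letters (no digits or underscores).
TOKEN_RE = re.compile(r"[^\W\d_]+")


def tokenize(text):
    """Lowercase a tweet and split it into letter-only tokens."""
    cleaned = NOISE_RE.sub(" ", text)
    return [token.lower() for token in TOKEN_RE.findall(cleaned)]


def emerging_words(tweets, split_date, min_count=2):
    """Return {word: count} for words first seen on or after split_date.

    tweets     -- iterable of (date, text) pairs
    split_date -- first day of the detection period; earlier tweets
                  define the vocabulary of known words
    min_count  -- hypothetical threshold to discard one-off typos
    """
    tweets = list(tweets)

    # Phase 1: build the known vocabulary from the baseline period.
    known = set()
    for day, text in tweets:
        if day < split_date:
            known.update(tokenize(text))

    # Phase 2: count tokens in the detection period that are not known.
    candidates = Counter()
    for day, text in tweets:
        if day >= split_date:
            candidates.update(t for t in tokenize(text) if t not in known)

    return {word: n for word, n in candidates.items() if n >= min_count}
```

With the dates given in the abstract, `split_date` would be `date(2018, 7, 1)`, so that January to June 2018 forms the baseline and the following 12 months form the detection period. In practice the candidate list would still need the kind of manual analysis the abstract mentions, since a plain vocabulary difference also surfaces misspellings and proper names.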

BibTeX - Entry

@InProceedings{pinto_et_al:OASIcs:2020:13016,
  author =	{Afonso Pinto and Helena Moniz and Fernando Batista},
  title =	{{Detection of Emerging Words in Portuguese Tweets}},
  booktitle =	{9th Symposium on Languages, Applications and Technologies (SLATE 2020)},
  pages =	{3:1--3:10},
  series =	{OpenAccess Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-165-8},
  ISSN =	{2190-6807},
  year =	{2020},
  volume =	{83},
  editor =	{Alberto Sim{\~o}es and Pedro Rangel Henriques and Ricardo Queir{\'o}s},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2020/13016},
  URN =		{urn:nbn:de:0030-drops-130164},
  doi =		{10.4230/OASIcs.SLATE.2020.3},
  annote =	{Keywords: Emerging words, Twitter, Portuguese language}
}

Keywords: Emerging words, Twitter, Portuguese language
Collection: 9th Symposium on Languages, Applications and Technologies (SLATE 2020)
Issue Date: 2020
Date of publication: 16.09.2020

