License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.SLATE.2020.12
URN: urn:nbn:de:0030-drops-130252
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2020/13025/
Go to the corresponding OASIcs Volume Portal


Filipe, Soraia ; Batista, Fernando ; Ribeiro, Ricardo

Different Lexicon-Based Approaches to Emotion Identification in Portuguese Tweets (Short Paper)

pdf-format:
OASIcs-SLATE-2020-12.pdf (0.4 MB)


Abstract

This paper presents the existing literature on the identification of emotions and describes various lexica-based approaches and translation strategies to identify emotions in Portuguese tweets. A dataset of tweets was manually annotated to evaluate our classifier and also to assess the difficulty of the task. A lexicon-based approach was used in order to classify the presence or absence of eight different emotions in a tweet. Different strategies have been applied to refine and improve an existing and widely used lexicon, by means of automatic machine translation and aligned word embeddings. We tested six different classification approaches, exploring different ways of directly applying resources available for English by means of different translation strategies. The achieved results suggest that a better performance can be obtained both by improving a lexicon and by directly translating tweets into English and then applying an existing English lexicon.

BibTeX - Entry

@InProceedings{filipe_et_al:OASIcs:2020:13025,
  author =	{Soraia Filipe and Fernando Batista and Ricardo Ribeiro},
  title =	{{Different Lexicon-Based Approaches to Emotion Identification in Portuguese Tweets (Short Paper)}},
  booktitle =	{9th Symposium on Languages, Applications and Technologies (SLATE 2020)},
  pages =	{12:1--12:8},
  series =	{OpenAccess Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-165-8},
  ISSN =	{2190-6807},
  year =	{2020},
  volume =	{83},
  editor =	{Alberto Sim{\~o}es and Pedro Rangel Henriques and Ricardo Queir{\'o}s},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2020/13025},
  URN =		{urn:nbn:de:0030-drops-130252},
  doi =		{10.4230/OASIcs.SLATE.2020.12},
  annote =	{Keywords: Emotion detection, tweets, Portuguese Language, Emotion lexicon}
}

Keywords: Emotion detection, tweets, Portuguese Language, Emotion lexicon
Collection: 9th Symposium on Languages, Applications and Technologies (SLATE 2020)
Issue Date: 2020
Date of publication: 16.09.2020


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI