License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.SLATE.2022.9
URN: urn:nbn:de:0030-drops-167555
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2022/16755/
Caldeira, Francisco; Nunes, Luís; Ribeiro, Ricardo
Classification of Public Administration Complaints
Abstract
Complaint management is a problem faced by many organizations that is both vital to customer image and highly dependent on human resources. This work attempts to tackle a part of the problem by classifying summaries of complaints using machine learning models in order to better redirect these to the appropriate responders. The main challenge of this task is that training datasets are often small and highly imbalanced, which can have a big impact on the performance of classification models. The dataset analyzed in this work suffers from both of these problems, being relatively small and having labels in different proportions. In this work, two different techniques are analyzed: combining classes to increase the number of elements in the new class; and providing new artificial examples for some classes via translation into other languages. The classification models explored were the following: k-NN, SVM, Naïve Bayes, boosting, and Deep Learning approaches, including transformers. The paper concludes that although, as expected, the classes with little representation are hard to classify, the techniques explored helped to boost performance, especially in the classes with a low number of elements. SVM and BERT-based models outperformed their peers.
BibTeX - Entry
@InProceedings{caldeira_et_al:OASIcs.SLATE.2022.9,
author = {Caldeira, Francisco and Nunes, Lu{\'\i}s and Ribeiro, Ricardo},
title = {{Classification of Public Administration Complaints}},
booktitle = {11th Symposium on Languages, Applications and Technologies (SLATE 2022)},
pages = {9:1--9:12},
series = {Open Access Series in Informatics (OASIcs)},
ISBN = {978-3-95977-245-7},
ISSN = {2190-6807},
year = {2022},
volume = {104},
editor = {Cordeiro, Jo\~{a}o and Pereira, Maria Jo\~{a}o and Rodrigues, Nuno F. and Pais, Sebasti\~{a}o},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/opus/volltexte/2022/16755},
URN = {urn:nbn:de:0030-drops-167555},
doi = {10.4230/OASIcs.SLATE.2022.9},
annote = {Keywords: Text Classification, Natural Language Processing, Deep Learning, BERT}
}
Keywords: Text Classification, Natural Language Processing, Deep Learning, BERT
Collection: 11th Symposium on Languages, Applications and Technologies (SLATE 2022)
Issue Date: 2022
Date of publication: 27.07.2022