License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.SLATE.2018.17
URN: urn:nbn:de:0030-drops-92755
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2018/9275/
Go to the corresponding OASIcs Volume Portal


Silva, Sara ; Ribeiro, Ricardo ; Pereira, Rubén

Less is more in incident categorization (Short Paper)

pdf-format:
OASIcs-SLATE-2018-17.pdf (0.3 MB)


Abstract

The IT incident management process requires a correct categorization to attribute incident tickets to the right resolution group and obtain as quickly as possible an operational system, impacting the minimum as possible the business and costumers. In this work, we introduce automatic text classification, demonstrating the application of several natural language processing techniques and analyzing the impact of each one on a real incident tickets dataset. The techniques that we explore in the pre-processing of the text that describes an incident are the following: tokenization, stemming, eliminating stop-words, named-entity recognition, and TF xIDF-based document representation. Finally, to build the model and observe the results after applying the previous techniques, we use two machine learning algorithms: Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). Two important findings result from this study: a shorter description of an incident is better than a full description of an incident; and, pre-processing has little impact on incident categorization, mainly due the specific vocabulary used in this type of text.

BibTeX - Entry

@InProceedings{silva_et_al:OASIcs:2018:9275,
  author =	{Sara Silva and Ricardo Ribeiro and Rub{\'e}n Pereira},
  title =	{{Less is more in incident categorization (Short Paper)}},
  booktitle =	{7th Symposium on Languages, Applications and Technologies  (SLATE 2018)},
  pages =	{17:1--17:7},
  series =	{OpenAccess Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-072-9},
  ISSN =	{2190-6807},
  year =	{2018},
  volume =	{62},
  editor =	{Pedro Rangel Henriques and Jos{\'e} Paulo Leal and Ant{\'o}nio Menezes Leit{\~a}o and Xavier G{\'o}mez Guinovart},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2018/9275},
  URN =		{urn:nbn:de:0030-drops-92755},
  doi =		{10.4230/OASIcs.SLATE.2018.17},
  annote =	{Keywords: machine learning, automated incident categorization, SVM, incident management, natural language}
}

Keywords: machine learning, automated incident categorization, SVM, incident management, natural language
Collection: 7th Symposium on Languages, Applications and Technologies (SLATE 2018)
Issue Date: 2018
Date of publication: 13.07.2018


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI