License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.LDK.2021.16
URN: urn:nbn:de:0030-drops-145528
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2021/14552/


Kirillovich, Alexander ; Shaekhov, Marat ; Galieva, Alfiya ; Nevzorova, Olga ; Ilvovsky, Dmitry ; Loukachevitch, Natalia

TatWordNet: A Linguistic Linked Open Data-Integrated WordNet Resource for Tatar

pdf-format:
OASIcs-LDK-2021-16.pdf (0.7 MB)


Abstract

We present the first release of TatWordNet (http://wordnet.tatar), a wordnet resource for Tatar. TatWordNet has been constructed by combining the expand and the merge approaches. The synsets of TatWordNet have been compiled by: (i) automatic conversion of the concepts of TatThes, a socio-political thesaurus of Tatar; (ii) semi-automatic translation of synsets of RuWordNet, a wordnet resource for Russian, followed by manual verification and correction; (iii) manual translation of base RuWordNet synsets; and (iv) manual translation of all hypernyms of the previously translated RuWordNet synsets. The current version of TatWordNet contains 18,583 synsets, 36,540 lexical entries and 49,525 senses. The resource has been published to the Linguistic Linked Open Data cloud and interlinked with the Global WordNet Grid.

BibTeX - Entry

@InProceedings{kirillovich_et_al:OASIcs.LDK.2021.16,
  author =	{Kirillovich, Alexander and Shaekhov, Marat and Galieva, Alfiya and Nevzorova, Olga and Ilvovsky, Dmitry and Loukachevitch, Natalia},
  title =	{{TatWordNet: A Linguistic Linked Open Data-Integrated WordNet Resource for Tatar}},
  booktitle =	{3rd Conference on Language, Data and Knowledge (LDK 2021)},
  pages =	{16:1--16:12},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-199-3},
  ISSN =	{2190-6807},
  year =	{2021},
  volume =	{93},
  editor =	{Gromann, Dagmar and S\'{e}rasset, Gilles and Declerck, Thierry and McCrae, John P. and Gracia, Jorge and Bosque-Gil, Julia and Bobillo, Fernando and Heinisch, Barbara},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2021/14552},
  URN =		{urn:nbn:de:0030-drops-145528},
  doi =		{10.4230/OASIcs.LDK.2021.16},
  annote =	{Keywords: Linguistic Linked Open Data, WordNet, Thesaurus, Tatar language}
}

Keywords: Linguistic Linked Open Data, WordNet, Thesaurus, Tatar language
Collection: 3rd Conference on Language, Data and Knowledge (LDK 2021)
Issue Date: 2021
Date of publication: 30.08.2021
Supplementary Material: Dataset: http://wordnet.tatar/

