License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.LDK.2019.21
URN: urn:nbn:de:0030-drops-103856
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2019/10385/
Go to the corresponding OASIcs Volume Portal


Gromann, Dagmar ; Declerck, Thierry

Towards the Detection and Formal Representation of Semantic Shifts in Inflectional Morphology

pdf-format:
OASIcs-LDK-2019-21.pdf (0.4 MB)


Abstract

Semantic shifts caused by derivational morphemes is a common subject of investigation in language modeling, while inflectional morphemes are frequently portrayed as semantically more stable. This study is motivated by the previously established observation that inflectional morphemes can be just as variable as derivational ones. For instance, the English plural "-s" can turn the fabric silk into the garments of a jockey, silks. While humans know that silk in this sense has no plural, it takes more for machines to arrive at this conclusion. Frequently utilized computational language resources, such as WordNet, or models for representing computational lexicons, like OntoLex-Lemon, have no descriptive mechanism to represent such inflectional semantic shifts. To investigate this phenomenon, we extract word pairs of different grammatical number from WordNet that feature additional senses in the plural and evaluate their distribution in vector space, i.e., pre-trained word2vec and fastText embeddings. We then propose an extension of OntoLex-Lemon to accommodate this phenomenon that we call inflectional morpho-semantic variation to provide a formal representation accessible to algorithms, neural networks, and agents. While the exact scope of the problem is yet to be determined, this first dataset shows that it is not negligible.

BibTeX - Entry

@InProceedings{gromann_et_al:OASIcs:2019:10385,
  author =	{Dagmar Gromann and Thierry Declerck},
  title =	{{Towards the Detection and Formal Representation of Semantic Shifts in Inflectional Morphology}},
  booktitle =	{2nd Conference on Language, Data and Knowledge (LDK 2019)},
  pages =	{21:1--21:15},
  series =	{OpenAccess Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-105-4},
  ISSN =	{2190-6807},
  year =	{2019},
  volume =	{70},
  editor =	{Maria Eskevich and Gerard de Melo and Christian F{\"a}th and John P. McCrae and Paul Buitelaar and Christian Chiarcos and Bettina Klimek and Milan Dojchinovski},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2019/10385},
  URN =		{urn:nbn:de:0030-drops-103856},
  doi =		{10.4230/OASIcs.LDK.2019.21},
  annote =	{Keywords: Inflectional morphology, semantic shift, embeddings, formal lexical modeling}
}

Keywords: Inflectional morphology, semantic shift, embeddings, formal lexical modeling
Collection: 2nd Conference on Language, Data and Knowledge (LDK 2019)
Issue Date: 2019
Date of publication: 16.05.2019


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI