License: Creative Commons Attribution 4.0 International license (CC BY 4.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.LDK.2021.6
URN: urn:nbn:de:0030-drops-145424
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2021/14542/
Go to the corresponding OASIcs Volume Portal


Weber, Tobias

Mind the Gap: Language Data, Their Producers, and the Scientific Process (Crazy New Idea)

pdf-format:
OASIcs-LDK-2021-6.pdf (0.5 MB)


Abstract

This paper discusses the role of low-resource languages in NLP through the lens of different stakeholders. It argues that the current "consumerist approach" to language data reinforces a vicious circle which increases the technological exclusion of minority communities. Researchers' decisions directly affect these processes to the detriment of minorities and practitioners engaging in language work in these communities. In line with the conference topic, the paper concludes with strategies and prerequisites for creating a positive feedback loop in our research benefiting language work within the next decade.

BibTeX - Entry

@InProceedings{weber:OASIcs.LDK.2021.6,
  author =	{Weber, Tobias},
  title =	{{Mind the Gap: Language Data, Their Producers, and the Scientific Process}},
  booktitle =	{3rd Conference on Language, Data and Knowledge (LDK 2021)},
  pages =	{6:1--6:9},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-199-3},
  ISSN =	{2190-6807},
  year =	{2021},
  volume =	{93},
  editor =	{Gromann, Dagmar and S\'{e}rasset, Gilles and Declerck, Thierry and McCrae, John P. and Gracia, Jorge and Bosque-Gil, Julia and Bobillo, Fernando and Heinisch, Barbara},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2021/14542},
  URN =		{urn:nbn:de:0030-drops-145424},
  doi =		{10.4230/OASIcs.LDK.2021.6},
  annote =	{Keywords: minority languages, data integration, sociology of technology, documentary linguistics, exclusion}
}

Keywords: minority languages, data integration, sociology of technology, documentary linguistics, exclusion
Collection: 3rd Conference on Language, Data and Knowledge (LDK 2021)
Issue Date: 2021
Date of publication: 30.08.2021


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI