License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.SLATE.2014.77
URN: urn:nbn:de:0030-drops-45609
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2014/4560/
Go to the corresponding OASIcs Volume Portal


Brito, Rui ; Almeida, José João

A Workflow Description Language to Orchestrate Multi-Lingual Resources

pdf-format:
8.pdf (0.4 MB)


Abstract

Texts aligned alongside their translation, or Parallel Corpora, are a very widely used resource in Computational Linguistics. Processing these resources, however, is a very intensive, time consuming task, which makes it a suitable case study for High Performance Computing (HPC).

HPC underwent several recent changes, with the evolution of Heterogeneous Platforms, where multiple devices with different architectures are able to share workload to increase performance.

Several frameworks/toolkits have been under development, in various fields, to aid the programmer in extracting more performance from these platforms. Either by dynamically scheduling the workload across the available resources or by exploring the opportunities for parallelism. However, there is no toolkit targeted at Computational Linguistics, more specifically, Parallel Corpora processing. Parallel Corpora processing can be a very time consuming task, and the field could definitely use a toolkit which aids the programmer in achieving not only better performance, but also a convenient and expressive way of specifying tasks and their dependencies.

BibTeX - Entry

@InProceedings{brito_et_al:OASIcs:2014:4560,
  author =	{Rui Brito and Jos{\'e} Jo{\~a}o Almeida},
  title =	{{A Workflow Description Language to Orchestrate Multi-Lingual Resources}},
  booktitle =	{3rd Symposium on Languages, Applications and Technologies},
  pages =	{77--83},
  series =	{OpenAccess Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-68-2},
  ISSN =	{2190-6807},
  year =	{2014},
  volume =	{38},
  editor =	{Maria Jo{\~a}o Varanda Pereira and Jos{\'e} Paulo Leal and Alberto Sim{\~o}es},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2014/4560},
  URN =		{urn:nbn:de:0030-drops-45609},
  doi =		{10.4230/OASIcs.SLATE.2014.77},
  annote =	{Keywords: workflow, orchestration, parallelism, domain specific languages, corpora}
}

Keywords: workflow, orchestration, parallelism, domain specific languages, corpora
Collection: 3rd Symposium on Languages, Applications and Technologies
Issue Date: 2014
Date of publication: 18.06.2014


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI