License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.SLATE.2017.21
URN: urn:nbn:de:0030-drops-79541
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2017/7954/
Pascoal, Rui ;
Ribeiro, Ricardo ;
Batista, Fernando ;
de Almeida, Ana
Adapting Speech Recognition in Augmented Reality for Mobile Devices in Outdoor Environments
Abstract
This paper describes the process of integrating automatic speech recognition (ASR) into a mobile application and explores the benefits and challenges of integrating speech with augmented reality (AR) in outdoor environments. The augmented reality allows end-users to interact with the information displayed and perform tasks, while increasing the user’s perception about the real world by adding virtual information to it. Speech is the most natural way of communication: it allows hands-free interaction and may allow end-users to quickly and easily access a range of features available. Speech recognition technology is often available in most of the current mobile devices, but it often uses Internet to receive the corresponding transcript from remote servers, e.g., Google speech recognition. However, in some outdoor environments, Internet is not always available or may be offered at poor quality. We integrated an off-line automatic speech recognition module into an AR application for outdoor usage that does not require Internet. Currently, speech interaction is used within the application to access five different features, namely: to take a photo, shoot a film, communicate, messaging related tasks, and to request information, either geographic, biometric, or climatic. The application makes available solutions to manage and interact with the mobile device, offering good usability. We have compared the online and off-line speech recognition systems in order to assess their adequacy to the tasks. Both systems were tested under different conditions, commonly found in outdoor environments, such as: Internet access quality, presence of noise, and distractions.
BibTeX - Entry
@InProceedings{pascoal_et_al:OASIcs:2017:7954,
author = {Rui Pascoal and Ricardo Ribeiro and Fernando Batista and Ana de Almeida},
title = {{Adapting Speech Recognition in Augmented Reality for Mobile Devices in Outdoor Environments}},
booktitle = {6th Symposium on Languages, Applications and Technologies (SLATE 2017)},
pages = {21:1--21:14},
series = {OpenAccess Series in Informatics (OASIcs)},
ISBN = {978-3-95977-056-9},
ISSN = {2190-6807},
year = {2017},
volume = {56},
editor = {Ricardo Queir{\'o}s and M{\'a}rio Pinto and Alberto Sim{\~o}es and Jos{\'e} Paulo Leal and Maria Jo{\~a}o Varanda},
publisher = {Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
address = {Dagstuhl, Germany},
URL = {http://drops.dagstuhl.de/opus/volltexte/2017/7954},
URN = {urn:nbn:de:0030-drops-79541},
doi = {10.4230/OASIcs.SLATE.2017.21},
annote = {Keywords: Speech Recognition, Natural Language Processing, Sphinx for Mobile Devices, Augmented Reality, Outdoor Environments}
}
Keywords: |
|
Speech Recognition, Natural Language Processing, Sphinx for Mobile Devices, Augmented Reality, Outdoor Environments |
Collection: |
|
6th Symposium on Languages, Applications and Technologies (SLATE 2017) |
Issue Date: |
|
2017 |
Date of publication: |
|
04.10.2017 |