Please use this identifier to cite or link to this item: doi:10.22028/D291-44285
Title: Towards efficient human-machine collaboration: effects of gaze-driven feedback and engagement on performance
Author(s): Mitev, Nikolina
Renner, Patrick
Pfeiffer, Thies
Staudte, Maria
Language: English
Title: Cognitive research: principles and implications
Volume: 3
Issue: 1
Publisher/Platform: Springer
Year of Publication: 2018
Free key words: Human–computer interaction
Natural language generation
Listener gaze
Referential success
Multimodal systems
DDC notations: 400 Language, linguistics
Publikation type: Journal Article
Abstract: Referential success is crucial for collaborative task-solving in shared environments. In face-to-face interactions, humans, therefore, exploit speech, gesture, and gaze to identify a specific object. We investigate if and how the gaze behavior of a human interaction partner can be used by a gaze-aware assistance system to improve referential success. Specifically, our system describes objects in the real world to a human listener using on-the-fly speech generation. It continuously interprets listener gaze and implements alternative strategies to react to this implicit feedback. We used this system to investigate an optimal strategy for task performance: providing an unambiguous, longer instruction right from the beginning, or starting with a shorter, yet ambiguous instruction. Further, the system provides gaze-driven feedback, which could be either underspecified ("No, not that one!") or contrastive ("Further left!"). As expected, our results show that ambiguous instructions followed by underspecified feedback are not beneficial for task performance, whereas contrastive feedback results in faster interactions. Interestingly, this approach even outperforms unambiguous instructions (manipulation between subjects). However, when the system alternates between underspecified and contrastive feedback to initially ambiguous descriptions in an interleaved manner (within subjects), task performance is similar for both approaches. This suggests that listeners engage more intensely with the system when they can expect it to be cooperative. This, rather than the actual informativity of the spoken feedback, may determine the efficiency of information uptake and performance.
DOI of the first publication: 10.1186/s41235-018-0148-x
URL of the first publication: https://cognitiveresearchjournal.springeropen.com/articles/10.1186/s41235-018-0148-x
Link to this record: urn:nbn:de:bsz:291--ds-442858
hdl:20.500.11880/39618
http://dx.doi.org/10.22028/D291-44285
ISSN: 2365-7464
Date of registration: 10-Feb-2025
Faculty: P - Philosophische Fakultät
Department: P - Sprachwissenschaft und Sprachtechnologie
Professorship: P - Keiner Professur zugeordnet
Collections:SciDok - Der Wissenschaftsserver der Universität des Saarlandes

Files for this record:
File Description SizeFormat 
s41235-018-0148-x.pdf4,11 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons