Towards efficient human-machine collaboration: effects of gaze-driven feedback and engagement on performance

Mitev, Nikolina; Renner, Patrick; Pfeiffer, Thies; Staudte, Maria

Please use this identifier to cite or link to this item: doi:10.22028/D291-44285

Title:	Towards efficient human-machine collaboration: effects of gaze-driven feedback and engagement on performance
Author(s):	Mitev, Nikolina Renner, Patrick Pfeiffer, Thies Staudte, Maria
Language:	English
Title:	Cognitive research: principles and implications
Volume:	3
Issue:	1
Publisher/Platform:	Springer
Year of Publication:	2018
Free key words:	Human–computer interaction Natural language generation Listener gaze Referential success Multimodal systems
DDC notations:	400 Language, linguistics
Publikation type:	Journal Article
Abstract:	Referential success is crucial for collaborative task-solving in shared environments. In face-to-face interactions, humans, therefore, exploit speech, gesture, and gaze to identify a specific object. We investigate if and how the gaze behavior of a human interaction partner can be used by a gaze-aware assistance system to improve referential success. Specifically, our system describes objects in the real world to a human listener using on-the-fly speech generation. It continuously interprets listener gaze and implements alternative strategies to react to this implicit feedback. We used this system to investigate an optimal strategy for task performance: providing an unambiguous, longer instruction right from the beginning, or starting with a shorter, yet ambiguous instruction. Further, the system provides gaze-driven feedback, which could be either underspecified ("No, not that one!") or contrastive ("Further left!"). As expected, our results show that ambiguous instructions followed by underspecified feedback are not beneficial for task performance, whereas contrastive feedback results in faster interactions. Interestingly, this approach even outperforms unambiguous instructions (manipulation between subjects). However, when the system alternates between underspecified and contrastive feedback to initially ambiguous descriptions in an interleaved manner (within subjects), task performance is similar for both approaches. This suggests that listeners engage more intensely with the system when they can expect it to be cooperative. This, rather than the actual informativity of the spoken feedback, may determine the efficiency of information uptake and performance.
DOI of the first publication:	10.1186/s41235-018-0148-x
URL of the first publication:	https://cognitiveresearchjournal.springeropen.com/articles/10.1186/s41235-018-0148-x
Link to this record:	urn:nbn:de:bsz:291--ds-442858 hdl:20.500.11880/39618 http://dx.doi.org/10.22028/D291-44285
ISSN:	2365-7464
Date of registration:	10-Feb-2025
Faculty:	P - Philosophische Fakultät
Department:	P - Sprachwissenschaft und Sprachtechnologie
Professorship:	P - Keiner Professur zugeordnet
Collections:	SciDok - Der Wissenschaftsserver der Universität des Saarlandes

Files for this record:

File	Description	Size	Format
s41235-018-0148-x.pdf		4,11 MB	Adobe PDF	View/Open

Export: BibTex

This item is licensed under a Creative Commons License