Please use this identifier to cite or link to this item: doi:10.22028/D291-32066
Volltext verfügbar? / Dokumentlieferung
Title: Differences in Gradient Emotion Perception: Human vs. Alexa Voices
Author(s): Cohn, Michelle
Raveh, Eran
Predeck, Kristin
Gessinger, Iona
Möbius, Bernd
Zellou, Georgia
Language: English
Title: Cognitive intelligence for speech processing : 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) : held online due to Covid-19 : Shanghai, China, 25-29 October 2020
Startpage: 1818
Endpage: 1822
Publisher/Platform: Curran Associates, Inc.
Year of Publication: 2020
Place of publication: Red Hook, NY
Title of the Conference: Interspeech 2020
Place of the conference: Shanghai, China
Publikation type: Conference Paper
Abstract: The present study compares how individuals perceive gradient acoustic realizations of emotion produced by a human voice versus an Amazon Alexa text-to-speech (TTS) voice. We manipulated semantically neutral sentences spoken by both talkers with identical emotional synthesis methods, using three levels of increasing ‘happiness’ (0%, 33%, 66% ‘happier’). On each trial, listeners (native speakers of American English, n=99) rated a given sentence on two scales to assess dimensions of emotion: valence (negative-positive) and arousal (calm-excited). Participants also rated the Alexa voice on several parameters to assess anthropomorphism (e.g., naturalness, human-likeness, etc.). Results showed that the emotion manipulations led to increases in perceived positive valence and excitement. Yet, the effect differed by interlocutor: increasing ‘happiness’ manipulations led to larger changes for the human voice than the Alexa voice. Additionally, we observed individual differences in perceived valence/arousal based on participants’ anthropomorphism scores. Overall, this line of research can speak to theories of computer personification and elucidate our changing relationship with voice-AI technology.
DOI of the first publication: 10.21437/Interspeech.2020-1938
URL of the first publication: https://www.isca-speech.org/archive/Interspeech_2020/abstracts/1938.html
Link to this record: hdl:20.500.11880/30650
http://dx.doi.org/10.22028/D291-32066
ISBN: 978-1-7138-2069-7
Date of registration: 17-Feb-2021
Notes: Volume 3
Faculty: P - Philosophische Fakultät
Department: P - Sprachwissenschaft und Sprachtechnologie
Professorship: P - Prof. Dr. Bernd Möbius
Collections:SciDok - Der Wissenschaftsserver der Universität des Saarlandes

Files for this record:
There are no files associated with this item.


Items in SciDok are protected by copyright, with all rights reserved, unless otherwise indicated.