Please use this identifier to cite or link to this item:
Volltext verfügbar? / Dokumentlieferung
doi:10.22028/D291-32066
Title: | Differences in Gradient Emotion Perception: Human vs. Alexa Voices |
Author(s): | Cohn, Michelle Raveh, Eran Predeck, Kristin Gessinger, Iona Möbius, Bernd Zellou, Georgia |
Language: | English |
Title: | Cognitive intelligence for speech processing : 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) : held online due to Covid-19 : Shanghai, China, 25-29 October 2020 |
Startpage: | 1818 |
Endpage: | 1822 |
Publisher/Platform: | Curran Associates, Inc. |
Year of Publication: | 2020 |
Place of publication: | Red Hook, NY |
Title of the Conference: | Interspeech 2020 |
Place of the conference: | Shanghai, China |
Publikation type: | Conference Paper |
Abstract: | The present study compares how individuals perceive gradient acoustic realizations of emotion produced by a human voice versus an Amazon Alexa text-to-speech (TTS) voice. We manipulated semantically neutral sentences spoken by both talkers with identical emotional synthesis methods, using three levels of increasing ‘happiness’ (0%, 33%, 66% ‘happier’). On each trial, listeners (native speakers of American English, n=99) rated a given sentence on two scales to assess dimensions of emotion: valence (negative-positive) and arousal (calm-excited). Participants also rated the Alexa voice on several parameters to assess anthropomorphism (e.g., naturalness, human-likeness, etc.). Results showed that the emotion manipulations led to increases in perceived positive valence and excitement. Yet, the effect differed by interlocutor: increasing ‘happiness’ manipulations led to larger changes for the human voice than the Alexa voice. Additionally, we observed individual differences in perceived valence/arousal based on participants’ anthropomorphism scores. Overall, this line of research can speak to theories of computer personification and elucidate our changing relationship with voice-AI technology. |
DOI of the first publication: | 10.21437/Interspeech.2020-1938 |
URL of the first publication: | https://www.isca-speech.org/archive/Interspeech_2020/abstracts/1938.html |
Link to this record: | hdl:20.500.11880/30650 http://dx.doi.org/10.22028/D291-32066 |
ISBN: | 978-1-7138-2069-7 |
Date of registration: | 17-Feb-2021 |
Notes: | Volume 3 |
Faculty: | P - Philosophische Fakultät |
Department: | P - Sprachwissenschaft und Sprachtechnologie |
Professorship: | P - Prof. Dr. Bernd Möbius |
Collections: | SciDok - Der Wissenschaftsserver der Universität des Saarlandes |
Files for this record:
There are no files associated with this item.
Items in SciDok are protected by copyright, with all rights reserved, unless otherwise indicated.