Please use this identifier to cite or link to this item:
Volltext verfügbar? / Dokumentlieferung
doi:10.22028/D291-32058
Title: | Shadowing Synthesized Speech — Segmental Analysis of Phonetic Convergence |
Author(s): | Gessinger, Iona Raveh, Eran Le Maguer, Sébastien Möbius, Bernd Steiner, Ingmar |
Language: | English |
Title: | Situated interaction : 18th Annual Conference of the International Speech Communication Association |
Startpage: | 3797 |
Endpage: | 3801 |
Publisher/Platform: | Curran Associates, Inc. |
Year of Publication: | 2018 |
Place of publication: | Red Hook, NY |
Title of the Conference: | Interspeech 2017 |
Place of the conference: | Stockholm, Sweden |
Publikation type: | Conference Paper |
Abstract: | To shed light on the question whether humans converge phonetically to synthesized speech, a shadowing experiment was conducted using three different types of stimuli — natural speaker, diphone synthesis, and HMM synthesis. Three segment-level phonetic features of German that are well-known to vary across native speakers were examined. The first feature triggered convergence in roughly one third of the cases for all stimulus types. The second feature showed generally a small amount of convergence, which may be due to the nature of the feature itself. Still the effect was strongest for the natural stimuli, followed by the HMM stimuli and weakest for the diphone stimuli. The effect of the third feature was clearly observable for the natural stimuli and less pronounced in the synthetic stimuli. This is presumably a result of the partly insufficient perceptibility of this target feature in the synthetic stimuli and demonstrates the necessity of gaining fine-grained control over the synthesis output, should it be intended to implement capabilities of phonetic convergence on the segmental level in spoken dialogue systems. |
DOI of the first publication: | 10.21437/Interspeech.2017-1433 |
URL of the first publication: | https://www.isca-speech.org/archive/Interspeech_2017/abstracts/1433.html |
Link to this record: | hdl:20.500.11880/29623 http://dx.doi.org/10.22028/D291-32058 |
ISBN: | 978-1-5108-4876-4 |
Date of registration: | 1-Sep-2020 |
Notes: | volume 6 |
Faculty: | P - Philosophische Fakultät |
Department: | P - Sprachwissenschaft und Sprachtechnologie |
Professorship: | P - Prof. Dr. Bernd Möbius |
Collections: | SciDok - Der Wissenschaftsserver der Universität des Saarlandes |
Files for this record:
There are no files associated with this item.
Items in SciDok are protected by copyright, with all rights reserved, unless otherwise indicated.