Observations on the dynamic control of an articulatory synthesizer using speech production data

Steiner, Ingmar Michael Augustus

Please use this identifier to cite or link to this item: doi:10.22028/D291-23547

Title:	Observations on the dynamic control of an articulatory synthesizer using speech production data
Other Titles:	Betrachtungen zur dynamischen Steuerung eines artikulatorischen Synthesizers mit Hilfe von Sprachproduktionsdaten
Author(s):	Steiner, Ingmar Michael Augustus
Language:	English
Year of Publication:	2010
SWD key words:	Sprachsynthese (speech synthesis); Sprachproduktion (speech production); Elektromagnetische Artikulographie (electromagnetic articulography); Dynamische Optimierung (dynamic optimization)
Free key words:	articulatory speech synthesis; speech production; electromagnetic articulography; dynamic optimization; vocal tract; gestural score
DDC notations:	400 Language, linguistics
Publication type:	Dissertation
Abstract:	This dissertation explores the automatic generation of gestural-score-based control structures for a three-dimensional articulatory speech synthesizer. The gestural scores are optimized in an articulatory resynthesis paradigm, using a dynamic programming algorithm and a cost function that measures the deviation from a gold standard in the form of natural speech production data. These data were recorded using electromagnetic articulography from the same speaker to whom the synthesizer's vocal tract model had previously been adapted. Future work to create an English voice for the synthesizer and to integrate it into a text-to-speech platform is outlined.
Link to this record:	urn:nbn:de:bsz:291-scidok-32243 hdl:20.500.11880/23603 http://dx.doi.org/10.22028/D291-23547
Advisor:	Barry, William
Date of oral examination:	19-May-2010
Date of registration:	10-Aug-2010
Faculty:	P - Philosophische Fakultät
Department:	P - Sprachwissenschaft und Sprachtechnologie
Former Department:	until summer semester 2016: Fachrichtung 4.7 - Allgemeine Linguistik
Collections:	SciDok - Der Wissenschaftsserver der Universität des Saarlandes

Files for this record:

File	Description	Size	Format
Diss_Steiner_korr.pdf		18.68 MB	Adobe PDF
