Please use this identifier to cite or link to this item:
doi:10.22028/D291-30979
Title: | The Extended SPaRKy Restaurant Corpus: Designing a Corpus with Variable Information Density |
Author(s): | Howcroft, David M.; Klakow, Dietrich; Demberg, Vera |
Language: | English |
Title: | Situated interaction : 18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017) : Stockholm, Sweden, 20-24 August 2017 : Volume 6 |
Startpage: | 3757 |
Endpage: | 3761 |
Publisher/Platform: | Curran Associates, Inc. |
Year of Publication: | 2017 |
Place of publication: | Red Hook, NY |
Title of the Conference: | Interspeech 2017 |
Place of the conference: | Stockholm, Sweden |
Publication type: | Conference Paper |
Abstract: | Natural language generation (NLG) systems rely on corpora for both hand-crafted approaches in a traditional NLG architecture and for statistical end-to-end (learned) generation systems. Limitations in existing resources, however, make it difficult to develop systems which can vary the linguistic properties of an utterance as needed. For example, when users’ attention is split between a linguistic and a secondary task such as driving, a generation system may need to reduce the information density of an utterance to compensate for the reduction in user attention. We introduce a new corpus in the restaurant recommendation and comparison domain, collected in a paraphrasing paradigm, where subjects wrote texts targeting either a general audience or an elderly family member. This design resulted in a corpus of more than 5000 texts which exhibit a variety of lexical and syntactic choices and differ with respect to average word & sentence length and surprisal. The corpus includes two levels of meaning representation: flat ‘semantic stacks’ for propositional content and Rhetorical Structure Theory (RST) relations between these propositions. |
DOI of the first publication: | 10.21437/Interspeech.2017-1555 |
URL of the first publication: | https://www.isca-speech.org/archive/Interspeech_2017/abstracts/1555.html |
Link to this record: | hdl:20.500.11880/29708; http://dx.doi.org/10.22028/D291-30979 |
ISBN: | 978-1-5108-4876-4 |
Date of registration: | 23-Sep-2020 |
Faculty: | MI - Faculty of Mathematics and Computer Science |
Department: | MI - Computer Science |
Professorship: | MI - Prof. Dr. Vera Demberg |
Collections: | SciDok - The Science Server of Saarland University |
Files for this record:
There are no files associated with this item.
Items in SciDok are protected by copyright, with all rights reserved, unless otherwise indicated.