Please use this identifier to cite or link to this item: doi:10.22028/D291-30979
Volltext verfügbar? / Dokumentlieferung
Title: The Extended SPaRKy Restaurant Corpus: Designing a Corpus with Variable Information Density
Author(s): Howcroft, David M.
Klakow, Dietrich
Demberg, Vera
Language: English
Title: Situated interaction : 18th Annual Conference of the International Speech Communication Association
Startpage: 3757
Endpage: 3761
Publisher/Platform: Curran Associates, Inc.
Year of Publication: 2017
Place of publication: Red Hook, NY
Title of the Conference: Interspeech 2017
Place of the conference: Stockholm, Sweden
Publikation type: Conference Paper
Abstract: Natural language generation (NLG) systems rely on corpora for both hand-crafted approaches in a traditional NLG architecture and for statistical end-to-end (learned) generation systems. Limitations in existing resources, however, make it difficult to develop systems which can vary the linguistic properties of an utterance as needed. For example, when users’ attention is split between a linguistic and a secondary task such as driving, a generation system may need to reduce the information density of an utterance to compensate for the reduction in user attention. We introduce a new corpus in the restaurant recommendation and comparison domain, collected in a paraphrasing paradigm, where subjects wrote texts targeting either a general audience or an elderly family member. This design resulted in a corpus of more than 5000 texts which exhibit a variety of lexical and syntactic choices and differ with respect to average word & sentence length and surprisal. The corpus includes two levels of meaning representation: flat ‘semantic stacks’ for propositional content and Rhetorical Structure Theory (RST) relations between these propositions.
DOI of the first publication: 10.21437/Interspeech.2017-1555
URL of the first publication: https://www.isca-speech.org/archive/Interspeech_2017/abstracts/1555.html
Link to this record: hdl:20.500.11880/29708
http://dx.doi.org/10.22028/D291-30979
ISBN: 978-1-5108-4876-4
Date of registration: 23-Sep-2020
Notes: volume 6
Faculty: MI - Fakultät für Mathematik und Informatik
Department: MI - Informatik
Professorship: MI - Prof. Dr. Vera Demberg
Collections:UniBib – Die Universitätsbibliographie

Files for this record:
There are no files associated with this item.


Items in SciDok are protected by copyright, with all rights reserved, unless otherwise indicated.