Please use this identifier to cite or link to this resource:
doi:10.22028/D291-40171
Title: | On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers |
Author(s): | Jágrová, Klára; Hedderich, Michael; Mosbach, Marius; Avgustinova, Tania; Klakow, Dietrich |
Language: | English |
Journal title: | Frontiers in Psychology |
Volume: | 12 |
Publisher/Platform: | Frontiers |
Year of publication: | 2021 |
Free keywords: | intercomprehension; predictive context; Polish; Czech; context-aware language models; Long Short-Term Memory; transformer; surprisal |
DDC subject group: | 400 Language, linguistics |
Document type: | Journal article |
Abstract: | This contribution seeks to provide a rational probabilistic explanation for the intelligibility of words in a genetically related language that is unknown to the reader, a phenomenon referred to as intercomprehension. In this research domain, linguistic distance, among other factors, has been shown to correlate well with the mutual intelligibility of individual words. However, the role of context in the intelligibility of target words in sentences has been the subject of very few studies. To address this, we analyze data from web-based experiments in which Czech (CS) respondents were asked to translate highly predictable target words at the final position of Polish sentences. We compare correlations of target word intelligibility with data from 3-gram language models (LMs) to their correlations with data obtained from context-aware LMs. More specifically, we evaluate two context-aware LM architectures: Long Short-Term Memory networks (LSTMs), which can, theoretically, take infinitely long-distance dependencies into account, and Transformer-based LMs, which can access the whole input sequence at the same time. We investigate how their use of context affects surprisal and its correlation with intelligibility. |
DOI of first publication: | 10.3389/fpsyg.2021.662277 |
URL of first publication: | https://www.frontiersin.org/articles/10.3389/fpsyg.2021.662277 |
Link to this record: | urn:nbn:de:bsz:291--ds-401716 hdl:20.500.11880/36145 http://dx.doi.org/10.22028/D291-40171 |
ISSN: | 1664-1078 |
Date of entry: | 21-Jul-2023 |
Description of the related object: | Supplementary Material |
Related object: | https://ndownloader.figstatic.com/files/28652820 |
Faculty: | P - Philosophische Fakultät |
Department: | P - Sprachwissenschaft und Sprachtechnologie |
Professorship: | P - Prof. Dr. Dietrich Klakow |
Collection: | SciDok - Der Wissenschaftsserver der Universität des Saarlandes |
Files for this record:
File | Description | Size | Format | |
---|---|---|---|---|
fpsyg-12-662277.pdf | | 1.75 MB | Adobe PDF | View/Open |
This resource was published under the following copyright terms: Creative Commons license