Please use this identifier to cite or link to this item: doi:10.22028/D291-40171
Title: On the Correlation of Context-Aware Language Models With the Intelligibility of Polish Target Words to Czech Readers
Author(s): Jágrová, Klára
Hedderich, Michael
Mosbach, Marius
Avgustinova, Tania
Klakow, Dietrich
Language: English
Title: Frontiers in Psychology
Volume: 12
Publisher/Platform: Frontiers
Year of Publication: 2021
Free key words: intercomprehension
predictive context
Polish
Czech
context-aware language models
Long Short-Term Memory
transformer
surprisal
DDC notations: 400 Language, linguistics
Publikation type: Journal Article
Abstract: This contribution seeks to provide a rational probabilistic explanation for the intelligibility of words in a genetically related language that is unknown to the reader, a phenomenon referred to as intercomprehension. In this research domain, linguistic distance, among other factors, was proved to correlate well with the mutual intelligibility of individual words. However, the role of context for the intelligibility of target words in sentences was subject to very few studies. To address this, we analyze data from web-based experiments in which Czech (CS) respondents were asked to translate highly predictable target words at the final position of Polish sentences. We compare correlations of target word intelligibility with data from 3-g language models (LMs) to their correlations with data obtained from context-aware LMs. More specifically, we evaluate two context-aware LM architectures: Long Short-Term Memory (LSTMs) that can, theoretically, take infinitely long-distance dependencies into account and Transformer-based LMs which can access the whole input sequence at the same time. We investigate how their use of context affects surprisal and its correlation with intelligibility.
DOI of the first publication: 10.3389/fpsyg.2021.662277
URL of the first publication: https://www.frontiersin.org/articles/10.3389/fpsyg.2021.662277
Link to this record: urn:nbn:de:bsz:291--ds-401716
hdl:20.500.11880/36145
http://dx.doi.org/10.22028/D291-40171
ISSN: 1664-1078
Date of registration: 21-Jul-2023
Description of the related object: Supplementary Material
Related object: https://ndownloader.figstatic.com/files/28652820
Faculty: P - Philosophische Fakultät
Department: P - Sprachwissenschaft und Sprachtechnologie
Professorship: P - Prof. Dr. Dietrich Klakow
Collections:SciDok - Der Wissenschaftsserver der Universität des Saarlandes

Files for this record:
File Description SizeFormat 
fpsyg-12-662277.pdf1,75 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons