Please use this identifier to cite or link to this item:
Volltext verfügbar? / Dokumentlieferung
doi:10.22028/D291-42289
Title: | Expert-adapted language models improve the fit to reading times |
Author(s): | Škrjanec, Iza Broy, Frederik Yannick Demberg, Vera |
Language: | English |
Title: | Procedia Computer Science |
Volume: | 225 |
Pages: | 3488-3497 |
Publisher/Platform: | Elsevier |
Year of Publication: | 2023 |
Place of publication: | Amsterdam |
Place of the conference: | Athens, Greece |
Free key words: | Eye tracking Background knowledge Surprisal |
DDC notations: | 004 Computer science, internet 400 Language, linguistics |
Publikation type: | Conference Paper |
Abstract: | The concept of surprisal refers to the predictability of a word based on its context. Surprisal is known to be predictive of human processing difficulty and is usually estimated by language models. However, because humans differ in their linguistic experience, they also differ in the actual processing difficulty they experience with a given word or sentence. We investigate whether models that are similar to the linguistic experience and background knowledge of a specific group of humans are better at predicting their reading times than a generic language model. We analyze reading times from the PoTeC corpus [15,27] of eye movements from biology and physics experts reading biology and physics texts. We find experts read in-domain texts faster than novices, especially domain-specific terms. Next, we train language models adapted to the biology and physics domains and show that surprisal obtained from these specialized models improves the fit to expert reading times above and beyond a generic language model. |
DOI of the first publication: | 10.1016/j.procs.2023.10.344 |
URL of the first publication: | https://www.sciencedirect.com/science/article/pii/S1877050923015028 |
Link to this record: | urn:nbn:de:bsz:291--ds-422898 hdl:20.500.11880/37960 http://dx.doi.org/10.22028/D291-42289 |
ISSN: | 1877-0509 |
Date of registration: | 27-Jun-2024 |
Notes: | Procedia Computer Science, Volume 225, 2023, Pages 3488-3497 |
Faculty: | MI - Fakultät für Mathematik und Informatik |
Department: | MI - Informatik |
Professorship: | MI - Prof. Dr. Vera Demberg |
Collections: | SciDok - Der Wissenschaftsserver der Universität des Saarlandes |
Files for this record:
There are no files associated with this item.
Items in SciDok are protected by copyright, with all rights reserved, unless otherwise indicated.