Please use this identifier to cite or link to this item: doi:10.22028/D291-42289
Volltext verfügbar? / Dokumentlieferung
Title: Expert-adapted language models improve the fit to reading times
Author(s): Škrjanec, Iza
Broy, Frederik Yannick
Demberg, Vera
Language: English
Title: Procedia Computer Science
Volume: 225
Pages: 3488-3497
Publisher/Platform: Elsevier
Year of Publication: 2023
Place of publication: Amsterdam
Place of the conference: Athens, Greece
Free key words: Eye tracking
Background knowledge
Surprisal
DDC notations: 004 Computer science, internet
400 Language, linguistics
Publikation type: Conference Paper
Abstract: The concept of surprisal refers to the predictability of a word based on its context. Surprisal is known to be predictive of human processing difficulty and is usually estimated by language models. However, because humans differ in their linguistic experience, they also differ in the actual processing difficulty they experience with a given word or sentence. We investigate whether models that are similar to the linguistic experience and background knowledge of a specific group of humans are better at predicting their reading times than a generic language model. We analyze reading times from the PoTeC corpus [15,27] of eye movements from biology and physics experts reading biology and physics texts. We find experts read in-domain texts faster than novices, especially domain-specific terms. Next, we train language models adapted to the biology and physics domains and show that surprisal obtained from these specialized models improves the fit to expert reading times above and beyond a generic language model.
DOI of the first publication: 10.1016/j.procs.2023.10.344
URL of the first publication: https://www.sciencedirect.com/science/article/pii/S1877050923015028
Link to this record: urn:nbn:de:bsz:291--ds-422898
hdl:20.500.11880/37960
http://dx.doi.org/10.22028/D291-42289
ISSN: 1877-0509
Date of registration: 27-Jun-2024
Notes: Procedia Computer Science, Volume 225, 2023, Pages 3488-3497
Faculty: MI - Fakultät für Mathematik und Informatik
Department: MI - Informatik
Professorship: MI - Prof. Dr. Vera Demberg
Collections:SciDok - Der Wissenschaftsserver der Universität des Saarlandes

Files for this record:
There are no files associated with this item.


Items in SciDok are protected by copyright, with all rights reserved, unless otherwise indicated.