Please use this identifier to cite or link to this item: doi:10.22028/D291-38835
Volltext verfügbar? / Dokumentlieferung
Files for this record:
There are no files associated with this item.
Title: Establishing Annotation Quality in Multi-Label Annotations
Author(s): Marchal, Marian
Scholman, Merel Cleo Johanna UdsID
Yung, Frances Pikyu UdsID
Demberg, Vera UdsID
Editor(s): Scherrer, Yves
Language: English
In:
Title: Proceedings of the Ninth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2022) - the 29th International Conference on Computational Linguistics : October 12-17, 2022, Gyeongju, Republic of Korea
Pages: 3659-3668
Publisher/Platform: ACL
Year of Publication: 2022
Place of publication: [Stroudsburg, PA]
Place of the conference: Gyeongju, Republic of Korea
DDC notations: 400 Language, linguistics
Publikation type: Conference Paper
Abstract: In many linguistic fields requiring annotated data, multiple interpretations of a single item are possible. Multi-label annotations more accurately reflect this possibility. However, allowing for multi-label annotations also affects the chance that two coders agree with each other. Calculating inter-coder agreement for multi-label datasets is therefore not trivial. In the current contribution, we evaluate different metrics for calculating agreement on multi-label annotations: agreement on the intersection of annotated labels, an augmented version of Cohen’s Kappa, and precision, recall and F1. We propose a bootstrapping method to obtain chance agreement for each measure, which allows us to obtain an adjusted agreement coefficient that is more interpretable. We demonstrate how various measures affect estimates of agreement on simulated datasets and present a case study of discourse relation annotations. We also show how the proportion of double labels, and the entropy of the label distribution, influences the measures outlined above and how a bootstrapped adjusted agreement can make agreement measures more comparable across datasets in multi-label scenarios.
Link to this record: urn:nbn:de:bsz:291--ds-388358
hdl:20.500.11880/35019
http://dx.doi.org/10.22028/D291-38835
Date of registration: 26-Jan-2023
Faculty: MI - Fakultät für Mathematik und Informatik
Department: MI - Informatik
Professorship: MI - Prof. Dr. Vera Demberg
Collections:SciDok - Der Wissenschaftsserver der Universität des Saarlandes



Items in SciDok are protected by copyright, with all rights reserved, unless otherwise indicated.