Word meaning in context : a probabilistic model and its application to question answering

Dinu, Georgiana

Please use this identifier to cite or link to this item: doi:10.22028/D291-23617

Title:	Word meaning in context : a probabilistic model and its application to question answering
Author(s):	Dinu, Georgiana
Language:	English
Year of Publication:	2011
SWD key words:	Computerlinguistik; Semantik; Maschinelles Lernen
Free key words:	computational linguistics; semantics; machine learning
DDC notations:	400 Language, linguistics
Publication type:	Dissertation
Abstract:	The need for assessing similarity in meaning is central to most language technology applications. Distributional methods are robust, unsupervised methods which achieve high performance on this task. These methods measure the similarity of word types based solely on patterns of word occurrences in large corpora, following the intuition that similar words occur in similar contexts. As most Natural Language Processing (NLP) applications deal with disambiguated words, that is, words occurring in context, rather than with word types, the question of adapting distributional methods to compute sense-specific or context-sensitive similarities has gained increasing attention in recent work. This thesis focuses on the development and applications of distributional methods for context-sensitive similarity. The contribution made is twofold: the main part of the thesis proposes and tests a new framework for computing similarity in context, while the second part investigates the application of distributional paraphrasing to the task of question answering.
Link to this record:	urn:nbn:de:bsz:291-scidok-50031 hdl:20.500.11880/23673 http://dx.doi.org/10.22028/D291-23617
Advisor:	Pinkal, Manfred
Date of oral examination:	22-Dec-2011
Date of registration:	10-Dec-2012
Faculty:	P - Philosophische Fakultät
Department:	P - Sprachwissenschaft und Sprachtechnologie
Former Department:	until summer semester 2016: Fachrichtung 4.7 - Allgemeine Linguistik
Collections:	SciDok - Der Wissenschaftsserver der Universität des Saarlandes

Files for this record:

File	Description	Size	Format
main.pdf		3.02 MB	Adobe PDF
