Determining if this word is used like that word: predicting usage similarity with supervised and unsupervised approaches

King, Milton

Files

item.pdf (593.74 KB)

Date

2017

Authors

King, Milton

Publisher

University of New Brunswick

Abstract

Determining the meaning of a word in context is an important task for a variety of natural language processing applications, such as translating between languages, summarizing paragraphs, and completing phrases. Usage similarity (USim) is an approach to describing the meaning of a word in context that does not rely on a sense inventory (a set of dictionary-like definitions). Instead, pairs of usages of a target word are rated for their similarity on a scale. In this thesis, we evaluate unsupervised approaches to USim based on embeddings for words, contexts, and sentences, and achieve state-of-the-art results on two USim datasets. We further consider supervised approaches to USim and find that they can improve the performance of our models. We also carry out a more detailed evaluation, examining performance across parts of speech and the change in performance when different features are used. Our models also perform competitively on two word sense induction tasks, which involve clustering instances of a word according to its meaning in context.

URI

https://unbscholar.lib.unb.ca/handle/1882/13204

Collections

Open Theses & Dissertations
