That ain’t how I speak: Personalizing natural language processing

dc.contributor.advisor: Cook, Paul
dc.contributor.author: King, Milton
dc.date.accessioned: 2023-09-18T13:28:57Z
dc.date.available: 2023-09-18T13:28:57Z
dc.date.issued: 2021-10
dc.description.abstract: Natural language processing (NLP) involves automatically analyzing text written by human authors. Each person develops their own use of a language, known as an idiolect, which can result in poor performance from generic NLP systems. Ideally, each person would have a personalized system tailored to them. In this thesis, I demonstrate the potential benefits of personalizing systems in three NLP tasks: language modeling (estimating the probability of a sequence of words), authorship verification (determining whether a document belongs to a specific person), and word sense disambiguation (assigning a dictionary-like meaning to a word in context). Personalization in these tasks has not been widely studied and, to the best of my knowledge, this is the first work to consider personalization for word sense disambiguation, for which I design a novel dataset. For each task, I show the increase in performance that the proposed personalized models achieve over state-of-the-art models. The experiments in this thesis are designed without consideration of people's demographics, and all personalized methods require relatively small amounts of text from an individual. These two criteria are respected to ensure the personalized methods work well for each individual regardless of their demographics or the amount of text they have authored.
dc.description.copyright: ©Milton King, 2021
dc.format.extent: viii, 103
dc.format.medium: electronic
dc.identifier.oclc: (OCoLC)1417605777 [en]
dc.identifier.other: Thesis 10954 [en]
dc.identifier.uri: https://unbscholar.lib.unb.ca/handle/1882/37406
dc.language.iso: en
dc.publisher: University of New Brunswick
dc.rights: http://purl.org/coar/access_right/c_abf2
dc.subject.discipline: Computer Science
dc.subject.lcsh: Natural language processing (Computer science) [en]
dc.subject.lcsh: Computational linguistics. [en]
dc.subject.lcsh: Semantics--Data processing. [en]
dc.title: That ain’t how I speak: Personalizing natural language processing
dc.type: doctoral thesis
oaire.license.condition: other
thesis.degree.discipline: Computer Science
thesis.degree.grantor: University of New Brunswick
thesis.degree.level: doctorate
thesis.degree.name: Ph.D.

Files

Original bundle
Name: Milton King - Dissertation.pdf
Size: 3.42 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.13 KB
Format: Item-specific license agreed upon to submission