A sarcasm detection framework in Twitter and blog posts based on varied range of feature sets

dc.contributor.advisorGhorbani, Ali
dc.contributor.authorMinaee, Hamed
dc.date.accessioned2023-03-01T16:46:25Z
dc.date.available2023-03-01T16:46:25Z
dc.date.issued2016
dc.date.updated2022-11-08T00:00:00Z
dc.description.abstractThis thesis addresses the problem of sarcasm detection by using a framework which is designed to effectively detect sarcastic blog and microblog posts. This framework consists of two components. Each component consists of different sub components including crawler, preprocessing and classification. The long text sarcasm detection classification consists of a two-step process, in each step, we use some feature sets along with different classifiers. These feature sets are utilized to analyze each blog post as a whole in addition to every isolated sentence. In the first step, Scoring Component is used to classify the documents into groups of sarcastic and non-sarcastic. Also in order to find sarcastic sentences in each sarcastic document, Decision Tree is applied. Considering the difficulties in sarcasm detection, the Document Level Sarcasm Detection achieved an outstanding result: 75.7% Precision rate. In the Short Text, Decision Tree is applied in order to classify the tweet texts into groups of sarcastic and non-sarcastic. Precision of 86.6% is obtained for this component which is very good considering the difficulty of sarcasm detection as well as inherent complexity of Twitter texts.
dc.description.copyright© Hamed Minaee, 2016
dc.description.noteM.C.S. University of New Brunswick, Faculty of Computer Science, 2016.
dc.formattext/xml
dc.format.extentviii, 92 pages
dc.format.mediumelectronic
dc.identifier.oclcOCLC #1350487777
dc.identifier.otherThesis 9964
dc.identifier.urihttps://unbscholar.lib.unb.ca/handle/1882/14461
dc.language.isoen_CA
dc.publisherUniversity of New Brunswick
dc.rightshttp://purl.org/coar/access_right/c_abf2
dc.subject.disciplineComputer Science
dc.subject.lcshTwitter -- Sarcasm detection.
dc.subject.lcshBlog authorship -- Sarcasm detection.
dc.subject.lcshSocial media -- Authorship.
dc.subject.lcshIrony -- Social aspects.
dc.subject.lcshText data mining.
dc.titleA sarcasm detection framework in Twitter and blog posts based on varied range of feature sets
dc.typemaster thesis
thesis.degree.disciplineComputer Science
thesis.degree.fullnameMaster of Computer Science
thesis.degree.grantorUniversity of New Brunswick
thesis.degree.levelmasters
thesis.degree.nameM.C.S.

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
item.pdf
Size:
444.3 KB
Format:
Adobe Portable Document Format