A sarcasm detection framework in Twitter and blog posts based on varied range of feature sets
dc.contributor.advisor | Ghorbani, Ali | |
dc.contributor.author | Minaee, Hamed | |
dc.date.accessioned | 2023-03-01T16:46:25Z | |
dc.date.available | 2023-03-01T16:46:25Z | |
dc.date.issued | 2016 | |
dc.date.updated | 2022-11-08T00:00:00Z | |
dc.description.abstract | This thesis addresses the problem of sarcasm detection with a framework designed to detect sarcastic blog and microblog posts. The framework consists of two components, one for long text (blog posts) and one for short text (tweets). Each component comprises several subcomponents, including a crawler, preprocessing, and classification. Long-text sarcasm detection is a two-step process, and each step uses feature sets together with different classifiers. These feature sets are used to analyze each blog post as a whole as well as each sentence in isolation. In the first step, a Scoring Component classifies the documents as sarcastic or non-sarcastic. A Decision Tree is then applied to find the sarcastic sentences in each sarcastic document. Document-level sarcasm detection achieved a precision of 75.7%, a strong result given the difficulty of sarcasm detection. In the short-text component, a Decision Tree classifies tweets as sarcastic or non-sarcastic. This component achieved a precision of 86.6%, which is very good considering the difficulty of sarcasm detection and the inherent complexity of Twitter text. | |
dc.description.copyright | © Hamed Minaee, 2016 | |
dc.description.note | M.C.S. University of New Brunswick, Faculty of Computer Science, 2016. | |
dc.format | text/xml | |
dc.format.extent | viii, 92 pages | |
dc.format.medium | electronic | |
dc.identifier.oclc | OCLC #1350487777 | |
dc.identifier.other | Thesis 9964 | |
dc.identifier.uri | https://unbscholar.lib.unb.ca/handle/1882/14461 | |
dc.language.iso | en_CA | |
dc.publisher | University of New Brunswick | |
dc.rights | http://purl.org/coar/access_right/c_abf2 | |
dc.subject.discipline | Computer Science | |
dc.subject.lcsh | Twitter -- Sarcasm detection. | |
dc.subject.lcsh | Blog authorship -- Sarcasm detection. | |
dc.subject.lcsh | Social media -- Authorship. | |
dc.subject.lcsh | Irony -- Social aspects. | |
dc.subject.lcsh | Text data mining. | |
dc.title | A sarcasm detection framework in Twitter and blog posts based on varied range of feature sets | |
dc.type | master thesis | |
thesis.degree.discipline | Computer Science | |
thesis.degree.fullname | Master of Computer Science | |
thesis.degree.grantor | University of New Brunswick | |
thesis.degree.level | masters | |
thesis.degree.name | M.C.S. |