A comparison of machine learning algorithms for zero-shot cross-lingual phishing detection
dc.contributor.advisor | Hakak, Saqib | |
dc.contributor.advisor | Cook, Paul | |
dc.contributor.author | Staples, Dakota | |
dc.date.accessioned | 2024-02-22T14:54:09Z | |
dc.date.available | 2024-02-22T14:54:09Z | |
dc.date.issued | 2023-08 | |
dc.description.abstract | Phishing is a major problem worldwide. Existing studies have focused mainly on detecting phishing emails in a single language (mostly English). However, detecting phishing emails in multiple languages is challenging due to a lack of datasets. Without ample data from which to learn, models cannot accurately distinguish a benign email from a spam email, resulting in false positives and false negatives. This research compares the performance of numerous machine learning models and transformers using zero-shot learning for multilingual phishing detection. In a zero-shot learning set-up, the model is trained on one language and tested on another. English, French, and Russian emails are used as the training and testing languages. My results show that, on average, XLM-RoBERTa performs best of all the tested models in terms of accuracy, scoring 99% when tested on English, 99% when tested on French, and 95% when tested on Russian. | |
dc.description.copyright | © Dakota Staples, 2023 | |
dc.format.extent | x, 74 | |
dc.format.medium | electronic | |
dc.identifier.oclc | (OCoLC)1425905490 | en |
dc.identifier.other | Thesis 11295 | en |
dc.identifier.uri | https://unbscholar.lib.unb.ca/handle/1882/37716 | |
dc.language.iso | en | |
dc.publisher | University of New Brunswick | |
dc.relation | University of New Brunswick, Faculty of Computer Science | |
dc.rights | http://purl.org/coar/access_right/c_abf2 | |
dc.subject.discipline | Computer Science | |
dc.subject.lcsh | Phishing. | en |
dc.subject.lcsh | Language and languages. | en |
dc.subject.lcsh | Electronic mail messages. | en |
dc.subject.lcsh | Machine learning. | en |
dc.title | A comparison of machine learning algorithms for zero-shot cross-lingual phishing detection | |
dc.type | master thesis | |
oaire.license.condition | other | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | University of New Brunswick | |
thesis.degree.level | masters | |
thesis.degree.name | M.C.S. |