A query-efficient black-box adversarial attack on text classification Deep Neural Networks

dc.contributor.advisor: Ghorbani, Ali A.
dc.contributor.advisor: Lashkari, Arash Habibi
dc.contributor.author: Yadollahi, Mohammad Mehdi
dc.date.accessioned: 2023-09-22T13:07:25Z
dc.date.available: 2023-09-22T13:07:25Z
dc.date.issued: 2022
dc.description.abstract: Recent work has demonstrated that modern text classifiers built on deep neural networks are vulnerable to adversarial attacks. Compared to the image domain, studies on text data remain insufficient, a gap that originates from the particular challenges of the NLP domain. Despite being highly effective, most adversarial attacks in the text domain ignore the overhead they induce on the victim model. In this research, we propose EQFooler, a query-efficient black-box adversarial attack on text data that targets a textual deep neural network while limiting the overhead it produces. The evaluation of our method shows promising results. We demonstrate the impact of keyword extraction methods in generating query-efficient adversarial attacks. Four variants of EQFooler are developed based on different keyword extractors and importance-score strategies. We compare the performance of these variants on four evaluation metrics: original accuracy, adversarial accuracy, change rate, and number of queries. All variants of the proposed attack significantly reduce the accuracy of the targeted models; among them, EQFooler-Rake-MS performs best in terms of adversarial accuracy, change rate, and the number of queries needed. Multiple experiments also compare the proposed method against state-of-the-art adversarial attacks as baselines. The results show that EQFooler is as powerful as these baselines while requiring fewer queries to the victim model. In addition, we study the transferability of the generated adversarial examples; in every transfer setting, at least one of the variants outperforms the baseline.
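
The abstract describes a keyword-extraction-guided, query-budgeted word-substitution attack. As a rough illustration only, and not the thesis's actual EQFooler implementation, the sketch below shows how such a loop might be organized in Python: a frequency-based keyword ranker standing in for RAKE or another extractor, a greedy synonym-substitution step, and an explicit query counter corresponding to the "number of queries" metric. The names victim_predict, get_synonyms, the stopword set, and the ranking heuristic are all hypothetical placeholders.

from collections import Counter
from typing import Callable, Dict, List, Set, Tuple


def rank_keywords(text: str, stopwords: Set[str]) -> List[str]:
    # Stand-in for a keyword extractor such as RAKE: rank content words by
    # frequency so the attack perturbs the likeliest-important words first
    # and therefore spends fewer queries overall.
    words = [w.lower() for w in text.split() if w.lower() not in stopwords]
    return [w for w, _ in Counter(words).most_common()]


def greedy_attack(
    text: str,
    victim_predict: Callable[[str], Dict[str, float]],  # label -> probability (black-box)
    get_synonyms: Callable[[str], List[str]],            # candidate replacements per word
    stopwords: Set[str],
    max_queries: int = 100,
) -> Tuple[str, int]:
    # Replace ranked keywords one at a time with the synonym that most reduces
    # the victim's confidence in its original label, stopping as soon as the
    # label flips or the query budget is exhausted. Returns the perturbed text
    # and the number of queries issued.
    orig_probs = victim_predict(text)
    queries = 1
    orig_label = max(orig_probs, key=orig_probs.get)
    current_conf = orig_probs[orig_label]

    tokens = text.split()
    for keyword in rank_keywords(text, stopwords):
        best_tokens, best_conf = None, current_conf
        for synonym in get_synonyms(keyword):
            if queries >= max_queries:
                return " ".join(tokens), queries          # budget exhausted
            candidate = [synonym if t.lower() == keyword else t for t in tokens]
            probs = victim_predict(" ".join(candidate))
            queries += 1
            if max(probs, key=probs.get) != orig_label:
                return " ".join(candidate), queries       # label flipped: success
            if probs[orig_label] < best_conf:
                best_conf, best_tokens = probs[orig_label], candidate
        if best_tokens is not None:
            tokens, current_conf = best_tokens, best_conf  # keep the best substitution
    return " ".join(tokens), queries

Ranking words before querying is what makes the loop query-efficient: low-impact words are never probed, so the counter that the attack returns stays small compared to exhaustively scoring every token.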
dc.description.copyright: © Mohammad Mehdi Yadollahi, 2022
dc.format.extent: xii, 81
dc.format.medium: electronic
dc.identifier.uri: https://unbscholar.lib.unb.ca/handle/1882/37450
dc.language.iso: en
dc.publisher: University of New Brunswick
dc.rights: http://purl.org/coar/access_right/c_abf2
dc.subject.discipline: Computer Science
dc.title: A query-efficient black-box adversarial attack on text classification Deep Neural Networks
dc.type: master thesis
oaire.license.condition: other
thesis.degree.discipline: Computer Science
thesis.degree.grantor: University of New Brunswick
thesis.degree.level: masters
thesis.degree.name: M.C.S.

Files

Original bundle
Name: Mohammad Mehdi Yadollahi - Thesis.pdf
Size: 3.68 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.13 KB
Description: Item-specific license agreed upon to submission