Hard-label based Small Query Black-box Adversarial Attack (2403.06014v1)
Abstract: We consider the hard label based black box adversarial attack setting which solely observes predicted classes from the target model. Most of the attack methods in this setting suffer from impractical number of queries required to achieve a successful attack. One approach to tackle this drawback is utilising the adversarial transferability between white box surrogate models and black box target model. However, the majority of the methods adopting this approach are soft label based to take the full advantage of zeroth order optimisation. Unlike mainstream methods, we propose a new practical setting of hard label based attack with an optimisation process guided by a pretrained surrogate model. Experiments show the proposed method significantly improves the query efficiency of the hard label based black-box attack across various target model architectures. We find the proposed method achieves approximately 5 times higher attack success rate compared to the benchmarks, especially at the small query budgets as 100 and 250.
- https://github.com/huyvnphan.
- https://github.com/u39kun/deep-learning-benchmark. Technical report.
- https://pytorch.org/vision/stable/models.html.
- Practical black-box attacks on deep neural networks using efficient query mechanisms. The European Conference on Computer Vision, 2018.
- Decision based adversarial attacks: Reliable attacks against black-box machine learning models. International Conference on Learning Representation, 2018.
- Guessing smart: Biased sampling for efficient black-box adversarial attacks. arXiv preprent arXiv:1812.09803v3, 2019.
- On evaluating adversarial robustness. arXiv preprint arXiv:1902.06705, 2019.
- Hopskipjumpattack: A query-efficient decision-based attack. IEEE Symposium on Security and Privacy, 2020.
- Zoo: Zeroth order optimisation based black-box attacks to deep neural networks without training substitute models. ACM Workshop on Artificial Inteligence and Security, 2017.
- Sign-opt: A query-efficient hard-label adversarial attack. International Conference on Learning Representations, 2020.
- Imagenet: A large-scale hierarchical image database. IEEE Conference on Computer Vision and Pattern Recognition, 2009.
- ”deepfool”: A simple and accurate method to fool deep neural networks. IEEE Conference on Computer Vision and Pattern Recognition, 2016.
- Query-efficient black-box adversarial attacks guided by a transfer-based prior. arXiv:2203.06560v1, 2022.
- Boosting adversarial attacks with momentum. IEEE Conference on Computer Vision and Pattern Recognition, 2018.
- Evading defenses to transferable adversarial examples by translation-invariant attacks. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.
- Empirical comparison of ”hard” and ”soft” label propagation for relational classification. The 17th International Conference on Inductive Logic Programming, 2007.
- Explaining and harnessing adversarial examples. International Conference on Learning Representations, 2015.
- Simple black-box adversarial attacks. International Conference on Learning Representation, 2019.
- Black-box adversarial attack with transferable model-based embedding. International Conference on Learning Representations, 2020.
- Black-box adversarial attacks with limited queries and information. International Conference on Machine Learning, 2018.
- Prior convictions: Black-box adversarial attacks with bandits and priors. International Conference on Learning Representations, 2019.
- Adam: A method for stochastic optimizastion. International Conference on Learning Representations, 2015.
- Learning multiple layers of features from tiny images. Techinical Report, Citeseer, 2009.
- Qeba: Query-efficient boundary-based blackbox attack. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.
- Nesterov accelerated gradient and scale invariance for adversarial attacks. International Conference on Learning Representations, 2020.
- Towards deep learning models resistant to adversarial attacks. International Conference on Learning Representations, 2018.
- Transferability in machine learning: from phenomena to black-box attacks using adversarial samples. arXiv preprint arXiv:1605.07277, 2016.
- Practical black-box attacks against machine learning. Proceedings of the 2017 ACM on Asia conference on computer and communicaitons security, 2017.
- Intriguing properties of neural networks. International Conference on Learning Representations, 2013.
- The space of transferable adversarial examples. arXiv preprint arXiv:1704.03453, 2017.
- Autozoom: Autoencoder-based zeroth order optimization method for attacking black-box neural networks. The Thirty-Third AAAI Conference on Artificial Intelligence, 2019.
- Enhancing the transferability of adversarial attacks through variance tuning. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021.
- Improving transferability of adversarial examples with input diversity. IEEE Conference on Computer Vision and Pattern Recognition, 2019.
- Learning black-box attackers with transferable priors and query feedback. Conference on Neural Information Processing System, 2020.
- Adversarial eigen attack on black-box models. arXiv:2009.00097, 2020.