AIセキュリティポータル K Program
Robust Image Classification: Defensive Strategies against FGSM and PGD Adversarial Attacks
Share
Abstract
Adversarial attacks, particularly the Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) pose significant threats to the robustness of deep learning models in image classification. This paper explores and refines defense mechanisms against these attacks to enhance the resilience of neural networks. We employ a combination of adversarial training and innovative preprocessing techniques, aiming to mitigate the impact of adversarial perturbations. Our methodology involves modifying input data before classification and investigating different model architectures and training strategies. Through rigorous evaluation of benchmark datasets, we demonstrate the effectiveness of our approach in defending against FGSM and PGD attacks. Our results show substantial improvements in model robustness compared to baseline methods, highlighting the potential of our defense strategies in real-world applications. This study contributes to the ongoing efforts to develop secure and reliable machine learning systems, offering practical insights and paving the way for future research in adversarial defense. By bridging theoretical advancements and practical implementation, we aim to enhance the trustworthiness of AI applications in safety-critical domains.
Explaining and harnessing adversarial examples
I. J. Goodfellow, J. Shlens, C. Szegedy
Published: 2015
Towards Deep Learning Models Resistant to Adversarial Attacks
Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, Adrian Vladu
Published: 2017.6.20
Distillation as a Defense to Adversarial Perturbations against Deep Neural Networks
Nicolas Papernot, Patrick McDaniel, Xi Wu, Somesh Jha, Ananthram Swami
Published: 2015.11.14
Adversarial attacks on image classification models: Analysis and defense
J. Sen, A. Sen, A. Chatterjee
Published: 2023
Adversarial examples are not easily detected: Bypassing ten detection methods
N. Carlini, D. Wagner
Published: 2017
Adversarial attack vulnerability of medical image analysis systems: Unexplored factors
Gerda Bortsova, Cristina González-Gonzalo, Suzanne C Wetstein, Florian Dubost, Ioannis Katramados, Laurens Hogeweg, Bart Liefers, Bram van Ginneken, Josien PW Pluim, Mitko Veta
Published: 2021
A block gray adversarial attack method for image classification neural network
C. Li, C. Fan, J. Zhang, C. Li, Y. Teng
Published: 2022
On the robustness of large multimodal models against image adversarial attacks
Xuanming Cui, Alejandro Aparcedo, Young Kyun Jang, Ser-Nam Lim
Published: 2023
Ensemble adversarial training: Attacks and defenses
Tramer, F., Kurakin, A., Papernot, N., Goodfellow, I. J., Boneh, D., McDaniel, P. D.
Published: 2018
Adversarial attack versus a bio-inspired defensive method for image classification
O. Garcia-Porras, S. Salazar-Colores, E.U. Moya-Sanchez, A. Sanchez-Perez
Published: 2023
Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples
Anish Athalye, Nicholas Carlini, David Wagner
Published: 2018
Adversarial attacks and defenses in image classification: A practical perspective
Y. Chen, M. Zhang, J. Li, X. Kuang
Published: 2022
Defense against adversarial attacks using image label and pixel guided sparse denoiser
M. Li, C. Cao
Published: 2022
Feature squeezing: Detecting adversarial examples in deep neural networks
Xu, W., Evans, D., Qi, Y.
Published: 2018
GAN-based classifier protection against adversarial attacks
S. Liu, M. Shao, X. Liu
Published: 2020
Proc. of the 36th International Conf on Machine Learning
J. Cohen, E. Rosenfeld, Z. Kolter
Published: 2019
Defense mechanism against adversarial attacks using density-based representation of images
Y.-T. Huang, W.-H. Liao, C.-W. Huang
Published: 2021
Adversarial attacks and defenses in deep learning
K. Ren, T. Zheng, Z. Qin, X. Liu
Published: 2020
Towards robust image classification using sequential attention models
D. Zoran, M. Chrzanowski, P.-S. Huang, S. Gowal, A. Mott, P. Kohli
Published: 2020
Gradient-based learning applied to document recognition
Y. Lecun, L. Bottou, Y. Bengio, P. Haffner
Published: 1998
Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms
H. Xiao, K. Rasul, R. Vollgraf
Published: 2017
Very deep convolutional networks for large-scale image recognition
K. Simonyan, A. Zisserman
Published: 2015
Towards Evaluating the Robustness of Neural Networks
Nicholas Carlini, David Wagner
Published: 2016.8.17
Share