Defending against Adversarial Images using Basis Functions Transformations

TOP 文献データベース Defending against Adversarial Images using Basis Functions Transformations

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1803.10840

PDF

https://arxiv.org/pdf/1803.10840

文献情報

作者: Uri Shaham,James Garritano,Yutaro Yamada,Ethan Weinberger,Alex Cloninger,Xiuyuan Cheng,Kelly Stanton,Yuval Kluger
公開日: 2025-3-26
所属機関: Yale University
所属の国: United States of America
会議名

AIにより推定されたラベル

敵対的攻撃モデルの頑健性保証ウォーターマーキング

Abstract

We study the effectiveness of various approaches that defend against adversarial attacks on deep networks via manipulations based on basis function representations of images. Specifically, we experiment with low-pass filtering, PCA, JPEG compression, low resolution wavelet approximation, and soft-thresholding. We evaluate these defense techniques using three types of popular attacks in black, gray and white-box settings. Our results show JPEG compression tends to outperform the other tested defenses in most of the settings considered, in addition to soft-thresholding, which performs well in specific cases, and yields a more mild decrease in accuracy on benign examples. In addition, we also mathematically derive a novel white-box attack in which the adversarial perturbation is composed only of terms corresponding a to pre-determined subset of the basis functions, of which a "low frequency attack" is a special case.