Machine learning models are vulnerable to adversarial attacks, in which small
but well-designed perturbations of the input can cause misclassification.
In this paper, we explore, in the setting of classical composite hypothesis
testing, a defense strategy based on the generalized likelihood ratio test
(GLRT), which jointly estimates the class of interest and the adversarial
perturbation. We evaluate the GLRT approach for the special case of binary
hypothesis testing in white Gaussian noise under $\ell_{\infty}$ norm-bounded
adversarial perturbations, a setting for which a minimax strategy optimizing
for the worst-case attack is known. We show that the GLRT approach yields
performance competitive with that of the minimax approach under the worst-case
attack, and observe that it achieves a better robustness-accuracy trade-off under
weaker attacks, depending on the magnitudes of the signal components relative to the
attack budget. We also observe that the GLRT defense generalizes naturally to
more complex models for which optimal minimax classifiers are not known.
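To make the decision rule concrete, the following is a minimal sketch for the
binary Gaussian setting described above, with illustrative notation not taken
from the paper: signal templates $\pm\boldsymbol{\mu}$, attack budget
$\epsilon$, and noise variance $\sigma^{2}$. Under hypothesis $H_{j}$,
$j \in \{+1,-1\}$, the observation is
$\mathbf{y} = j\boldsymbol{\mu} + \mathbf{e} + \mathbf{n}$ with
$\|\mathbf{e}\|_{\infty} \le \epsilon$ and
$\mathbf{n} \sim \mathcal{N}(\mathbf{0}, \sigma^{2}\mathbf{I})$, and the GLRT
maximizes the likelihood jointly over the hypothesis and the perturbation:
\[
\hat{j} \;=\; \arg\max_{j \in \{+1,-1\}} \; \max_{\|\mathbf{e}\|_{\infty} \le \epsilon} p\big(\mathbf{y} \,\big|\, j\boldsymbol{\mu} + \mathbf{e}\big) \;=\; \arg\min_{j \in \{+1,-1\}} \; \min_{\|\mathbf{e}\|_{\infty} \le \epsilon} \big\|\mathbf{y} - j\boldsymbol{\mu} - \mathbf{e}\big\|_{2}^{2}.
\]
The inner minimization decouples across coordinates, since each
$e_{i} \in [-\epsilon, \epsilon]$ can independently move $y_{i}$ toward
$j\mu_{i}$, giving the closed form
\[
\min_{\|\mathbf{e}\|_{\infty} \le \epsilon} \big\|\mathbf{y} - j\boldsymbol{\mu} - \mathbf{e}\big\|_{2}^{2} \;=\; \sum_{i} \big(\max\{|y_{i} - j\mu_{i}| - \epsilon,\, 0\}\big)^{2},
\]
so that, in this sketch, the GLRT amounts to comparing soft-thresholded squared
distances to the two candidate signals.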