Adversarially Robust Training through Structured Gradient Regularization

TOP 文献データベース Adversarially Robust Training through Structured Gradient Regularization

Computing Research Repository (CoRR)

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/1805.08736

PDF

https://arxiv.org/pdf/1805.08736

文献情報

作者: Kevin Roth,Aurelien Lucchi,Sebastian Nowozin,Thomas Hofmann
公開日: 2025-3-25
所属機関: Department of Computer Science, ETH Zürich
所属の国: Switzerland
会議名: Computing Research Repository (CoRR)

AIにより推定されたラベル

モデルの堅牢性敵対的攻撃検出損失関数

Abstract

We propose a novel data-dependent structured gradient regularizer to increase the robustness of neural networks vis-a-vis adversarial perturbations. Our regularizer can be derived as a controlled approximation from first principles, leveraging the fundamental link between training with noise and regularization. It adds very little computational overhead during learning and is simple to implement generically in standard deep learning frameworks. Our experiments provide strong evidence that structured gradient regularization can act as an effective first line of defense against attacks based on low-level signal corruption.